September 13, 2024Open Access

CS-Eval—A Concise Benchmark for Evaluating the Security Risks of Large Language Models

Key Points

Key points are not available for this paper at this time.

Abstract

Large language models (LLMs) are essential to the field of natural language processing, and as their applications expand, security risks have become increasingly prominent. This paper introduces a novel benchmark for evaluating LLMs security, termed CS-Eval, designed to effectively assess the models ability to address vulnerabilities. CS-Eval targets seven key security risks: ethical dilemmas, marginal topics, error detection, detailed event handling, cognitive bias, logical reasoning, and privacy identification, and establishes a Multi-Security Hazard Dataset (MSHD). The evaluated models include GPT-4o, Llama-3-70B, Claude-3-Opus, ERNIE-4.0, Abab-6.5, Qwen1.5-110B, Gemini-1.5-Pro, Doubao-Pro, SenseChat-V5, and GLM-4. We analyzed each models performance in relation to these security risks and provided recommendations for improvement. Experimental results demonstrate varying levels of effectiveness across models, with GPT-4o exhibiting the best overall performance. Moreover, the relationship between security enhancement and model capa-bility is nonlinear, indicating that improving safety requires a multifaceted approach, considering various factors in both development and application.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Zhang et al. (Fri,) studied this question.

www.synapsesocial.com/papers/68e58a50b6db643587525d17 — DOI: https://doi.org/10.20944/preprints202409.1098.v1

Authors

Zihan Zhang

Yongbing Gao

Lidong Yang

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

CS-Eval—A Concise Benchmark for Evaluating the Security Risks of Large Language Models

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Also consider