What question did this study set out to answer?

This research aims to improve the security and correctness of Python code generated by LLMs through integration with SAST tools and fine-tuning techniques.

April 4, 2026 · Open Access

CodeEnhancer: LLM-generated Python code enhancement through SAST integration and fine-tuning

Key Points

Developed CodeEnhancer framework combining LLMs with SAST tools like Pylint and Bandit
Implemented iterative validation pipeline for code issue identification and remediation
Fine-tuned LLMs on expert-written and framework-refined code samples
Conducted comparative experiments using LLMSecEval and SecurityEval datasets
Eliminated 82.8% of initial vulnerabilities in GPT-4o-generated code on LLMSecEval using the validation pipeline
Cut the share of vulnerable code snippets on LLMSecEval to 18.4% with the framework-tuned model, compared with 43.6% for the baseline model and 54.7% for the expert-tuned model
Reduced final vulnerability rates to 6.7% on LLMSecEval and 3.5% on SecurityEval with the framework-tuned model
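
To put the vulnerable-snippet rates in perspective, the short calculation below converts them into relative reductions. These derived percentages are not reported in the source; they follow only from the 18.4%, 43.6%, and 54.7% figures given for LLMSecEval, and the function name `relative_reduction` is purely illustrative.

```python
def relative_reduction(reference_rate: float, new_rate: float) -> float:
    """Return the relative drop from reference_rate to new_rate, in percent."""
    return (reference_rate - new_rate) / reference_rate * 100.0


# Vulnerable-snippet rates (percent) quoted for the LLMSecEval dataset.
FRAMEWORK_TUNED = 18.4
BASELINE = 43.6
EXPERT_TUNED = 54.7

# Derived values, computed here rather than taken from the paper.
vs_baseline = relative_reduction(BASELINE, FRAMEWORK_TUNED)
vs_expert = relative_reduction(EXPERT_TUNED, FRAMEWORK_TUNED)

print(f"Relative reduction vs. baseline model: {vs_baseline:.1f}%")
print(f"Relative reduction vs. expert-tuned model: {vs_expert:.1f}%")
```

Under these figures, the framework-tuned model produces roughly 58% fewer vulnerable snippets than the baseline and roughly 66% fewer than the expert-tuned model.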

Abstract

Despite the rapid adoption of Large Language Models (LLMs) for automatic code generation, their output often exhibits syntax errors, security vulnerabilities, and functional inconsistencies. To address these issues, we present CodeEnhancer, a two-stage framework that tightly integrates LLMs with static application security testing (SAST) tools and targeted fine-tuning. The goal is to produce more secure and functionally correct Python code. In the first stage, our iterative validation pipeline couples LLM-generated code with tools such as Pylint and Bandit. These tools automatically identify and remediate issues through structured feedback loops. When applied to the GPT-4o model, this process eliminated 82.8% of the initial vulnerabilities and resolved all the detected functional correctness issues when tested on the LLMSecEval dataset. In the second stage, we fine-tune the LLMs using two types of secure code examples: expert-written samples and code refined by our framework. Comparative experiments demonstrate that the framework-tuned model outperforms the baseline and expert-tuned models. The framework-tuned model generates only 18.4% vulnerable code snippets on the LLMSecEval dataset, whereas the baseline and expert-tuned models produce 43.6% and 54.7% vulnerable code snippets, respectively. The framework-tuned model reduces final vulnerability rates to 6.7% on LLMSecEval and 3.5% on the SecurityEval dataset. Our results highlight the synergistic effect of integrating static analysis with feedback-informed fine-tuning. They also reveal limitations in current evaluation metrics and dataset representativeness. These findings suggest a scalable, robust approach to achieving more secure, trustworthy, and practical AI-assisted code generation.

• Combines language models with SAST tools to enhance the syntax, security, and functional correctness of Python code.
• First approach to address syntax, security, and functional correctness in LLM-generated code.
• Automated feedback and learning process helps LLMs generate more secure, correct code.
• Fine-tuning on framework-refined code leads to better security than training on expert-written code.
• Scalable approach enables robust and trustworthy AI-assisted code generation and refinement with minimal manual effort.
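
The first-stage feedback loop described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `refine`, `syntax_check`, `flag_eval`, and `fake_llm` are hypothetical names, the analyzers are simple stand-ins for Pylint and Bandit, the generator is a stub in place of a real LLM call, and the `max_rounds` limit is an assumed parameter that the abstract does not specify.

```python
from dataclasses import dataclass
from typing import Callable, List

Analyzer = Callable[[str], List[str]]  # source code -> list of finding messages
Generator = Callable[[str], str]       # prompt -> generated source code


@dataclass
class RefinementResult:
    code: str            # last version of the generated code
    findings: List[str]  # findings still open for that version
    iterations: int      # number of generator calls made


def syntax_check(code: str) -> List[str]:
    """Stand-in analyzer: report a syntax error via the built-in compile()."""
    try:
        compile(code, "<generated>", "exec")
        return []
    except SyntaxError as exc:
        return [f"syntax error on line {exc.lineno}: {exc.msg}"]


def flag_eval(code: str) -> List[str]:
    """Toy security rule standing in for a SAST tool: flag calls to eval()."""
    return ["use of eval() detected"] if "eval(" in code else []


def refine(task: str, generate: Generator, analyzers: List[Analyzer],
           max_rounds: int = 3) -> RefinementResult:
    """Generate code, analyze it, and feed findings back until clean or out of rounds."""
    def run_analyzers(source: str) -> List[str]:
        return [msg for analyze in analyzers for msg in analyze(source)]

    code = generate(task)
    calls = 1
    findings = run_analyzers(code)
    while findings and calls < max_rounds:
        # Structured feedback: original task, previous attempt, and the findings.
        issue_list = "\n".join(f"- {msg}" for msg in findings)
        prompt = (f"{task}\n\nPrevious attempt:\n{code}\n\n"
                  f"Fix these issues:\n{issue_list}")
        code = generate(prompt)
        calls += 1
        findings = run_analyzers(code)
    return RefinementResult(code, findings, calls)


def fake_llm(prompt: str) -> str:
    """Stub generator: returns unsafe code first, safer code once given feedback."""
    if "Fix these issues" in prompt:
        return "value = int('1')\n"
    return "value = eval('1')\n"


result = refine("Parse the string '1' into a number.", fake_llm,
                [syntax_check, flag_eval])
print(result.iterations, result.findings)
```

In a real setup, the stand-in analyzers would be replaced by wrappers that run Pylint and Bandit on the generated file and parse their reports, and `fake_llm` by a call to the model under test. Keeping both as injected callables leaves the loop itself independent of any particular tool or model.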

Cite this study

Lee et al. studied this question.

www.synapsesocial.com/papers/69d0aefd659487ece0fa4e64 — DOI: https://doi.org/10.1016/j.knosys.2026.115925

Authors

Jongmin Lee

Khang Mai

Nakul D. Ghate

Journals

Knowledge-Based Systems
