What question did this study set out to answer?

The aim is to develop a transferable model for software vulnerability detection across different projects.

February 28, 2026Open Access

Data-Driven Transferable Modeling for Cross-Project Software Vulnerability Detection via Dual-Feature Stacking Ensemble

Key Points

The aim is to develop a transferable model for software vulnerability detection across different projects.
Utilized a dual-feature stacking ensemble approach.
Extracted code semantic features via gated graph neural networks.
Incorporated expert-designed metrics.
Employed TrAdaBoost for cross-domain data modeling.
Adaptively fused features to overcome fixed-weight limitations.
Achieved an average AUC of 0.814 in vulnerability detection.
Significantly outperformed existing mainstream baseline models.
Demonstrated effective model generalization across diverse datasets.

Abstract

In recent years, deep learning-based vulnerability detection has drawn wide attention for its data-driven ability to analyze code semantics and learn vulnerability patterns without predefined models. However, data distribution differences across projects limit model generalization. Transfer learning provides a solution, yet most studies ignore expert-designed metrics. This paper proposes Decpvd, a data-driven cross-project software vulnerability detection method based on a dual-feature stacking ensemble. It builds an adaptive and transferable model using only code and vulnerability label data from source and target projects. It extracts code semantic features via Gated Graph Neural Networks, incorporates expert metrics from tools, performs cross-domain data-driven modeling with TrAdaBoost, and adaptively fuses the two features through stacking, overcoming fixed-weight fusion limitations. Experiments on six cross-project groups from three real datasets (FFmpeg, LibTIFF, LibPNG) show that Decpvd achieves an average AUC of 0.814, significantly outperforming mainstream baselines.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Yu Liu

Bin Liu

Shihai Wang

Journals

Mathematics

Actions

Institutions

Beihang University

Changsha Normal University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Data-Driven Transferable Modeling for Cross-Project Software Vulnerability Detection via Dual-Feature Stacking Ensemble

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study