What question did this study set out to answer?

The aim is to develop a method to accurately predict disease-associated peptides using only positive data.

February 17, 2026Open Access

DPAS: disease-associated peptide anomaly score for identifying pathogenic peptides via one-class learning

Key Points

The aim is to develop a method to accurately predict disease-associated peptides using only positive data.
Employed one-class classification approaches focusing solely on positive-labeled data.
Used three classifiers: One-Class Support Vector Machines, Isolation Forest, and Autoencoders.
Evaluated predictions based on mean reconstruction errors from Autoencoders.
Autoencoders achieved the best performance among the classifiers.
Introduced the Disease Peptide Anomaly Score (DPAS) for prioritizing peptides.
DPAS incorporates anomaly scores and feature importance for effective peptide ranking.

Abstract

Abstract Predicting disease-associated peptides is a challenging task in bioinformatics, mostly hindered by the lack of reliable negative datasets, leading to biased predictions. In this study, we propose a one-class classification approach that focuses exclusively on positive-labeled data. We employed three classifiers namely One-Class Support Vector Machines (OCSVM), Isolation Forest, and Autoencoders to classify disease-associated peptides, with Autoencoders yielding the best results. The Autoencoders trained on the positive dataset effectively differentiated the inliers from outliers which is further evaluated by mean reconstruction errors. Our method combines various sequence based features together. This framework provides an efficient solution for predicting disease-associated peptides that also overcomes the traditional binary classification approaches. To enhance interpretability and peptide prioritization, we introduce a new scoring metric Disease Peptide Anomaly Score (DPAS) which combines model-derived anomaly scores with feature importance values obtained using SHAP (SHapley Additive exPlanations). DPAS facilitates the ranking of peptides based on their likelihood of being disease-associated, offering a robust and interpretable approach for peptide biomarker discovery.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Zoya Khalid

Razia Khalid

Osman Uğur Sezerman

Journals

Scientific Reports

Actions

Institutions

COMSATS University Islamabad

National University of Computer and Emerging Sciences

Acıbadem University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

DPAS: disease-associated peptide anomaly score for identifying pathogenic peptides via one-class learning

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider