What question did this study set out to answer?

The aim is to predict the pro-inflammatory potential of peptides using machine learning based on their properties and sequence patterns.

February 13, 2026Open Access

Predicting proinflammatory peptides using machine learning models

Key Points

The aim is to predict the pro-inflammatory potential of peptides using machine learning based on their properties and sequence patterns.
Utilized amino acid-based methods and k-mer sequence patterns for prediction.
Evaluated model performance using a peptide dataset from PIP-EL and Orange Data Mining platform.
Employed various algorithms including Random Forest, Gradient Boosting, Logistic Regression, and Neural Networks.
Amino acid-based models achieved AUC values of 0.965 and 0.955 for Random Forest and Gradient Boosting, respectively.
K-mer methods achieved an AUC of 0.980 for both Logistic Regression and Neural Networks.
Classification accuracy, F1-score, precision, and recall ranged from 0.91 to 0.93.

Abstract

Pro-inflammatory peptides are key immune signaling molecules that contribute to vaccine and immunotherapy development. Machine learning enables rapid, accurate, and high-throughput prediction of these peptides, complementing traditional experimental approaches with computational methods. This study tests the hypothesis that a peptide’s pro-inflammatory potential can be predicted from its physicochemical properties using an amino acid–based methods, or from k-mer sequence patterns using bag-of-words methods. Model performance was evaluated using a peptide dataset from PIP-EL implemented in the Orange Data Mining platform. The amino acid–based methods employing tree-based algorithms using Random Forest and Gradient Boosting achieved area under the ROC curve (AUC) values of 0.965 and 0.955, respectively, while k-mer (k = 5) methods using Logistic Regression and Neural Networks both achieved AUC values of 0.980, with AUCs consistently above 0.95 for k = 3–8. Performance metrics were calculated for these models, with classification accuracy, F1-score, precision, and recall ranging from 0.91 to 0.93, and Matthews correlation coefficients (MCC) ranging between 0.82 and 0.86. These results demonstrate that properly configured machine learning models can effectively predict pro-inflammatory peptides computationally. While the findings are broadly consistent with previous studies, direct performance comparisons should be interpreted cautiously due to differences in algorithms, underlying hypotheses, therapeutic targets, and datasets. Overall, this study evaluates two machine learning approaches and presents reproducible models with strong performance metrics, which help inform future peptide-based wet-lab therapeutic research.

Bookmark

View Full Paper

Cite This Study

Yanling Lin (Sat,) studied this question.

synapsesocial.com/papers/698ebf6985a1ff6a93016e6a https://doi.org/https://doi.org/10.13021/mars/15248

Bookmark

View Full Paper