Genetic aberrations are among the critical driving factors of lung cancer. Importantly, the impact of genetic variations on proteomic dysregulations with the goal of characterizing potential diagnostic biomarkers at the population-level requires additional investigation. Modeling such proteogenomic interactions is crucial in understanding early-stage biological disruptions to inform biomarker discovery, successful clinical trials, and developing effective therapeutics. We investigated two complementary aspects of lung cancer risk. First, we performed a genome-wide association study of lung cancer using population-scale datasets, then examined whether lung cancer risk-associated variants influence plasma protein levels using the UK Biobank Pharma Proteomics Project data. Second, we identified plasma proteomic dysregulations in presymptomatic and symptomatic patients with the objective of pinpointing diagnostic biomarkers through leveraging machine learning methods. Using the identified proteins, machine learning models achieved median cross-validated AUCs of 0.85–0.88 (0–4 years before diagnosis YBD), 0.81–0.84 (5–9 YBD), and 0.80–0.86 (0–9 YBD). Performing survival analyses within the 5–9 YBD group, elevated levels of eight proteins, such as CALCB, PLAUR, and CD74, were found to significantly associate with lower survival. We identified 22 disease-associated proteins, of which 14 have been previously implicated in lung cancer, including CEACAM5, CXCL17, GDF15, WFDC2 along with 8 novel proteins. These proteins were enriched in pathways related to cytokine signaling, interleukin regulation, neutrophil degranulation, and lung fibrosis. While these findings do not establish mechanistic causality, they highlight proteomic alterations reflecting systemic changes preceding the diagnosis. Our study contributes to understanding genome–proteome relationships in lung cancer and identifies circulating proteins warranting further investigation as potential early biomarkers for screening and risk stratification. Lung cancer is the leading cause of cancer-related deaths worldwide and is often diagnosed at late stage, when treatment is less effective. Early detection is essential but challenging, as symptoms typically appear only in advanced stages of the disease. In this study, we analyzed genetic data and proteins in blood collected years before a lung cancer diagnosis. We identified 22 proteins whose levels change years before lung cancer onset. These proteins are linked to the immune system, inflammation, and lung tissue changes. Our findings suggest that blood-based markers may help support earlier lung cancer detection and improve risk assessment and could inform future strategies for disease monitoring and prevention. Johnson et al. investigate genetic variants linked to lung cancer risk and plasma proteins predicting future diagnosis. Protein levels in blood change up to 9 years before disease onset, highlighting 22 proteins involved in immune signalling, inflammation, and long fibrosis.
Building similarity graph...
Analyzing shared references across papers
Loading...
Marcela A. Johnson
Shirley Nieves-Rodriguez
Liping Hou
Communications Medicine
Johnson & Johnson (United States)
Janssen (United States)
Building similarity graph...
Analyzing shared references across papers
Loading...
Johnson et al. (Fri,) studied this question.
www.synapsesocial.com/papers/69b6069b83145bc643d1c94e — DOI: https://doi.org/10.1038/s43856-026-01500-1
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: