What question did this study set out to answer?

The aim is to develop a deep learning framework for clinical outcome prediction by integrating diverse patient data modalities.

April 5, 2026

Abstract 1482: A deep learning-based multimodal integration framework for clinical outcome prediction

Key Points

The aim is to develop a deep learning framework for clinical outcome prediction by integrating diverse patient data modalities.
Developed a multimodal integration framework using deep learning techniques.
Utilized TCGA non-small cell lung cancer data to predict overall survival.
Combined whole slide images, gene expression, and mutation data using a transformer architecture.
Applied Cox proportional hazards model for survival association and feature contribution interpretation.
Achieved a C-index of 0.64 for TCGA and showed robust performance across datasets.
Identified 349 core OS-related genes linked to cancer pathways.
Found significant prognostic values for multiple independent datasets.
Identified 11 exclusive genes for immunotherapy linked to immune-related pathways.

Abstract

Abstract In oncology, a diverse range of advanced approaches, including medical imaging, genomic and transcriptomic profiling, and clinical data analysis, are utilized to comprehensively characterize each patient's tumor. This integrative strategy provides actionable insights that inform personalized care plans and therapeutic decisions, ultimately aiming to optimize patient outcomes. However, practical challenges persist, including how to integrate unstructured imaging data and address missing data modalities.Here, we present a new deep learning based, multimodal integration framework to address these challenges. Our approach incorporates a single modal loss, calculated between each modal representation and the patient's clinical outcome, which encourages the model to learn more clinical outcome relevant features from each modality. Furthermore, a mutual information estimator is implemented to enable the model to explore exclusive features within each data modality. For multimodal fusion, we leveraged a transformer architecture to combine different modalities’ embeddings into unified patient-level representations. In our study, we specifically focused on whole slide images, gene expression and mutation. We utilized TCGA non-small cell lung cancer (NSCLC) data (n=989) to develop our model for predicting overall survival (OS), and validated its performance in three independent datasets: CPTAC (n=208), ConcertAI RWD360® linked Caris datasets (ConcertAI/Caris: n=2176), City of hope (COH, n=84). We observed high and robust performance across the four datasets (TCGA: 0.64 ± 0.03; CPTAC: 0.60 ± 0.04; ConcertAI/Caris: 0.59 ± 0.03; COH: 0.61 ± 0.04) using C-index as the evaluation metric. Further evaluating survival association in the three independent datasets indicated significant prognostic values of our model in both univariable (CPTAC: HR = 3.29, p = 0.007; ConcertAI/Caris: HR = 1.50, p = 5e-6; COH: HR = 3.16, p = 0.008) and multivariable (CPTAC: HR = 2.72, p = 0.03; ConcertAI/Caris: HR = 1.40, p = 0.002; COH: HR = 3.81, p = 0.02) Cox proportional hazards model after adjusting for known prognostic clinical factors.To interpret our model, we applied integrated gradients method to understand each feature’s contribution to model output. Specifically, we identified 349 core OS-related genes in NSCLC, enriching in cancer-related pathways such as epithelial mesenchymal transition, focal adhesion and TNFA signaling via NFKB. Stratifying patient cohorts by treatment types allowed us to further identify exclusive genes per treatment. As a result, we identified 11 exclusive genes for immunotherapy, with enrichment in immune-related and metabolism pathways.In summary, we developed a multimodal integration framework that predicts clinical outcomes with high and robust performance. Interpretating our framework reveals potential prognostic and predictive markers to advance therapeutic development. Citation Format: Baoyi Zhang, Helen Tian, Thanh Bui, Yookyung Christy Choi, Mona H. Cai, Peter Ansell, Aditee Shrotre, Steven Chirieleison, Kevin Kolahi, Xi Zhao, Josue Samayoa, Weilong Zhao. A deep learning-based multimodal integration framework for clinical outcome prediction abstract. In: Proceedings of the American Association for Cancer Research Annual Meeting 2026; Part 1 (Regular Abstracts); 2026 Apr 17-22; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2026;86(7 Suppl):Abstract nr 1482.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Baoyi Zhang

Helen Tian

Thanh Cong Bui

Journals

Cancer Research

Actions

Institutions

University of Chicago

AbbVie (United States)

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Abstract 1482: A deep learning-based multimodal integration framework for clinical outcome prediction

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider