Background: A major technical challenge in single-cell transcriptomics is the absence of an integrative analytic pipeline that can simultaneously leverage gene regulatory network (GRN) architecture, AI-assisted gene panel discovery, and functional relevance analyses to generate coherent biological insights. Existing approaches often treat these components independently, focusing on clusters, marker genes, or predictive features without integrating them into a mechanistically grounded framework. Consequently, comprehensive screening that links regulatory association, gene signature screening, and functional interpretation within single-cell datasets remains limited, underscoring the need for an integrated strategy. Methods: We developed an integrative bioinformatics pipeline based on Gene regulatory network–AI–Functional Analysis (GAFA), combining latent-space integration, unsupervised clustering, diffusion pseudotime analysis, lineage-resolved generalized additive modeling, GRN inference, and machine learning-based gene panel discovery. This framework enables systematic mapping of cell-state structure, reconstruction of differentiation and effector trajectories, and identification of transcriptional and regulatory features strongly associated with clinical outcomes. As a case study, we applied the pipeline to NK cell transcriptomes from six CML patients (two early relapse, two late relapse, two durable treatment-free remission—TFR; 15 samples) collected at TKI discontinuation and 6–12 months after therapy cessation. Results: We reanalyzed publicly available scRNA-seq data from a previously published CML cohort to evaluate NK-cell transcriptional programs associated with treatment-free remission and relapse. We resolved six transcriptionally distinct NK cell states spanning CD56bright-like cytokine-responsive, early activated, terminally mature, cytotoxic, lymphoid trafficking, and HLA-DR+ immunoregulatory populations, each exhibiting outcome-specific compositional differences. Pseudotime analysis revealed two major NK cell lineages—a maturation trajectory and a cytotoxic effector trajectory. TFR samples displayed balanced occupancy of both lineages, whereas early relapse samples showed marked depletion of the maturation branch and preferential accumulation in cytotoxic end states. AI-guided feature selection and random forest modeling identified an 18-gene panel that distinguished NK cells from TFR and relapse samples in an exploratory manner. Among them, CST7, FCER1G, GNLY, GZMA, and HLA-C were conventional NK-associated genes, whereas ACTB, CYBA, IFITM2, IFITM3, LYZ, MALAT1, MT2A, MYOM2, NFKBIA, PIM1, S100A8, S100B, and TSC22D3 were novel. The GRN inference further uncovered outcome-specific regulatory modules, with RUNX3, EOMES, ELK4, and REL regulons enriched in TFR, whereas FOSL2 and MAF regulons were enriched in relapse, and their downstream targets linked to IFN-γ signaling, metabolic reprogramming, and immunoregulatory feedback circuits. Conclusions: This AI-enabled single-cell analysis demonstrates how NK cell state composition, differentiation trajectories, and regulatory network rewiring collectively shape TFR versus relapse following TKI discontinuation in CML. The integrative pipeline provides a modular framework that could be extended to additional datasets for data-driven biomarker discovery and mechanistic stratification, and highlights candidate transcriptional regulators and NK cell programs that may be leveraged to improve remission durability, pending validation in larger patient cohorts.
Building similarity graph...
Analyzing shared references across papers
Loading...
Borra et al. (Mon,) studied this question.
www.synapsesocial.com/papers/69d8940c6c1944d70ce04f7d — DOI: https://doi.org/10.3390/biology15070588
Santoshi Borra
Da Yan
Robert S. Welner
Biology
University of Alabama at Birmingham
Indiana University Bloomington
Auburn University
Building similarity graph...
Analyzing shared references across papers
Loading...