What question did this study set out to answer?

To evaluate whether an AI platform integrated with a knowledge graph can improve clinical trial matching efficiency and equity in oncology.

April 10, 2026Open Access

Transforming oncology clinical trial matching through neuro-symbolic, multi-agent AI and an oncology-specific knowledge graph: a prospective evaluation in 3804 patients

Key Points

To evaluate whether an AI platform integrated with a knowledge graph can improve clinical trial matching efficiency and equity in oncology.
Screened 3804 consecutive patients with metastatic or progressive malignancies over 12 months.
Employed a multi-agent architecture combining LLM-based extraction, knowledge graph, and prioritization engine.
Compared performance against manual screening and various GPT-4 baselines regarding trial matching metrics.
Achieved an F1 score of 0.82 compared to the GPT-4 zero-shot baseline of 0.47.
Reduced median screening time from 120 minutes (manual) to 30 minutes (automated and clinical review combined).
Identified 17,912 oncologist-confirmed matches from 23,912 candidate pairs, with a median time-to-recommendation of less than 7 days.

Abstract

Background: Clinical trial enrollment in oncology remains critically low, with fewer than 5% of eligible adults participating, in large part due to the complexity and labor intensity of eligibility screening.We prospectively evaluated a neuro-symbolic, multi-agent artificial intelligence (AI) platform integrating domain-specific large language model (LLM) agents, an oncology-specific knowledge graph, a real-time recommendation engine, and human-in-the-loop review to determine whether automated extraction and reasoning can safely improve trial identification, efficiency, and equity at scale.Methods: Consecutive patients N = 3804; Eastern Cooperative Oncology Group (ECOG) 0-2 balanced for cancer type incidence with metastatic or progressive malignancies were screened across a 12-month period.A multiagent architecture-OncoAgents (LLM-based extraction and reasoning agents), OncoGraph (oncology knowledge graph), OncoRecommend (prioritization engine), and OncoSet (expert-curated corpus)-carried out automated data extraction, harmonization, and trial matching over 157 367 clinical pages (86.5 M tokens).Dual oncologists produced a gold standard of trial eligibility labels (Cohen's = 0.92).The primary unit of analysis was the patient-trial pair.Baselines included manual screening, GPT-4 zero-shot prompting, GPT-4 chain-of-thought, and frontier GPT-4o extraction/matching benchmarks.Outcomes included sensitivity, specificity, precision, F1 score, calibration of eligibility confidence scores, time-to-recommendation, fairness across demographic subgroups, and operational burden.Results: The multi-agent neuro-symbolic system achieved an F1 score of 0.82 (95% confidence interval 0.81-0.83).In comparison, the GPT-4 zero-shot baseline achieved an F1 of 0.47, and the GPT-4 chain-of-thought baseline achieved an F1 of 0.67.Per-patient screening time decreased from a median of 120 min (manual review) to 30 min total (15 min automated processing + 15 min clinical review).Across the cohort, the system processed 157 000 pages, screened 23 912 candidate patient-trial pairs, and produced 17 912 oncologist-confirmed matches, with median time-torecommendation <7 days.No demographic subgroup exceeded a 10-percentage point F1 gap; the largest observed difference was 7 points between white and black/African American patients.Ablation experiments showed that both knowledge graph grounding and multi-agent decomposition contributed materially to performance and efficiency.Eligibility confidence scores exhibited reasonable calibration in the clinically relevant operating range.Conclusions: A neuro-symbolic, multi-agent architecture that couples LLM-based extraction with ontology-grounded, deterministic eligibility reasoning improved the accuracy, throughput, and timeliness of oncology clinical trial matching versus LLM-only baselines, while preserving clinician oversight and maintaining modest subgroup performance gaps.These results support scalable, equity-aware deployment of AI-assisted trial screening in routine oncology practice.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

A. Loaiza-Bonilla

C. Yost

S. Kurnaz

Journals

ESMO Real World Data and Digital Oncology

Actions

Institutions

Creighton University

Lynn University

Phoenix (United States)

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Transforming oncology clinical trial matching through neuro-symbolic, multi-agent AI and an oncology-specific knowledge graph: a prospective evaluation in 3804 patients

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study