What question did this study set out to answer?

Evaluate the landscape of clinical evaluation for medical AI by analyzing systematic reviews and their trials.

May 3, 2026

A quantitative analysis of global AI medical studies: gaps in randomized controlled trials.

Key Points

Evaluate the landscape of clinical evaluation for medical AI by analyzing systematic reviews and their trials.
Conducted a scoping review of 218 systematic reviews published from September 2023 to September 2024.
Classified 4667 primary studies by research stage, geography, specialty, and study design.
Analyzed the percentage of studies classified as randomized controlled trials (2.4%).
Only 2.4% of identified studies were randomized controlled trials (RCTs), with 88.2% being preclinical.
Among RCTs, 67.3% were single-center studies and 71.7% did not adhere to reporting guidelines.
Favorable outcomes were reported in 82.3% of RCTs, but methodological concerns were prevalent, especially in allocation concealment.

Abstract

Medical artificial intelligence (AI) has advanced rapidly, yet a comprehensive quantitative overview of its clinical evaluation landscape remains lacking. We conducted a scoping review of 218 systematic reviews published between September 2023 and September 2024, from which 4667 primary studies were identified and classified by research stage, geography, specialty, and study design. Most studies were preclinical (88.2%, 4114/4667), while only 2.4% (113/4667) were randomized controlled trials (RCTs). Research was highly geographically concentrated: the top 10 contributing countries accounted for 75.5% of total country contributions, of which the United States and China contributed 47.5%. Among all primary studies, neoplasms (32.5%, 1518/4667) and musculoskeletal disorders (14.1%, 656/4667) were the most common, whereas digestive (38.1%, 43/113) and circulatory diseases (12.4%, 14/113) accounted for most RCTs. Among RCTs, 67.3% (76/113) were single-center, and 71.7% (81/113) did not report adherence to any reporting guideline. Overall, 82.3% (93/113) of RCTs reported favorable outcomes, although methodological concerns were common, particularly in allocation concealment and blinding. These findings indicate that medical AI research remains heavily skewed toward early-stage development, with limited high-quality clinical evidence. Strengthening trial design, multicenter collaboration, and reporting transparency will be critical to support the safe and equitable integration of AI into clinical practice.

Bookmark

A quantitative analysis of global AI medical studies: gaps in randomized controlled trials.

Key Points

Abstract

Cite This Study