What question did this study set out to answer?

The aim is to evaluate the effectiveness of AI tools in diagnosing benign skin lesions through sensitivity and specificity analysis.

April 19, 2026 | Open Access

How good are artificial intelligence tools at identifying benign skin lesions? A systematic review and meta-analysis of the specificity of artificial intelligence tools in diagnosing suspicious skin lesions

Key points

The aim is to evaluate the effectiveness of AI tools in diagnosing benign skin lesions through sensitivity and specificity analysis.
A comprehensive search was conducted across multiple databases, following the PRISMA 2020 guidelines.
Studies that recorded sensitivity, specificity and diagnosis were included.
Nine studies met the inclusion criteria and were assessed.
Data were extracted on the type of AI model used, sample size and diagnostic metrics.
Risk of bias was assessed using the QUADAS-2 framework.
AI tools showed specificity ranging from 70% to 95%, with a pooled specificity of 82%.
Sensitivity ranged from 65% to 96%, with a pooled sensitivity of 89%.
AI tools performed well in distinguishing between malignant and benign skin lesions.

Abstract

Background: Artificial intelligence (AI) is a transformative diagnostic tool in dermatology. As the prevalence of skin cancer rises and pressure on health services increases, there is an increasing demand for efficient diagnostic tools. Therefore, it is highly relevant to evaluate the diagnostic abilities of AI tools with a focus on not just the sensitivity, but also the specificity, to reduce unnecessary referrals and skin biopsies for benign skin lesions.

Objectives: To evaluate the effectiveness of AI tools in diagnosing benign skin lesions, with a primary focus on calculating the sensitivity and specificity of AI tools when analysing suspicious skin lesions.

Methods: A comprehensive search was conducted on multiple databases, adhering to the PRISMA 2020 guidelines. Studies with recorded sensitivity, specificity and diagnosis were included. Nine studies meeting the inclusion criteria were assessed. Data extracted included type of AI model used, sample size and important diagnostic metrics. Risk of bias was also assessed using the Quality Assessment of Diagnostic Accuracy Studies version 2 (QUADAS-2) framework.

Results: Across the included studies, AI tools demonstrated a specificity of 70–95% with a pooled specificity of 82%, and a sensitivity ranging from 65% to 96% with a pooled sensitivity of 89% when classifying suspicious skin lesions as malignant vs. benign.

Conclusions: AI tools possess significant potential to streamline dermatological diagnostics, especially in resource-restricted clinical setups. High sensitivity will minimize false negatives, which is crucial for early detection, and high specificity will allow the use of AI tools to autonomously discharge patients with benign skin lesions. However, there is a need for further optimization and training of AI models on more diverse datasets.
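For readers less familiar with the two metrics reported here, the following is a minimal sketch of how sensitivity and specificity are computed from a study's confusion counts, treating "malignant" as the positive class. The counts, the `ConfusionCounts` type and the `naive_pooled` helper are hypothetical illustrations and are not taken from the review. The pooling shown simply sums counts across studies; the review does not state its pooling method here, and diagnostic meta-analyses often use more elaborate random-effects models, so this should not be read as the authors' procedure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Per-study counts, with 'malignant' as the positive class (hypothetical type)."""
    tp: int  # malignant lesions the tool called malignant
    fn: int  # malignant lesions the tool called benign
    tn: int  # benign lesions the tool called benign
    fp: int  # benign lesions the tool called malignant


def sensitivity(c: ConfusionCounts) -> float:
    # Share of malignant lesions correctly flagged: TP / (TP + FN).
    return c.tp / (c.tp + c.fn)


def specificity(c: ConfusionCounts) -> float:
    # Share of benign lesions correctly cleared: TN / (TN + FP).
    return c.tn / (c.tn + c.fp)


def naive_pooled(studies: list[ConfusionCounts]) -> tuple[float, float]:
    # Assumption: pool by summing raw counts across studies.
    # This is a simplification, not the method used in the review.
    total = ConfusionCounts(
        tp=sum(s.tp for s in studies),
        fn=sum(s.fn for s in studies),
        tn=sum(s.tn for s in studies),
        fp=sum(s.fp for s in studies),
    )
    return sensitivity(total), specificity(total)


# Made-up counts for two imaginary studies, for illustration only.
studies = [
    ConfusionCounts(tp=90, fn=10, tn=160, fp=40),
    ConfusionCounts(tp=45, fn=5, tn=85, fp=15),
]

for i, s in enumerate(studies, start=1):
    print(f"study {i}: sensitivity={sensitivity(s):.2f}, specificity={specificity(s):.2f}")

pooled_sens, pooled_spec = naive_pooled(studies)
print(f"naive pooled: sensitivity={pooled_sens:.2f}, specificity={pooled_spec:.2f}")
```

In this framing, a higher specificity means fewer benign lesions are wrongly flagged as malignant, which is the quantity tied to unnecessary referrals and biopsies.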

Cite This Study

Nirula et al. studied this question.

synapsesocial.com/papers/69e47440010ef96374d8ff50
https://doi.org/10.1093/skinhd/vzag021
