Automated identification of lumbar vertebrae plays a key role in modern spine analysis, offering valuable assistance for diagnostic assessment and preoperative decision-making. Despite recent progress in deep learning-based detection methods, accurately localizing vertebral structures remains challenging due to anatomical variability and heterogeneous image quality. To address the difficulty of capturing subtle vertebral structures, we introduce a Multi-Scale Channel Attention Block (MSCABlock) integrated into the YOLO11 backbone. Unlike conventional attention-based or multi-scale convolutional designs, MSCABlock jointly exploits channel-wise feature interaction and multi-scale receptive fields to enhance both local detail sensitivity and contextual representation, while preserving computational efficiency. The proposed approach is designed to improve detection performance without significantly increasing model complexity. Our model is trained and validated using only the AP-view images from the Burapha University Lumbar-Spine Dataset (BUU-LSPINE), which provides well-annotated lumbar spine X-ray images from 400 unique patients. The proposed approach operates in a fully end-to-end manner, allowing vertebrae to be identified directly from input images without relying on handcrafted feature engineering or complex preprocessing pipelines. Experimental evaluations show that the proposed model achieves strong detection performance, with mAP@0.5 and mAP@0.5–0.95 reaching 0.982 and 0.79, respectively, alongside a precision of 0.93 and a recall of 0.975. Compared with the YOLO11 baseline, ablation and efficiency analyses demonstrate that MSCABlock consistently improves detection performance. It introduces only marginal increases in model parameters and computational cost, thereby preserving a lightweight architecture and maintaining efficient inference. These results show that the optimized YOLO11-based system generalizes well across lumbar levels. It maintains reliable detection under challenging conditions, providing robust automated localization to support large-scale clinical spine analysis.
Fredj et al. (Tue,) studied this question.