What question did this study set out to answer?

The research aims to enhance vehicle appearance detection accuracy in complex environments using a modified YOLOv11-seg model.

April 21, 2026 · Open Access

A segmented model for automobile appearance detection based on improved YOLOv11-seg

Key Points

Introduced the MCALayerPlus module for multi-scale feature extraction.
Implemented an improved ShapeIoU loss function for better shape matching and convergence.
Conducted experiments on a specialized automotive dataset to evaluate model performance.
Achieved mean Average Precision (mAP@0.5) of 94.09% and mAP@0.5:0.95 of 77.12%.
Obtained precision of 91.31% and recall of 90.75%.
Maintained a lightweight model size of 5.75 MB with a processing speed of 45.3 FPS.
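
As a rough reading of the reported figures, the sketch below derives two quantities that the source does not state: the F1 score implied by the reported precision and recall, and the per-frame latency implied by 45.3 FPS. The function names are illustrative, and the derived values are plain arithmetic on the reported numbers, not results from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both as fractions in [0, 1])."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def latency_ms(fps: float) -> float:
    """Average time per frame in milliseconds for a given throughput."""
    return 1000.0 / fps


# Reported values: precision 91.31%, recall 90.75%, speed 45.3 FPS.
f1 = f1_score(0.9131, 0.9075)
lat = latency_ms(45.3)
print(f"Implied F1 = {f1:.4f}, implied latency = {lat:.2f} ms/frame")
```

The implied latency assumes the 45.3 FPS figure measures single-stream inference; the source does not specify the hardware or batch size.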

Abstract

Conventional models encounter challenges in detecting vehicle appearance components in intricate settings because of their limited small-target recognition capability and suboptimal fusion of multi-scale features. To address these issues, we propose an enhanced vehicle appearance segmentation model based on the YOLOv11-seg framework. Central to our approach is the MCALayerPlus module, designed to concurrently process targets across a wide range of scales. By executing multi-scale feature extraction, the model effectively suppresses false detections arising from cluttered backgrounds. Furthermore, we incorporate an improved ShapeIoU loss function, which integrates a size-sensitivity factor and a category-aware shape penalty term. This integration sharpens shape-matching precision, captures nuanced feature representations, and accelerates model convergence. Experimental results on a specialized automotive dataset demonstrate state-of-the-art performance, achieving a mean Average Precision (mAP@0.5) of 94.09%, an mAP@0.5:0.95 of 77.12%, precision of 91.31%, and recall of 90.75%. Notably, the model maintains a lightweight profile (5.75 MB), ensuring high-speed inference (45.3 FPS) suitable for real-time deployment in intelligent transportation systems.
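
To make the two mAP figures easier to compare, here is a minimal sketch of the convention behind them: mAP@0.5 counts a prediction as correct when its IoU with the ground truth is at least 0.5, while mAP@0.5:0.95 averages AP over ten IoU thresholds from 0.50 to 0.95 in steps of 0.05. The example uses axis-aligned boxes for simplicity (the same thresholds apply to mask IoU in segmentation), and the boxes and helper names are hypothetical. It does not implement the paper's improved ShapeIoU loss, whose formula is not given here.

```python
def box_iou(a, b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# The ten IoU thresholds used by mAP@0.5:0.95: 0.50, 0.55, ..., 0.95.
THRESHOLDS = [round(0.5 + 0.05 * i, 2) for i in range(10)]


def thresholds_passed(pred, gt):
    """Thresholds at which this prediction would count as a match."""
    iou = box_iou(pred, gt)
    return [t for t in THRESHOLDS if iou >= t]


# Hypothetical boxes: the prediction covers 72% of the ground truth.
gt_box = (0, 0, 10, 10)
pred_box = (0, 0, 9, 8)
iou = box_iou(pred_box, gt_box)
passed = thresholds_passed(pred_box, gt_box)
print(f"IoU = {iou:.2f}, matched at thresholds {passed}")
```

A prediction like this one counts as correct at the 0.5 threshold but fails the stricter ones, which is why mAP@0.5:0.95 (77.12%) is lower than mAP@0.5 (94.09%).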

Cite This Study

Huang et al. studied this question.

synapsesocial.com/papers/69e7138bcb99343efc98cfc1
https://doi.org/10.1038/s41598-026-49649-y
