What question did this study set out to answer?

The research aims to develop and evaluate multi-object detection models for identifying goat behavior in intensive farming systems.

March 28, 2026 · Open Access

Evaluation of multi-object detection models for automated goat behavior identification in intensive farming facilities

Key points

The research aims to develop and evaluate multi-object detection models for identifying goat behavior in intensive farming systems.
Developed a Multi-Object Detection model to classify goat behaviors into four categories: eating, standing, drinking, lying.
Recorded zenithal (overhead) videos under both light and dark conditions at an experimental goat farm, yielding 10,740 labeled annotations.
Trained, validated, and tested 13 models using Transformer- and CNN-based architectures, notably YOLO-based models.
The YOLOX-based model achieved the highest performance, with a mean Average Precision (mAP) @ 0.50 of 0.957.
Drinking was the hardest behavior to detect because of its visual similarity to standing, class imbalance, and fisheye distortion in the frame regions where drinkers are located.
Focal loss and image undistortion improved detection accuracy for drinking and reduced performance variability.
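
The mAP @ 0.50 score reported for YOLOX depends on an intersection-over-union (IoU) threshold of 0.50: a predicted box counts as a correct detection only when its class matches a ground-truth box and their IoU is at least 0.50. A minimal Python sketch of that matching test follows. The function names, the `(x1, y1, x2, y2)` box format, and the example coordinates are illustrative assumptions, not details taken from the study.

```python
from typing import Tuple

# Assumed box format: (x1, y1, x2, y2) in pixels, with x2 > x1 and y2 > y1.
Box = Tuple[float, float, float, float]

# The four behavior classes named in the study.
BEHAVIORS = ("eating", "standing", "drinking", "lying")


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred_box: Box, pred_label: str,
                     gt_box: Box, gt_label: str,
                     threshold: float = 0.50) -> bool:
    """Hypothetical helper: class must match and IoU must reach the threshold."""
    return pred_label == gt_label and iou(pred_box, gt_box) >= threshold


if __name__ == "__main__":
    ground_truth = (0.0, 0.0, 10.0, 10.0)
    close_box = (1.0, 0.0, 11.0, 10.0)    # overlaps most of the ground truth
    shifted_box = (5.0, 0.0, 15.0, 10.0)  # overlaps only half of it
    print(is_true_positive(close_box, "drinking", ground_truth, "drinking"))
    print(is_true_positive(shifted_box, "drinking", ground_truth, "drinking"))
    print(is_true_positive(close_box, "standing", ground_truth, "drinking"))
```

A full mAP computation additionally ranks detections by confidence, matches each ground-truth box at most once, and averages the per-class average precision over all four behaviors; this sketch covers only the per-box matching rule.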

Abstract

Computer vision offers significant potential for the continuous, stress-free, and cost-effective monitoring of animal behavior, yet its application in goat farming remains limited. In this study, a Multi-Object Detection (MOD) model was developed to classify goat behavior into four categories: eating, standing, drinking, and lying. Zenithal videos were recorded under both light and dark conditions on an experimental goat farm, resulting in 10,740 labelled annotations used to train, validate, and test 13 models leveraging Transformer- and CNN-based pretrained architectures. YOLO-based models (YOLOv8 and YOLOX) achieved the highest overall performance across both large and lightweight versions, demonstrating high detection capability and potential suitability for hardware-constrained scenarios. The YOLOX-based MOD model is preferred for goat behavior detection due to its superior classification accuracy, fast inference speed, and fully open-source license, which enables flexible customization, deployment, and reproducibility. Other models, particularly DAB-DETR and H-DINO, underperformed, especially in detecting drinking behavior. Drinking is the most challenging class because of its visual similarity to standing, class imbalance, and fisheye distortion effects in the frame regions where drinkers are located. Mitigation strategies, including focal loss and distortion correction, improved detection accuracy for this class and reduced performance variability. The developed MOD model can be deployed for continuous group-level monitoring of goats, paving the way for scalable and efficient solutions for advanced behavioral analyses. Future work will focus on integrating tracking algorithms for animal-level insights, as well as on evaluating model generalizability across different farming conditions and goat breeds.

• Computer vision applications for goat farming are still very limited in the literature.
• Various models for goat behavior detection were developed and tested.
• YOLOX achieved an mAP @ 0.50 of 0.957 with strong performance across the classes.
• Drinking detection is hard due to visual similarity, class imbalance, and distortion.
• Frame undistortion improves drinking detection performance and reduces variability.
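
The abstract names focal loss as one mitigation for the under-represented drinking class. As a rough illustration, the sketch below implements the standard focal loss formulation, FL(p_t) = -alpha * (1 - p_t)^gamma * log(p_t), for a single prediction. The alpha = 0.25 and gamma = 2.0 defaults are values commonly used with this loss, not settings reported in this summary, and the function names and example probabilities are hypothetical.

```python
import math


def cross_entropy(p_true: float) -> float:
    """Plain cross-entropy for the probability assigned to the true class."""
    return -math.log(p_true)


def focal_loss(p_true: float, alpha: float = 0.25, gamma: float = 2.0) -> float:
    """Focal loss for one prediction.

    p_true is the predicted probability of the correct class.
    alpha and gamma are assumed defaults, not the study's settings.
    """
    p_true = min(max(p_true, 1e-12), 1.0)  # guard against log(0)
    modulating = (1.0 - p_true) ** gamma   # shrinks the loss of easy examples
    return -alpha * modulating * math.log(p_true)


if __name__ == "__main__":
    easy = 0.9  # e.g. a confidently detected "lying" goat (illustrative value)
    hard = 0.1  # e.g. a "drinking" goat mistaken for "standing" (illustrative value)
    for name, p in (("easy", easy), ("hard", hard)):
        print(name, round(cross_entropy(p), 4), round(focal_loss(p), 6))
```

The `(1 - p_t)^gamma` factor scales down the loss of well-classified examples much more than that of misclassified ones, so hard and rare examples contribute relatively more to training. With `gamma = 0` and `alpha = 1` the expression reduces to ordinary cross-entropy.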


Cite This Study

Méndez et al. studied this question.

synapsesocial.com/papers/69c770418bbfbc51511e07d9 https://doi.org/10.1016/j.aiia.2026.03.006
