March 3, 2026Open Access

Utilizing Multimodal Logic Fusion to Identify the Types of Food Waste Sources

Key Points

The multimodal logic fusion method ensured robust classification accuracy amid different light conditions, preventing model failure.
The image recognition model achieved a remarkable accuracy of 99.46% within the optimal brightness range of 120-240 cd m<sup>-2</sup>.
While the audio model maintained an illumination-independent accuracy of 0.80, it served as a crucial backup when light intensity fell below optimal levels.
Validated on an independent test set, this innovative fusion method achieved an overall accuracy of 90.25%, promising for real-world applications.

Abstract

It is a challenge to identify food waste sources in all-weather industrial environments, as variable lighting conditions can compromise the effectiveness of visual recognition models. This study proposes and validates a robust, interpretable, and adaptive multimodal logic fusion method in which sensor dominance is dynamically assigned based on real-time illuminance intensity. The method comprises two foundational components: (1) a lightweight MobileNetV3 + EMA model for image recognition; and (2) an audio model employing Fast Fourier Transform (FFT) for feature extraction and Support Vector Machine (SVM) for classification. The key contribution of this system lies in its environment-aware conditional logic. The image model MobileNetV3 + EMA achieves an accuracy of 99.46% within the optimal brightness range (120-240 cd m-2), significantly outperforming the audio model. However, its performance degrades significantly outside the optimal range, while the audio model maintains an illumination-independent accuracy of 0.80, a recall of 0.78, and an F1 score of 0.80. When light intensity falls below the threshold of 84 cd m-2, the audio recognition results take precedence. This strategy ensures robust classification accuracy under variable environmental conditions, preventing model failure. Validated on an independent test set, the fusion method achieves an overall accuracy of 90.25%, providing an interpretable and resilient solution for real-world industrial deployment.

Bookmark

View Full Paper

Bookmark

View Full Paper

Utilizing Multimodal Logic Fusion to Identify the Types of Food Waste Sources

Key Points

Abstract

Cite This Study