March 3, 2026Open Access

Approches basées sur le dialogue pour une reconnaissance d'images transparente

Key Points

Dialogue-based image recognition enhances transparency in decision-making processes, addressing key flaws.
The approach employs frameworks focused on visual attributes and global similarities for better classification.
Agents engaged in dialogue combine visual data and external knowledge to enrich complex scene understanding.
Implementing human-machine dialogue facilitates model editing to correct unexpected AI behaviors, fostering trust.

Abstract

Despite their impressive performance, modern foundation models for image recognition still struggle with complex reasoning involving multiple sources of information and can exhibit unexpected behaviours such as spurious correlations or inaccurate predictions. Moreover, their decision processes remain opaque, limiting trust and interpretability. This thesis introduces dialogue-based image recognition to address these challenges, a novel paradigm for performing and understanding visual recognition through structured interactions between agents. Three frameworks have been developed. The first, inspired by argumentative dialogue, enables two agents to transparently deliberate over image classification using global prototype similarities and local visual attributes derived from foundation models. The second extends the dialogue-based approach to complex scene understanding, where agents combine visual information and a knowledge base to verify image descriptions. The third explores human-machine dialogue for model editing, allowing users to identify and correct unexpected behaviours. Together, these contributions establish the first exploration of dialogue as a foundation for explainable visual recognition, showing how it can tackle key challenges of opacity, complex reasoning, and unexpected behaviours in image recognition.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Dao Thauvin

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Approches basées sur le dialogue pour une reconnaissance d'images transparente

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study