March 3, 2026Open Access

Mobile Auslan: A multimodal dialogue-centered sign language learning system

Key Points

The system significantly improves dialogue support in sign language learning, enhancing user interaction and expressiveness.
Recognition performance achieves improved accuracy across varied signing viewpoints, accommodating diverse learner needs effectively.
Proposed through a combination of pose-based multi-view augmentation and multi-agent language modeling, the system increases semantic understanding.
While the technology advances learning experiences, it still faces limitations regarding continuous signing and vocabulary size, suggesting areas for future work.

Abstract

Learning sign language is not only a gateway to linguistic competence but also to cultural participation, self-expression, and meaningful interaction within the Deaf community. Recent advances in sign language education technologies have made notable progress in supporting vocabulary acquisition and sentence-level translation. However, dialogue, as one critical component of natural language use, remains largely absent from existing systems. We identify two underlying challenges that contribute to this gap: the lack of recognition robustness to viewpoint variation, which constrains expressive freedom during signing; and the limited semantic modeling capabilities needed to support discourse-level interpretation and interaction. To address these issues, we propose a learning-centered system that integrates pose-based multi-view augmentation and a multi-agent language modeling workflow. This system supports free-form input across diverse signing perspectives and provides structured feedback across word, sentence, and dialogue levels. Built as a modular and deployable platform, the system demonstrates strong recognition performance and learner engagement across varied input conditions. Through this integration of spatial robustness and semantic scaffolding, our work advances the design of sign language learning technologies toward more interactive, expressive, and pedagogically grounded experiences. However, the current system remains limited by its focus on successive (non-continuous) signing and by its moderate vocabulary and restricted pedagogical scope. Future work will therefore extend lexical coverage, incorporate more natural continuous signing, and investigate richer educational functions. • A View-invariant Pose-based Isolated Auslan Sign Recognition. • A Multi-agent Workflow Providing Transparent Dialogue Support. • An Interactive, Learner-paced Design for Auslan Learning System.

Read Full Paperexternally

Mark Helpful

Bookmark

Relay

View Full Paper