Assistive technologies play an essential role for people with visual impairment to encourage independence and enhance the quality of life. Despite the artificial intelligence (AI) advancements, the current smart assistive systems still have potential limitations that restrict their usefulness in practical settings. Assistive technologies are ineffective in delivering real-time, context-aware environmental understanding because of inadequate integration of visual, tactile, and linguistic cues. These challenges hinder cognitive awareness, delay feedback generation, and limit deployment on lightweight platforms. Notably, large complex models require high computation, which potentially impacts their usability, affordability, and processing speed. The primary objective of this study is to convey essential visual-tactile information by a short descriptive keyword, not to describe the entire scene, to support the visually impaired in understanding material properties quickly. This study designs a lightweight transformer decoder-based text generation (TDTG) model that fuses tactile and visual signals for texture text generation, enabling accurate texture recognition without the need for large models by generating a short descriptive keyword. This short, precise output is much simpler to interpret through audio feedback, thereby enhancing usability and reducing cognitive burden. In order to mitigate data imbalance and enhance generalization, a class-specific deep convolutional generative adversarial network augments underrepresented texture categories. The TDTG framework evaluation on the touch-vision-language (TVL) dataset for generation and classification capabilities with existing models. It attains a superior balance between contextual text quality, lightweight architecture, and multimodal adaptability, providing a practical and scalable solution for real-time assistive interaction and environmental awareness enhancement for the visually impaired.
Building similarity graph...
Analyzing shared references across papers
Loading...
Raniyah Wazirali
Complex & Intelligent Systems
SHILAP Revista de lepidopterología
Saudi Electronic University
Building similarity graph...
Analyzing shared references across papers
Loading...
Raniyah Wazirali (Fri,) studied this question.
www.synapsesocial.com/papers/69a75e8bc6e9836116a293c6 — DOI: https://doi.org/10.1007/s40747-026-02232-4