Introduction Recent advances in neural networks have introduced a new paradigm for robotic inverse kinematics. However, existing methods remain limited by insufficient feature extraction and suboptimal integration of multi-source information, preventing them from achieving high accuracy, broad generalization, and real-time performance on robots with diverse and complex kinematic structures. Methods In this work, we propose HarmoAtt-IK, an adaptive multimodal neural inverse kinematics approach designed for real-time inference and zero-collection training. Built upon the CycleIK framework, the proposed method introduces a novel adaptive multimodal attention fusion mechanism (HarmoAtt) that dynamically integrates the complementary strengths of spatial, channel, and cross-dimensional attention. It employs a temperature-adaptive Softmax function coupled with a compact weight-generation network to perform multidimensional extraction and adaptive enhancement of input features. We further introduce a composite loss function integrating an improved Smooth-L1 loss, a sign-invariant quaternion loss, and a Shannon entropy regularizer to enhance training stability and overall accuracy. Leveraging forward differential kinematics, our method enables rapid, cross-platform deployment by generating training data solely from URDF models, eliminating the need for costly physical data collection and manual annotation. Results Experimental evaluations on five humanoid platforms exhibiting substantial kinematic diversity demonstrate that HarmoAtt-IK attains maximum reductions of 76.4% in terminal positional error and 55.1% in rotational error relative to the baseline, while consistently improving the model’s inference success rate across all tested platforms by up to 5.76 percentage points. Discussion These results indicate that the proposed HarmoAtt-IK significantly outperforms baseline methods in both accuracy and reliability across diverse kinematic structures, highlighting the effectiveness of the adaptive multimodal attention mechanism and composite loss design. This further supports its potential for scalable, real-time deployment on a wide range of robotic platforms.
Building similarity graph...
Analyzing shared references across papers
Loading...
Xukun Liu
Xie Fengjuan
Yu Liu
SHILAP Revista de lepidopterología
Frontiers in Neurorobotics
Northwest Institute of Mechanical and Electrical Engineering
Building similarity graph...
Analyzing shared references across papers
Loading...
Liu et al. (Fri,) studied this question.
www.synapsesocial.com/papers/69f04d9f727298f751e71dfc — DOI: https://doi.org/10.3389/fnbot.2026.1769924