What question did this study set out to answer?

This research aims to improve human activity recognition using mmWave radar by addressing challenges in sparsity and noise through a novel model.

April 10, 2026 (Open Access)

Tac-Mamba: A Pose-Guided Cross-Modal State Space Model with Trust-Aware Gating for mmWave Radar Human Activity Recognition

Key Points

Developed Tac-Mamba, a cross-modal state space model.
Introduced a topology-guided distillation scheme to extract structural priors from visual skeletons.
Implemented a Trust-Aware Cross-Modal Attention (TACMA) module to enhance feature reliability.
Utilized a Lightweight Temporal Mamba Block (LTMB) for capturing long-range dependencies.
Achieved competitive accuracies of 95.37% for multimodal and 87.54% for radar-only recognition.
Used only 0.86M parameters, keeping the model lightweight enough for edge devices.
Maintained low inference latency of 1.89 ms, enhancing real-time capabilities.

Abstract

Millimeter-wave (mmWave) radar point clouds offer a privacy-preserving solution for Human Activity Recognition (HAR), but their inherent sparsity and noise limit single-modal performance. While multimodal fusion mitigates this issue, existing methods often suffer from severe negative transfer during visual degradation and incur computational costs too high for edge devices. To address these challenges, we propose Tac-Mamba, a lightweight cross-modal state space model. First, we introduce a topology-guided distillation scheme that uses a Spatial Mamba teacher to extract structural priors from visual skeletons. These priors are then explicitly distilled into a Point Transformer v3 (PTv3) radar student with a modality dropout strategy. We also develop a Trust-Aware Cross-Modal Attention (TACMA) module to prevent negative transfer. It evaluates the reliability of visual features through a SiLU-activated cross-modal bilinear interaction, smoothly degrading to a pure radar-driven fallback projection when visual inputs are corrupted. Finally, a Lightweight Temporal Mamba Block (LTMB) with a Zero-Parameter Cross-Gating (ZPCG) mechanism captures long-range kinematic dependencies with linear complexity. Experiments on the public MM-Fi dataset under strict cross-environment protocols demonstrate that Tac-Mamba achieves competitive accuracies of 95.37% (multimodal) and 87.54% (radar-only) with only 0.86M parameters and 1.89 ms inference latency. These results highlight the model’s exceptional robustness to modality missingness and its feasibility for edge deployment.
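
To make the trust-gating idea concrete, here is a minimal sketch in plain Python. It is not the authors' TACMA implementation, and the abstract does not specify the module's internals. The function names, the weight matrix `W`, the `bias` term, the averaging used as the "fused" feature, and the identity fallback are all hypothetical choices made for illustration. The sketch only shows the general pattern described above: a SiLU-activated bilinear score between radar and visual features is squashed into a trust weight, and the output blends toward a radar-only fallback as trust falls.

```python
import math


def silu(x):
    """SiLU activation: x * sigmoid(x)."""
    return x / (1.0 + math.exp(-x))


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def bilinear(radar, W, visual):
    """Bilinear interaction radar^T W visual for plain Python lists."""
    return sum(
        radar[i] * W[i][j] * visual[j]
        for i in range(len(radar))
        for j in range(len(visual))
    )


def trust_gated_fusion(radar, visual, W, bias=-6.0):
    """Blend a fused feature with a radar-only fallback (illustrative only).

    Assumptions, not taken from the paper:
    - trust = sigmoid(silu(radar^T W visual) + bias)
    - the fused feature is the element-wise mean of both modalities
    - the fallback projection is the identity on the radar feature
    """
    trust = sigmoid(silu(bilinear(radar, W, visual)) + bias)
    fused = [0.5 * (r + v) for r, v in zip(radar, visual)]
    fallback = list(radar)
    output = [trust * f + (1.0 - trust) * b for f, b in zip(fused, fallback)]
    return trust, output


# Hypothetical 2-D features and weights.
W = [[2.0, 0.0], [0.0, 2.0]]
radar = [1.0, 2.0]

# Visual feature that agrees with radar: the trust weight is high.
trust_ok, out_ok = trust_gated_fusion(radar, [1.0, 2.0], W)

# Visual feature zeroed out (e.g. dropped modality): the trust weight is
# low and the output stays close to the radar feature.
trust_missing, out_missing = trust_gated_fusion(radar, [0.0, 0.0], W)
```

With a zeroed visual input the bilinear score is 0, so the trust weight is set by the bias alone. A strongly negative bias therefore makes the radar-only fallback the default behavior, which is one simple way to get the smooth degradation the abstract describes.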

Authors

Haiyi Wu

K. K. Zhao

Wei Yao

Journals

Electronics

Institutions

University of Chinese Academy of Sciences

Shanghai Institute of Microsystem and Information Technology
