July 11, 2024Open Access

VideoMamba: Spatio-Temporal Selective State Space Model

Key Points

Key points are not available for this paper at this time.

Abstract

We introduce VideoMamba, a novel adaptation of the pure Mamba architecture, specifically designed for video recognition. Unlike transformers that rely on self-attention mechanisms leading to high computational costs by quadratic complexity, VideoMamba leverages Mamba's linear complexity and selective SSM mechanism for more efficient processing. The proposed Spatio-Temporal Forward and Backward SSM allows the model to effectively capture the complex relationship between non-sequential spatial and sequential temporal information in video. Consequently, VideoMamba is not only resource-efficient but also effective in capturing long-range dependency in videos, demonstrated by competitive performance and outstanding efficiency on a variety of video understanding benchmarks. Our work highlights the potential of VideoMamba as a powerful tool for video understanding, offering a simple yet effective baseline for future research in video analysis.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Park et al. (Thu,) studied this question.

www.synapsesocial.com/papers/68e60ad1b6db64358759e3bf — DOI: https://doi.org/10.48550/arxiv.2407.08476

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

VideoMamba: State Space Model for Efficient Video Understanding· 2024 · 2 citations
Snakes and Ladders: Two Steps Up for VideoMamba· 2024
ETMamba: An Effective Temporal Model for Video Action Recognition· 2026
Vision Mamba: A Comprehensive Survey and Taxonomy· 2025 · 55 citations
EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction· 2025

Authors

Jinyoung Park

Hee-Seon Kim

Kangwook Ko

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

VideoMamba: Spatio-Temporal Selective State Space Model

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion