What type of study is this?

This is a Quantitative Study study (also classified as: Experimental Study).

October 19, 2025Open Access

AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation

Key Points

AnyPos, combined with ATARA, improves bimanual manipulation tasks, enhancing scalability and efficiency.
The framework shows a 51% improvement in test accuracy, significantly outperforming traditional methods.
Utilizing an inverse dynamics model and automated validation, AnyPos accelerates data collection over 30 times.
Experiments reveal higher success rates in tasks like lifting and pick-and-place, indicating strong real-world applicability.

Abstract

Vision-language-action (VLA) models have shown promise on task-conditioned control in complex settings such as bimanual manipulation. However, the heavy reliance on task-specific human demonstrations limits their generalization and incurs high data acquisition costs. In this work, we present a new notion of task-agnostic action paradigm that decouples action execution from task-specific conditioning, enhancing scalability, efficiency, and cost-effectiveness. To address the data collection challenges posed by this paradigm -- such as low coverage density, behavioral redundancy, and safety risks -- we introduce ATARA (Automated Task-Agnostic Random Actions), a scalable self-supervised framework that accelerates collection by over 30 compared to human teleoperation. To further enable effective learning from task-agnostic data, which often suffers from distribution mismatch and irrelevant trajectories, we propose AnyPos, an inverse dynamics model equipped with Arm-Decoupled Estimation and a Direction-Aware Decoder (DAD). We additionally integrate a video-conditioned action validation module to verify the feasibility of learned policies across diverse manipulation tasks. Extensive experiments show that the AnyPos-ATARA pipeline yields a 51% improvement in test accuracy and achieves 30-40% higher success rates in downstream tasks such as lifting, pick-and-place, and clicking, using replay-based video validation. Project Page: https: //embodiedfoundation. github. io/vidarₐnypos

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Tan et al. (Thu,) studied this question.

www.synapsesocial.com/papers/68f4b10d3d9d770bbc696fe8 — DOI: https://doi.org/10.48550/arxiv.2507.12768

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Authors

Hengkai Tan

Feng Yao

Xinyi Mao

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion