March 3, 2026Open Access

HyT-Pose: Accurate Pose Estimation by Iteratively Fusing Global and Local Context

Key Points

HyT-Pose achieves significant enhancements in human pose estimation accuracy, addressing high-frequency detail recovery.
It reaches a notable 76.3 AP score on the COCO validation set, highlighting its effectiveness in standardized tests.
Analysis employs a hybrid architecture, integrating iterative refinement and a learnable linear upsampling mechanism.
This approach may revolutionize how pose estimation technologies are developed, emphasizing global structural context.

Abstract

High-resolution feature recovery is pivotal for accurate Human Pose Estimation (HPE), yet prevailing decoders often rely on local, fixed upsampling operators (e.g., deconvolution) that fail to capture the global structural context of human poses. This limitation inevitably leads to the loss of high-frequency details and the generation of artifacts. To bridge this gap, we present HyT-Pose, a novel hybrid architecture that recasts pose decoding as an iterative, structure-aware super-resolution task. Distinct from traditional reconstruction paradigms, HyT-Pose introduces a strictly synchronized ‘‘Enhance-Amplify-Refine’’ framework. Specifically, we propose a Learnable Linear Upsampling (LLU) mechanism that leverages global receptive fields to adaptively ‘‘infer’’ missing spatial details rather than merely interpolating them. This mechanism is synergized with Transformer-based global context enhancement and a Multi-scale Dynamic Refinement unit (FBlock) to progressively purify feature representations. Extensive experiments on COCO, MPII, and CrowdPose benchmarks demonstrate that HyT-Pose significantly outperforms state-of-the-art methods. Notably, it achieves 76.3 AP on COCO val, establishing a new paradigm for high-precision and efficient pose estimation.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Yunxiang Liu

Jiakai Pan

Journals

SHILAP Revista de lepidopterología

IEEE Access

Actions

Institutions

Shanghai Institute of Technology

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

HyT-Pose: Accurate Pose Estimation by Iteratively Fusing Global and Local Context

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study