Reasoning models have demonstrated remarkable progress in solving complex and logic-intensive tasks by generating extended Chain-of-Thoughts (CoTs) prior to arriving at a final answer. Yet, the emergence of this "slow-thinking" paradigm, with numerous tokens generated in sequence, inevitably introduces substantial computational overhead. To this end, it highlights an urgent need for effective acceleration. This survey aims to provide a comprehensive overview of recent advances in efficient reasoning. It categorizes existing works into three key directions: (1) shorter - compressing lengthy CoTs into concise yet effective reasoning chains; (2) smaller - developing compact language models with strong reasoning capabilities through techniques such as knowledge distillation, other model compression techniques, and reinforcement learning; and (3) faster - designing efficient decoding strategies to accelerate inference of reasoning models. A curated collection of papers discussed in this survey is available in our GitHub repository: https://github.com/fscdc/Awesome-Efficient-Reasoning-Models.
Building similarity graph...
Analyzing shared references across papers
Loading...
Feng et al. (Tue,) studied this question.
www.synapsesocial.com/papers/68f6196ee0bbbc94fac361b6 — DOI: https://doi.org/10.48550/arxiv.2504.10903
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:
Sicheng Feng
Gongfan Fang
Xinyin Ma
Building similarity graph...
Analyzing shared references across papers
Loading...