September 24, 2025 (Open Access)

Uncertainty Under the Curve: A Sequence-Level Entropy Area Metric for Reasoning LLM

Key Points

Entropy Area Score (EAS) quantifies uncertainty in the answer generation process of reasoning LLMs, without external models or repeated sampling.
EAS correlates strongly with answer entropy across models and datasets.
In training data selection, EAS identifies high-potential samples and outperforms Pass Rate filtering under equal sample budgets, improving student model accuracy on math benchmarks.
EAS is efficient and interpretable, making it a practical tool for uncertainty modeling and data quality assessment in LLM training.

Abstract

In this work, we introduce Entropy Area Score (EAS), a simple yet effective metric to quantify uncertainty in the answer generation process of reasoning large language models (LLMs). EAS requires neither external models nor repeated sampling; instead, it integrates token-level predictive entropy from the model itself to capture the evolution of uncertainty during generation. Empirical results show that EAS is strongly correlated with answer entropy across models and datasets. In training data selection, EAS identifies high-potential samples and consistently outperforms Pass Rate filtering under equal sample budgets, improving student model accuracy on math benchmarks. EAS is both efficient and interpretable, offering a practical tool for uncertainty modeling and data quality assessment in LLM training.
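To make the idea of integrating token-level entropy concrete, here is a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact formula, so this assumes the "area" is a plain sum of per-token Shannon entropies (a unit-width step per generated token) with no normalization. The function names `token_entropy`, `entropy_area`, and `answer_entropy` are hypothetical, and `answer_entropy` assumes the comparison quantity is the entropy of the empirical distribution of final answers over repeated samples.

```python
import math
from collections import Counter
from typing import Sequence


def token_entropy(probs: Sequence[float]) -> float:
    """Shannon entropy (in nats) of one next-token distribution."""
    return sum(-p * math.log(p) for p in probs if p > 0.0)


def entropy_area(token_dists: Sequence[Sequence[float]]) -> float:
    """Area under the per-token entropy curve of one generation.

    Assumption: unit-width steps, i.e. a plain sum of token entropies.
    The paper's integration rule and normalization may differ.
    """
    return sum(token_entropy(dist) for dist in token_dists)


def answer_entropy(answers: Sequence[str]) -> float:
    """Entropy of the empirical distribution of sampled final answers.

    Assumption: this is the sampling-based quantity EAS is compared to.
    """
    n = len(answers)
    counts = Counter(answers)
    return sum(-(c / n) * math.log(c / n) for c in counts.values())


# Toy next-token distributions over a 4-token vocabulary, 3 steps each.
confident = [[0.97, 0.01, 0.01, 0.01]] * 3
uncertain = [[0.25, 0.25, 0.25, 0.25]] * 3
print(entropy_area(confident) < entropy_area(uncertain))  # True
```

Under these assumptions, the area needs only the distributions from a single forward generation, whereas `answer_entropy` needs several sampled completions per question. That difference is the efficiency argument the abstract makes.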

Authors

Yong Zhu

Lin Sun

Gang Zhao
