What does this research mean for the field?

The SA-Cost model, which uses an attention mechanism to analyze semantic correlations among scheduling primitives, improves prediction accuracy and achieves significant execution speedups over existing models like Ansor and AMOS for tensor program tuning on GPUs. Novelty: ClaimNovelty.METHODOLOGICAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

The study aims to develop a novel cost model, SA‐Cost, to enhance automatic scheduling optimization for tensor programs in deep learning compilers.

April 21, 2026

SA‐Cost: Structure‐Aware Cost Model With Attention for Tensor Program Tuning

Key Points

The study aims to develop a novel cost model, SA‐Cost, to enhance automatic scheduling optimization for tensor programs in deep learning compilers.
Reformulated feature extraction to analyze inter-relationships and semantic correlations among scheduling primitives.
Incorporated an attention mechanism to identify key scheduling decisions.
Evaluated performance based on decision space distribution, primitive significance, hardware awareness, and resource utilization efficiency.
SA‐Cost surpasses existing models, achieving an average speedup over Ansor and AMOS on GPU platforms.

Abstract

ABSTRACT Automatic scheduling optimization is a critical technology for improving the performance of deep learning compilers (DLCs). As neural network models grow in scale and hardware platforms diversify, the automatic generation of high‐performance tensor programs has become a focal point of research. A robust cost model is essential for automatic scheduling, as it enables the selection of the most optimal scheduling scheme from a multitude of possibilities. This paper introduces SA‐Cost, a novel cost model for DLCs. Unlike conventional approaches that treat tensor programs as a whole during feature extraction, our method reformulates feature extraction as an analysis of the inter‐relationships and semantic correlations among scheduling primitives. By capturing the semantic correlations among scheduling decisions via an attention mechanism, we construct a more precise evaluation model. The model performs a comprehensive assessment across four dimensions: decision space distribution, primitive significance, hardware awareness, and resource utilization efficiency. Furthermore, we incorporate an attention mechanism to identify key scheduling decisions and their contributions, improving the cost model's prediction accuracy. Experimental results demonstrate that on the GPU platform, SA‐Cost achieves an average speedup of over Ansor and over AMOS.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Junguo Liao

Zenghua Cheng

Yonghua Hu

Journals

Concurrency and Computation Practice and Experience

Actions

Institutions

Hunan University of Science and Technology

Galaxy Biotech (United States)

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

SA‐Cost: Structure‐Aware Cost Model With Attention for Tensor Program Tuning

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study