What question did this study set out to answer?

This research aims to enhance machine translation efficiency while maintaining accuracy in low-cost environments.

April 27, 2026 · Open Access

Gated transformer with shallow decoder for machine translation

Key points

Introduced the Gated Transformer with Shallow Decoder (GTSD) for translation tasks.
Employed a gating mechanism to fuse attention and the feed-forward network (FFN).
Evaluated the model on the WMT14 en-de and en-fr datasets.
Achieved efficiency improvements with minimal loss in translation accuracy.
Demonstrated a strong efficiency–performance trade-off compared with traditional models.

Abstract

Large language models (LLMs) have substantially advanced multilingual translation, yet their computational cost limits deployment in resource-constrained and latency-critical settings, where traditional Transformer-based neural machine translation (NMT) systems remain valuable. Among efforts to make the Transformer more efficient, linearization techniques that reduce its time complexity to O(N) show advantages only for very long texts. To address this gap, we propose a lightweight architecture, the Gated Transformer with Shallow Decoder (GTSD), designed specifically for low-cost, short-text translation. The method uses a gating mechanism to fuse attention and the feed-forward network (FFN), optimizes the cross-attention that becomes redundant when multi-head attention is converted to single-head, and adopts a deep-encoder, shallow-decoder architecture. It also supports simple cost reduction with minimal loss in accuracy. Experiments on the WMT14 en-de and WMT14 en-fr datasets demonstrate that the proposed approach attains a strong efficiency–performance trade-off.
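The abstract's remark that O(N) linearization pays off only for very long texts can be illustrated with a rough operation count. The sketch below is not taken from the paper. It assumes the common simplified model in which softmax attention costs about 2·N²·d multiply-accumulates per head (scores, then the weighted sum) and kernel-style linear attention costs about 2·N·d² (a d×d summary, then applying it). Projections, softmax, and feature maps are ignored. The function names and the width d = 512 are hypothetical choices for illustration.

```python
def softmax_attention_macs(n: int, d: int) -> int:
    """Multiply-accumulates for Q @ K.T (n*n*d) plus weights @ V (n*n*d)."""
    return 2 * n * n * d


def linear_attention_macs(n: int, d: int) -> int:
    """Multiply-accumulates for K.T @ V (n*d*d) plus Q @ (K.T @ V) (n*d*d)."""
    return 2 * n * d * d


def cost_ratio(n: int, d: int) -> float:
    """Quadratic cost divided by linear cost; algebraically this is n / d."""
    return softmax_attention_macs(n, d) / linear_attention_macs(n, d)


if __name__ == "__main__":
    d = 512  # illustrative width, an assumption rather than a value from the paper
    for n in (32, 128, 512, 4096):
        print(f"n={n:5d}  quadratic/linear cost ratio = {cost_ratio(n, d):.4f}")
```

Under these assumptions the ratio is simply N/d, so the linear form is cheaper only once the sequence is longer than the attention width. For sentence-length inputs of a few dozen tokens the quadratic form needs fewer operations, which is consistent with the motivation for targeting short-text translation with a different kind of efficiency gain.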

Cite this study

Li et al. (2026) studied this question.

www.synapsesocial.com/papers/69eefc6dfede9185760d36fa — DOI: https://doi.org/10.1038/s41598-026-49583-z

Authors

Fangfang Li

Fengjing Yin

Shirui Deng

Journals

Scientific Reports

Institutions

Central South University

National University of Defense Technology

Changsha University
