Amid globalisation and growing cross-language information needs, machine translation is crucial for overcoming language barriers.Deep learning has advanced it, but transformer faces limitations: insufficient efficiency in capturing long-range dependencies and poor performance in low-resource translation.To address these, this study proposes three core solutions: 1) a hybrid LSTM-transformer architecture fusing LSTM's gating mechanism (long-sequence modelling) and transformer's self-attention (global context capture); 2) an adaptive gradient clipping (AGC) strategy for training stability; 3) dynamic weight sharing with adversarial domain adaptation to enhance cross-language transfer.Experiments on WMT14 English-German/French corpora show the model's BLEU value is 2.8 higher than benchmark Transformer, with 18% faster convergence; in English Romanian low-resource scenarios, the transfer mechanism boosts BLEU by 5.3.This study validates the hybrid architecture and optimisation strategies, offering new ideas for efficient gradient optimisation and low-resource translation models.
Building similarity graph...
Analyzing shared references across papers
Loading...
Meizhen Zou (Thu,) studied this question.
www.synapsesocial.com/papers/69c772058bbfbc51511e228e — DOI: https://doi.org/10.1504/ijict.2026.152530
Meizhen Zou
International Journal of Information and Communication Technology
ZheJiang Academy of Agricultural Sciences
Shaoxing University
Building similarity graph...
Analyzing shared references across papers
Loading...