What question did this study set out to answer?

The aim is to enhance the neural machine translation framework by addressing limitations of fixed-length vectors in encoder-decoder models.

September 1, 2014Open Access

Neural Machine Translation by Jointly Learning to Align and Translate

Key Points

The aim is to enhance the neural machine translation framework by addressing limitations of fixed-length vectors in encoder-decoder models.
Proposes a new model that automates the alignment of source sentences to target words.
Utilizes an encoder-decoder architecture to process translations.
Implements soft-search techniques for relevant sentence parts.
Achieved translation performance comparable to state-of-the-art phrase-based systems.
Qualitative analysis indicates strong agreement between soft-alignments and intuitive expectations.

Abstract

Neural machine translation is a recently proposed approach to machine translation. Unlike the traditional statistical machine translation, the neural machine translation aims at building a single neural network that can be jointly tuned to maximize the translation performance. The models proposed recently for neural machine translation often belong to a family of encoder-decoders and consists of an encoder that encodes a source sentence into a fixed-length vector from which a decoder generates a translation. In this paper, we conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and propose to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly. With this new approach, we achieve a translation performance comparable to the existing state-of-the-art phrase-based system on the task of English-to-French translation. Furthermore, qualitative analysis reveals that the (soft-)alignments found by the model agree well with our intuition.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Dzmitry Bahdanau (Mon,) studied this question.

www.synapsesocial.com/papers/696402a6f797a36a6d30d8c4 — DOI: https://doi.org/10.48550/arxiv.1409.0473

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

On the Properties of Neural Machine Translation: Encoder-Decoder Approaches· 2014 · 1,121 citations
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation· 2014 · 3,360 citations
Sequence to Sequence Learning with Neural Networks· 2014 · 3,524 citations
Statistical phrase-based translation· 2003 · 3,278 citations
Domain Adaptation via Pseudo In-Domain Data Selection· 2011 · 493 citations

Neural Machine Translation by Jointly Learning to Align and Translate

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion