Google-Neural-Machine-Translation-GNMT-

It is a tensorflow implementation of GNMT published by google

Keyword: Machine Translation

abstract

Neural Machine Translation (NMT) is an end-to-end learning approach for automated translation, with the potential to overcome many of the weaknesses of conventional phrase-based translation systems. Unfortunately, NMT systems are known to be computationally expensive both in training and in translation inference. Also, most NMT systems have difficulty with rare words. These issues have hindered NMT's use in practical deployments and services, where both accuracy and speed are essential. In this work, we present GNMT, Google's Neural Machine Translation system, which attempts to address many of these issues. Our model consists of a deep LSTM network with 8 encoder and 8 decoder layers using attention and residual connections. To improve parallelism and therefore decrease training time, our attention mechanism connects the bottom layer of the decoder to the top layer of the encoder. To accelerate the final translation speed, we employ low-precision arithmetic during inference computations. To improve handling of rare words, we divide words into a limited set of common sub-word units ("wordpieces") for both input and output. This method provides a good balance between the flexibility of "character"-delimited models and the efficiency of "word"-delimited models, naturally handles translation of rare words, and ultimately improves the overall accuracy of the system. Our beam search technique employs a length-normalization procedure and uses a coverage penalty, which encourages generation of an output sentence that is most likely to cover all the words in the source sentence. On the WMT'14 English-to-French and English-to-German benchmarks, GNMT achieves competitive results to state-of-the-art. Using a human side-by-side evaluation on a set of isolated simple sentences, it reduces translation errors by an average of 60% compared to Google's phrase-based production system.

What is implement in this tensorflow project is

bidirection lstm
stacked residual lstm
attention model

the structure is something like this:

What will be implemented

subword and one-to-many will be implemented in the future

References

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
BUILD		BUILD
README.md		README.md
Stack_Residual_RNNCell.py		Stack_Residual_RNNCell.py
__init__.py		__init__.py
data_utils.py		data_utils.py
demo.gif		demo.gif
linear_modern.py		linear_modern.py
model.png		model.png
seq2seq_for_MT.py		seq2seq_for_MT.py
seq2seq_model.py		seq2seq_model.py
translate.py		translate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Google-Neural-Machine-Translation-GNMT-

abstract

What is implement in this tensorflow project is

What will be implemented

References

About

Releases

Packages

Languages

belvo/Google-Neural-Machine-Translation-GNMT-

Folders and files

Latest commit

History

Repository files navigation

Google-Neural-Machine-Translation-GNMT-

abstract

What is implement in this tensorflow project is

What will be implemented

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages