simultaneous-nmt

Simultaneous neural machine translation that uses prediction on the source side. This code is a Theano implementation of EMNLP18 paper Prediction Improves Simultaneous Neural Machine Translation. Our implementations are based on dl4mt-simul-trans repository developed by Gu et al.

Dataset

We have used WMT'15 corpora as our dataset for pretraining and training our agent parameters.
Newstest 2013 for validation and testset.
The data should be tokenized and Byte Pair Encoded.

Pretraining

The first step of training the model starts with pretraining Environment. The parameters of the uni-directional LSTM can be changed using the function pretrain_config() in config.py. After setting up configuration, pretrain can be started:

$ export THEANO_FLAGS=device=gpu,floatX=float32
$ python pretrain_uni.py

Training the Agent

Like pretraining the settings of the Model can be configured in config.py. Then training the Agent can be started using sh run_train.sh.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
prediction		prediction
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
actors.py		actors.py
bleu.py		bleu.py
config.py		config.py
config_previous.py		config_previous.py
data_iterator.py		data_iterator.py
insepection.py		insepection.py
itchat.pkl		itchat.pkl
layers.py		layers.py
mteval.sh		mteval.sh
nmt_uni.py		nmt_uni.py
optimizer.py		optimizer.py
plot_heatmap.ipynb		plot_heatmap.ipynb
policy.py		policy.py
pretrain_uni.py		pretrain_uni.py
reward.py		reward.py
run_eval.sh		run_eval.sh
run_train.sh		run_train.sh
show_progress.ipynb		show_progress.ipynb
simultrans_beam.py		simultrans_beam.py
simultrans_eval.py		simultrans_eval.py
simultrans_model.py		simultrans_model.py
simultrans_model_clean.py		simultrans_model_clean.py
simultrans_model_clean_unchanged.py		simultrans_model_clean_unchanged.py
simultrans_train.py		simultrans_train.py
translate.py		translate.py
translate.sh		translate.sh
translate_uni.py		translate_uni.py
translate_uni.sh		translate_uni.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

simultaneous-nmt

Dataset

Pretraining

Training the Agent

About

Releases

Packages

Languages

License

ashkanalinejad/Real-time-translator

Folders and files

Latest commit

History

Repository files navigation

simultaneous-nmt

Dataset

Pretraining

Training the Agent

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages