We introduce MADLAD-400, a manually audited, general domain 3T token
mon...
Data scarcity is a crucial issue for the development of highly multiling...
Neural machine translation (NMT) has progressed rapidly over the past se...
In this paper we share findings from our effort to build practical machi...
Achieving universal translation between all human language pairs is the
...
With the success of large-scale pre-training and multilingual modeling i...
Large text corpora are increasingly important for a wide variety of Natu...
The quality of automatic metrics for machine translation has been
increa...
Machine translation has an undesirable propensity to produce "translatio...
Multilingual Neural Machine Translation (NMT) models have yielded large
...
Existing curriculum learning research in neural machine translation (NMT...
Recent work in Neural Machine Translation (NMT) has shown significant qu...
Noise and domain are important aspects of data quality for neural machin...
In this work, we train a text repair model as a post-processor for Neura...
Lingvo is a Tensorflow framework offering a complete solution for
collab...