Parameter sharing has proven to be a parameter-efficient approach. Previ...
Deploying NMT models on mobile devices is essential for privacy, low lat...
For years the model performance in machine learning obeyed a power-law
r...
This paper describes the submissions of the NiuTrans Team to the WNGT 20...
This paper describes the NiuTrans system for the WMT21 translation effic...
Improving Transformer efficiency has become increasingly attractive rece...
The large attention-based encoder-decoder network (Transformer) has beco...
Unsupervised Bilingual Dictionary Induction methods based on the
initial...
Knowledge distillation has been proven to be effective in model accelera...
8-bit integer inference, as a promising direction in reducing both the
l...
In this paper, we report our recent practice at Tencent for user modelin...
Nowadays, news apps have taken over the popularity of paper-based media,...
Existing approaches have been proposed to tackle unsupervised image-to-i...