In MT evaluation, pairwise comparisons are conducted to identify the bet...
Estimating the expected output quality of generation systems is central ...
Natural language generation (NLG) has received increasing attention, whi...
Despite advances in open-domain dialogue systems, automatic evaluation o...
Sequence to sequence (seq2seq) models are often employed in settings whe...