MT-Adapted Datasheets for Datasets: Template and Repository

05/27/2020
by   Marta R. Costa-Jussà, et al.
0

In this report we are taking the standardized model proposed by Gebru et al. (2018) for documenting the popular machine translation datasets of the EuroParl (Koehn, 2005) and News-Commentary (Barrault et al., 2019). Within this documentation process, we have adapted the original datasheet to the particular case of data consumers within the Machine Translation area. We are also proposing a repository for collecting the adapted datasheets in this research area

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset