IndicBART: A Pre-trained Model for Natural Language Generation of Indic Languages

09/07/2021
by Raj Dabre, et al.

In this paper we present IndicBART, a multilingual, sequence-to-sequence pre-trained model focusing on 11 Indic languages and English. Unlike existing pre-trained models, IndicBART exploits the orthographic similarity between Indic scripts to improve transfer learning between similar Indic languages. We evaluate IndicBART on two NLG tasks: Neural Machine Translation (NMT) and extreme summarization. Our experiments on NMT for 12 language pairs and extreme summarization for 7 languages, both using multilingual fine-tuning, show that IndicBART is competitive with or better than mBART50 despite containing significantly fewer parameters. Our analyses focus on identifying the impact of script unification (to Devanagari), corpus size, and multilingualism on the final performance. The IndicBART model is available under the MIT license at https://indicnlp.ai4bharat.org/indic-bart.
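
The script unification mentioned above can be illustrated with a minimal sketch. It assumes that the Unicode blocks of the major Indic scripts are laid out in parallel, so a character can be moved to Devanagari by shifting its code point by the distance between block starts. The function name `to_devanagari` and the table `SCRIPT_BLOCK_STARTS` are hypothetical names chosen for this illustration; this is not the authors' implementation, and the mapping is only approximate because some code points have no counterpart in Devanagari.

```python
# Minimal sketch of script unification by Unicode offset (illustrative only).
# Assumption: each Indic script block spans 0x80 code points and is laid out
# in parallel with the Devanagari block, which starts at U+0900.

DEVANAGARI_START = 0x0900
BLOCK_SIZE = 0x80

# Start of each script's Unicode block (hypothetical table name).
SCRIPT_BLOCK_STARTS = {
    "bengali": 0x0980,
    "gurmukhi": 0x0A00,
    "gujarati": 0x0A80,
    "oriya": 0x0B00,
    "tamil": 0x0B80,
    "telugu": 0x0C00,
    "kannada": 0x0C80,
    "malayalam": 0x0D00,
}


def to_devanagari(text: str) -> str:
    """Shift characters from other Indic blocks into the Devanagari block.

    Characters outside the listed blocks (Latin letters, digits,
    punctuation, Devanagari itself) are returned unchanged.
    """
    out = []
    for ch in text:
        cp = ord(ch)
        for start in SCRIPT_BLOCK_STARTS.values():
            if start <= cp < start + BLOCK_SIZE:
                # Keep the position within the block, change the block.
                cp = DEVANAGARI_START + (cp - start)
                break
        out.append(chr(cp))
    return "".join(out)


if __name__ == "__main__":
    # Bengali letter KA (U+0995) maps to Devanagari letter KA (U+0915).
    print(to_devanagari("\u0995") == "\u0915")
```

After such a mapping, related words written in different scripts share the same characters, which is what allows a single subword vocabulary to cover them.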
