Multilingual and cross-lingual document classification: A meta-learning approach

01/27/2021
by   Niels van der Heijden, et al.
11

The great majority of languages in the world are considered under-resourced for the successful application of deep learning methods. In this work, we propose a meta-learning approach to document classification in limited-resource setting and demonstrate its effectiveness in two different settings: few-shot, cross-lingual adaptation to previously unseen languages; and multilingual joint training when limited target-language data is available during training. We conduct a systematic comparison of several meta-learning methods, investigate multiple settings in terms of data availability and show that meta-learning thrives in settings with a heterogeneous task distribution. We propose a simple, yet effective adjustment to existing meta-learning methods which allows for better and more stable learning, and set a new state of the art on several languages while performing on-par on others, using only a small amount of labeled data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/10/2021

Cross-lingual Adaption Model-Agnostic Meta-Learning for Natural Language Understanding

Meta learning with auxiliary languages has demonstrated promising improv...
research
03/19/2022

Meta-X_NLG: A Meta-Learning Approach Based on Language Clustering for Zero-Shot Cross-Lingual Transfer and Generation

Recently, the NLP community has witnessed a rapid advancement in multili...
research
04/10/2021

Meta-learning for fast cross-lingual adaptation in dependency parsing

Meta-learning, or learning to learn, is a technique that can help to ove...
research
03/05/2020

Zero-Shot Cross-Lingual Transfer with Meta Learning

Learning what to share between tasks has been a topic of high importance...
research
04/10/2023

MERMAIDE: Learning to Align Learners using Model-Based Meta-Learning

We study how a principal can efficiently and effectively intervene on th...
research
06/02/2021

Minimax and Neyman-Pearson Meta-Learning for Outlier Languages

Model-agnostic meta-learning (MAML) has been recently put forth as a str...
research
03/04/2023

Self-tuning hyper-parameters for unsupervised cross-lingual tokenization

We explore the possibility of meta-learning for the language-independent...

Please sign up or login with your details

Forgot password? Click here to reset