Learning Cross-Lingual Sentence Representations via a Multi-task Dual-Encoder Model

10/30/2018
by   Muthuraman Chidambaram, et al.
0

Neural language models have been shown to achieve an impressive level of performance on a number of language processing tasks. The majority of these models, however, are limited to producing predictions for only English texts due to limited amounts of labeled data available in other languages. One potential method for overcoming this issue is learning cross-lingual text representations that can be used to transfer the performance from training on English tasks to non-English tasks, despite little to no task-specific non-English data. In this paper, we explore a natural setup for learning cross-lingual sentence representations: the dual-encoder. We provide a comprehensive evaluation of our cross-lingual representations on a number of monolingual, cross-lingual, and zero-shot/few-shot learning tasks, and also give an analysis of different learned cross-lingual embedding spaces.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2021

Language Embeddings for Typology and Cross-lingual Transfer Learning

Cross-lingual language tasks typically require a substantial amount of a...
research
11/08/2019

Cross-Lingual Relevance Transfer for Document Retrieval

Recent work has shown the surprising ability of multi-lingual BERT to se...
research
01/25/2023

Cross-lingual Argument Mining in the Medical Domain

Nowadays the medical domain is receiving more and more attention in appl...
research
08/23/2023

Graecia capta ferum victorem cepit. Detecting Latin Allusions to Ancient Greek Literature

Intertextual allusions hold a pivotal role in Classical Philology, with ...
research
05/12/2016

Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning

We introduce polyglot language models, recurrent neural network models t...
research
05/28/2021

Lightweight Cross-Lingual Sentence Representation Learning

Large-scale models for learning fixed-dimensional cross-lingual sentence...
research
10/02/2020

Cross-Lingual Transfer Learning for Complex Word Identification

Complex Word Identification (CWI) is a task centered on detecting hard-t...

Please sign up or login with your details

Forgot password? Click here to reset