Baselines and test data for cross-lingual inference

04/18/2017
by   Željko Agić, et al.
0

Research in natural language inference is currently exclusive to English. Here, we propose to advance toward multilingual evaluation. To that end, we provide test data for four major languages. We experiment with a set of baselines based on cross-lingual embeddings and machine translation. While our best system scores an average accuracy of just over 75 enabling further research in multilingual inference.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2018

XNLI: Evaluating Cross-lingual Sentence Representations

State-of-the-art natural language processing systems rely on supervision...
research
05/23/2023

Revisiting Machine Translation for Cross-lingual Classification

Machine Translation (MT) has been widely used for cross-lingual classifi...
research
11/10/2020

Multilingual AMR-to-Text Generation

Generating text from structured data is challenging because it requires ...
research
12/07/2020

Evaluating Cross-Lingual Transfer Learning Approaches in Multilingual Conversational Agent Models

With the recent explosion in popularity of voice assistant devices, ther...
research
03/17/2021

SML: a new Semantic Embedding Alignment Transformer for efficient cross-lingual Natural Language Inference

The ability of Transformers to perform with precision a variety of tasks...
research
05/23/2023

Pixel Representations for Multilingual Translation and Data-efficient Cross-lingual Transfer

We introduce and demonstrate how to effectively train multilingual machi...
research
09/09/2021

Subword Mapping and Anchoring across Languages

State-of-the-art multilingual systems rely on shared vocabularies that s...

Please sign up or login with your details

Forgot password? Click here to reset