DeepAI AI Chat
Log In Sign Up

Toward Cross-Lingual Definition Generation for Language Learners

10/12/2020
by   Cunliang Kong, et al.
0

Generating dictionary definitions automatically can prove useful for language learners. However, it's still a challenging task of cross-lingual definition generation. In this work, we propose to generate definitions in English for words in various languages. To achieve this, we present a simple yet effective approach based on publicly available pretrained language models. In this approach, models can be directly applied to other languages after trained on the English dataset. We demonstrate the effectiveness of this approach on zero-shot definition generation. Experiments and manual analyses on newly constructed datasets show that our models have a strong cross-lingual transfer ability and can generate fluent English definitions for Chinese words. We further measure the lexical complexity of generated and reference definitions. The results show that the generated definitions are much simpler, which is more suitable for language learners.

READ FULL TEXT

page 1

page 2

page 3

page 4

06/09/2023

Assisting Language Learners: Automated Trans-Lingual Definition Generation via Contrastive Prompt Learning

The standard definition generation task requires to automatically produc...
03/24/2022

Multitasking Framework for Unsupervised Simple Definition Generation

The definition generation task can help language learners by providing e...
04/23/2022

LitMind Dictionary: An Open-Source Online Dictionary

Dictionaries can help language learners to learn vocabulary by providing...
09/19/2022

ALEXSIS-PT: A New Resource for Portuguese Lexical Simplification

Lexical simplification (LS) is the task of automatically replacing compl...
09/29/2022

COMPILING: A Benchmark Dataset for Chinese Complexity Controllable Definition Generation

The definition generation task aims to generate a word's definition with...
12/07/2020

What Meaning-Form Correlation Has to Compose With

Compositionality is a widely discussed property of natural languages, al...