Low-resource Languages: A Review of Past Work and Future Challenges

06/12/2020
by   Alexandre Magueresse, et al.
0

A current problem in NLP is massaging and processing low-resource languages which lack useful training attributes such as supervised data, number of native speakers or experts, etc. This review paper concisely summarizes previous groundbreaking achievements made towards resolving this problem, and analyzes potential improvements in the context of the overall future research direction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/04/2019

Towards Realistic Practices In Low-Resource Natural Language Processing: The Development Set

Development sets are impractical to obtain for real low-resource languag...
research
02/16/2022

Knowledge Extraction in Low-Resource Scenarios: Survey and Perspective

Knowledge Extraction (KE) which aims to extract structural information f...
research
06/15/2022

Location-based Twitter Filtering for the Creation of Low-Resource Language Datasets in Indonesian Local Languages

Twitter contains an abundance of linguistic data from the real world. We...
research
06/12/2018

Challenges of language technologies for the indigenous languages of the Americas

Indigenous languages of the American continent are highly diverse. Howev...
research
01/18/2019

Automatic Keyboard Layout Design for Low-Resource Latin-Script Languages

We present our approach to automatically designing and implementing keyb...
research
05/04/2023

VendorLink: An NLP approach for Identifying Linking Vendor Migrants Potential Aliases on Darknet Markets

The anonymity on the Darknet allows vendors to stay undetected by using ...
research
09/19/2023

NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages

Democratizing access to natural language processing (NLP) technology is ...

Please sign up or login with your details

Forgot password? Click here to reset