Challenges and Strategies in Cross-Cultural NLP

by Daniel Hershcovich, et al.

Various efforts in the Natural Language Processing (NLP) community have been made to accommodate linguistic diversity and serve speakers of many different languages. However, it is important to acknowledge that speakers, and the content they produce and require, vary not just by language but also by culture. Although language and culture are tightly linked, there are important differences. Analogous to cross-lingual and multilingual NLP, cross-cultural and multicultural NLP considers these differences in order to better serve users of NLP systems. We propose a principled framework to organize these efforts, and survey existing and potential strategies.





Dataset Geography: Mapping Language Data to Language Users

As language technologies become more ubiquitous, there are increasing ef...

EnCBP: A New Benchmark Dataset for Finer-Grained Cultural Background Prediction in English

While cultural backgrounds have been shown to affect linguistic expressi...

Identifying Cultural Differences through Multi-Lingual Wikipedia

Understanding cross-cultural differences is an important application of ...

Evaluating Language Tools for Fifteen EU-official Under-resourced Languages

This article presents the results of the evaluation campaign of language...

Towards transparency in NLP shared tasks

This article reports on a survey carried out across the Natural Language...

Is Machine Learning Speaking my Language? A Critical Look at the NLP-Pipeline Across 8 Human Languages

Natural Language Processing (NLP) is increasingly used as a key ingredie...

How can NLP Help Revitalize Endangered Languages? A Case Study and Roadmap for the Cherokee Language

More than 43% of the languages spoken in the world are endangered, and language loss currently occurs at an accelerated rate becau...