NLP for Language Varieties of Italy: Challenges and the Path Forward

09/20/2022
by Alan Ramponi, et al.

Italy is characterized by a linguistic diversity landscape that is one of a kind in Europe, and which implicitly encodes the local knowledge, cultural traditions, artistic expression, and history of its speakers. However, over 30 language varieties in Italy are at risk of disappearing within a few generations. Language technology has a major role to play in preserving endangered languages, but it currently struggles with such varieties because they are under-resourced and mostly lack a standardized orthography, being used mainly in spoken settings. In this paper, we introduce the linguistic context of Italy and discuss the challenges facing the development of NLP technologies for Italy's language varieties. We provide potential directions and advocate for a paradigm shift from machine-centric to speaker-centric NLP. Finally, we propose building a local community working towards the responsible, participatory development of speech and language technologies for the languages and dialects of Italy.

