Learning about Spanish dialects through Twitter

11/16/2015
by Bruno Gonçalves, et al.

This paper maps the large-scale variation of the Spanish language using a corpus of geographically tagged Twitter messages. Lexical dialects are extracted from an analysis of the variants of tens of concepts. The resulting maps show linguistic variation across the globe on an unprecedented scale. We discuss the properties of the main dialects within a machine learning approach and find that varieties spoken in urban areas have an international character, in contrast to rural areas, where dialects show greater regional uniformity.

research
07/26/2014

Crowdsourcing Dialect Characterization through Twitter

We perform a large-scale analysis of language diatopic variation using g...
research
10/12/2021

A large scale lexical and semantic analysis of Spanish language variations in Twitter

Dialectometry is a discipline devoted to studying the variations of a la...
research
02/22/2017

Dialectometric analysis of language variation in Twitter

In the last few years, microblogging platforms such as Twitter have give...
research
07/02/2015

Determining rural areas vulnerable to illegal dumping using GIS techniques. Case study: Neamt county, Romania

The paper aims to mapping the potential vulnerable areas to illegal dump...
research
06/29/2020

Is Japanese gendered language used on Twitter ? A large scale study

This study analyzes the usage of Japanese gendered language on Twitter. ...
research
04/03/2018

Socioeconomic Dependencies of Linguistic Patterns in Twitter: A Multivariate Analysis

Our usage of language is not solely reliant on cognition but is arguably...
research
12/07/2020

Computing flood probabilities using Twitter: application to the Houston urban area during Harvey

In this paper, we investigate the conversion of a Twitter corpus into ge...
