Is Japanese gendered language used on Twitter ? A large scale study

06/29/2020
by   Tiziana Carpi, et al.
0

This study analyzes the usage of Japanese gendered language on Twitter. Starting from a collection of 408 million Japanese tweets from 2015 till 2019 and an additional sample of 2355 manually classified Twitter accounts timelines into gender and categories (politicians, musicians, etc). A large scale textual analysis is performed on this corpus to identify and examine sentence-final particles (SFPs) and first-person pronouns appearing in the texts. It turns out that gendered language is in fact used also on Twitter, in about 6 tweets, and that the prescriptive classification into "male" and "female" language does not always meet the expectations, with remarkable exceptions. Further, SFPs and pronouns show increasing or decreasing trends, indicating an evolution of the language used on Twitter.

READ FULL TEXT

page 8

page 12

03/31/2020

A large-scale Twitter dataset for drug safety applications mined from publicly existing resources

With the increase in popularity of deep learning models for natural lang...
08/23/2018

Arap-Tweet: A Large Multi-Dialect Twitter Corpus for Gender, Age and Language Variety Identification

In this paper, we present Arap-Tweet, which is a large-scale and multi-d...
04/06/2018

Forex trading and Twitter: Spam, bots, and reputation manipulation

Currency trading (Forex) is the largest world market in terms of volume....
05/07/2021

The Shadowy Lives of Emojis: An Analysis of a Hacktivist Collective's Use of Emojis on Twitter

Emojis have established themselves as a popular means of communication i...
07/26/2014

Crowdsourcing Dialect Characterization through Twitter

We perform a large-scale analysis of language diatopic variation using g...
11/26/2019

Tracing State-Level Obesity Prevalence from Sentence Embeddings of Tweets: A Feasibility Study

Twitter data has been shown broadly applicable for public health surveil...
11/16/2015

Learning about Spanish dialects through Twitter

This paper maps the large-scale variation of the Spanish language by emp...