Studying Politeness across Cultures Using English Twitter and Mandarin Weibo

08/06/2020
by   Mingyang Li, et al.
0

Modeling politeness across cultures helps to improve intercultural communication by uncovering what is considered appropriate and polite. We study the linguistic features associated with politeness across US English and Mandarin Chinese. First, we annotate 5,300 Twitter posts from the US and 5,300 Sina Weibo posts from China for politeness scores. Next, we develop an English and Chinese politeness feature set, `PoliteLex'. Combining it with validated psycholinguistic dictionaries, we then study the correlations between linguistic features and perceived politeness across cultures. We find that on Mandarin Weibo, future-focusing conversations, identifying with a group affiliation, and gratitude are considered to be more polite than on English Twitter. Death-related taboo topics, lack of or poor choice of pronouns, and informal language are associated with higher impoliteness on Mandarin Weibo compared to English Twitter. Finally, we build language-based machine learning models to predict politeness with an F1 score of 0.886 on Mandarin Weibo and a 0.774 on English Twitter.

READ FULL TEXT

page 8

page 10

research
11/26/2020

Towards Interpretable Multilingual Detection of Hate Speech against Immigrants and Women in Twitter at SemEval-2019 Task 5

his paper describes our techniques to detect hate speech against women a...
research
03/16/2020

LAXARY: A Trustworthy Explainable Twitter Analysis Model for Post-Traumatic Stress Disorder Assessment

Veteran mental health is a significant national problem as large number ...
research
10/25/2021

Fine-tuning of Pre-trained Transformers for Hate, Offensive, and Profane Content Detection in English and Marathi

This paper describes neural models developed for the Hate Speech and Off...
research
11/18/2017

Is China Entering WTO or shijie maoyi zuzhi--a Corpus Study of English Acronyms in Chinese Newspapers

This is one of the first studies that quantitatively examine the usage o...
research
06/25/2021

Fine-grained Geolocation Prediction of Tweets with Human Machine Collaboration

Twitter is a useful resource to analyze peoples' opinions on various top...
research
10/10/2022

Assessing Neural Referential Form Selectors on a Realistic Multilingual Dataset

Previous work on Neural Referring Expression Generation (REG) all uses W...
research
04/18/2021

News Meets Microblog: Hashtag Annotation via Retriever-Generator

Hashtag annotation for microblog posts has been recently formulated as a...

Please sign up or login with your details

Forgot password? Click here to reset