Topical differences between Chinese language Twitter and Sina Weibo

12/22/2015
by Qian Zhang, et al.

Sina Weibo, China's most popular microblogging platform, is currently used by over 500M users and is considered to be a proxy of Chinese social life. In this study, we contrast the discussions occurring on Sina Weibo and on Chinese-language Twitter in order to observe two different strands of Chinese culture: people within China, who use Sina Weibo with its government-imposed restrictions, and those outside China, who are free to speak completely anonymously. We first propose a simple ad hoc algorithm to identify the topics of tweets and weibos. Unlike previous work on micro-message topic detection, our algorithm accounts for topics that share the same content but carry different #tags. It can also detect topics for tweets and weibos without any #tags. Using a large corpus of weibos and Chinese-language tweets covering the period from January 1 to December 31, 2012, we obtain a list of topics from clustered #tags, which we then use to compare the two platforms. Surprisingly, we find that the Top 100 most popular topics on the two platforms have no entries in common. Furthermore, only 9.2% of tweets correspond to the Top 1000 topics on Sina Weibo, and conversely only 4.4% of weibos discuss the most popular Twitter topics. Our results reveal significant differences in social attention on the two platforms: the most popular topics on Sina Weibo relate to entertainment, while most tweets correspond to cultural or political content that is practically nonexistent on Sina Weibo.
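The comparison described in the abstract (overlap between the most popular topics on each platform, and the share of one platform's messages that fall under the other's top topics) can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes each message has already been assigned a list of topic labels, and the function names `top_topics` and `coverage`, as well as the toy data, are hypothetical.

```python
from collections import Counter


def top_topics(messages, n):
    """Return the n most frequent topic labels over all messages.

    `messages` is a list of lists of topic labels (assumed to have been
    produced by some earlier topic-detection step).
    """
    counts = Counter(topic for topics in messages for topic in topics)
    return [topic for topic, _ in counts.most_common(n)]


def coverage(messages, topics):
    """Fraction of messages that carry at least one of the given topics."""
    if not messages:
        return 0.0
    topic_set = set(topics)
    hits = sum(1 for topics_of_msg in messages if topic_set & set(topics_of_msg))
    return hits / len(messages)


# Toy data standing in for the two corpora (purely illustrative).
weibo = [["film"], ["film", "music"], ["music"], ["sports"]]
tweets = [["politics"], ["politics", "culture"], ["culture", "politics"], ["film"], []]

weibo_top = top_topics(weibo, 2)
twitter_top = top_topics(tweets, 2)

# Topics that appear in both platforms' top lists.
shared = set(weibo_top) & set(twitter_top)

# Share of tweets that fall under Weibo's top topics, and vice versa.
tweet_cov = coverage(tweets, weibo_top)
weibo_cov = coverage(weibo, twitter_top)

print("shared top topics:", shared)
print("tweets covered by Weibo top topics:", tweet_cov)
print("weibos covered by Twitter top topics:", weibo_cov)
```

In the study itself the top lists contain 100 and 1000 topics rather than 2, and messages without #tags are also assigned topics, which this sketch simply takes as given.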


