Tracking the Diffusion of Named Entities

12/22/2017
by   Leon Derczynski, et al.
0

Existing studies of how information diffuses across social networks have thus far concentrated on analysing and recovering the spread of deterministic innovations such as URLs, hashtags, and group membership. However investigating how mentions of real-world entities appear and spread has yet to be explored, largely due to the computationally intractable nature of performing large-scale entity extraction. In this paper we present, to the best of our knowledge, one of the first pieces of work to closely examine the diffusion of named entities on social media, using Reddit as our case study platform. We first investigate how named entities can be accurately recognised and extracted from discussion posts. We then use these extracted entities to study the patterns of entity cascades and how the probability of a user adopting an entity (i.e. mentioning it) is associated with exposures to the entity. We put these pieces together by presenting a parallelised diffusion model that can forecast the probability of entity adoption, finding that the influence of adoption between users can be characterised by their prior interactions -- as opposed to whether the users propagated entity-adoptions beforehand. Our findings have important implications for researchers studying influence and language, and for community analysts who wish to understand entity-level influence dynamics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/24/2018

Tracking the History and Evolution of Entities: Entity-centric Temporal Analysis of Large Social Media Archives

How did the popularity of the Greek Prime Minister evolve in 2015? How d...
research
12/06/2017

Named Entity Sequence Classification

Named Entity Recognition (NER) aims at locating and classifying named en...
research
05/22/2023

DiffusionNER: Boundary Diffusion for Named Entity Recognition

In this paper, we propose DiffusionNER, which formulates the named entit...
research
12/10/2021

TechRank: A Network-Centrality Approach for Informed Cybersecurity-Investment

The cybersecurity technological landscape is a complex ecosystem in whic...
research
01/13/2015

Towards Deep Semantic Analysis Of Hashtags

Hashtags are semantico-syntactic constructs used across various social n...
research
08/16/2020

TopicBERT: A Transformer transfer learning based memory-graph approach for multimodal streaming social media topic detection

Real time nature of social networks with bursty short messages and their...
research
01/21/2020

Raiders of the Lost Kek: 3.5 Years of Augmented 4chan Posts from the Politically Incorrect Board

This paper presents a dataset with over 3.3M threads and 134.5M posts fr...

Please sign up or login with your details

Forgot password? Click here to reset