Temporal Activity Path Based Character Correction in Social Networks

06/23/2018
by   Jun Long, et al.
0

Vast amount of multimedia data contains massive and multifarious social information which is used to construct large-scale social networks. In a complex social network, a character should be ideally denoted by one and only one vertex. However, it is pervasive that a character is denoted by two or more vertices with different names, thus it is usually considered as multiple, different characters. This problem causes incorrectness of results in network analysis and mining. The factual challenge is that character uniqueness is hard to correctly confirm due to lots of complicated factors, e.g. name changing and anonymization, leading to character duplication. Early, limited research has shown that previous methods depended overly upon supplementary attribute information from databases. In this paper, we propose a novel method to merge the character vertices which refer to as the same entity but are denoted with different names. With this method, we firstly build the relationship network among characters based on records of social activities participated, which are extracted from multimedia sources. Then define temporal activity paths (TAPs) for each character over time. After that, we measure similarity of the TAPs for any two characters. If the similarity is high enough, the two vertices should be considered to the same character. Based on TAPs, we can determine whether to merge the two character vertices. Our experiments shown that this solution can accurately confirm character uniqueness in large-scale social network.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/20/2023

An Error-Correction Model for Information Transmissions of Social Networks

We study the error-correction problem of the communication between two v...
research
04/29/2020

Measuring Information Propagation in Literary Social Networks

We present the task of modeling information propagation in literature, i...
research
02/21/2017

Efficient Social Network Multilingual Classification using Character, POS n-grams and Dynamic Normalization

In this paper we describe a dynamic normalization process applied to soc...
research
09/17/2019

ShamFinder: An Automated Framework for Detecting IDN Homographs

The internationalized domain name (IDN) is a mechanism that enables us t...
research
08/25/2020

Complicating the Social Networks for Better Storytelling: An Empirical Study of Chinese Historical Text and Novel

Digital humanities is an important subject because it enables developmen...
research
03/29/2019

Frowning Frodo, Wincing Leia, and a Seriously Great Friendship: Learning to Classify Emotional Relationships of Fictional Characters

The development of a fictional plot is centered around characters who cl...
research
07/05/2019

Extraction and Analysis of Fictional Character Networks: A Survey

A character network is a graph extracted from a narrative, in which vert...

Please sign up or login with your details

Forgot password? Click here to reset