Semi-Supervised Record Linkage for Construction of Large-Scale Sociocentric Networks in Resource-limited Settings: An application to the SEARCH Study in Rural Uganda and Kenya

08/24/2019
by   Yiqun Chen, et al.
0

This paper presents a novel semi-supervised algorithmic approach to creating large scale sociocentric networks in rural East Africa. We describe the construction of 32 large-scale sociocentric social networks in rural Sub-Saharan Africa. Networks were constructed by applying a semi-supervised record-linkage algorithm to data from census-enumerated residents of the 32 communities included in the SEARCH study (NCT01864603), a community-cluster randomized HIV prevention trial in Uganda and Kenya. Contacts were solicited using a five question name generator in the domains of emotional support, food sharing, free time, health issues and money issues. The fully constructed networks include 170; 028 nodes and 362; 965 edges aggregated across communities (ranging from 4449 to 6829 nodes and from 2349 to 31,779 edges per community). Our algorithm matched on average 30 communities and 50 in census enumeration. Assortative mixing measures for eight different covariates reveal that residents in the network have a very strong tendency to associate with others who are similar to them in age, sex, and especially village. The networks in the SEARCH Study will provide a platform for improved understanding of health outcomes in rural East Africa. The network construction algorithm we present may facilitate future social network research in resource-limited settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/01/2023

Semi-supervised Community Detection via Structural Similarity Metrics

Motivated by social network analysis and network-based recommendation sy...
research
10/19/2021

Subsampling Spectral Clustering for Large-Scale Social Networks

Online social network platforms such as Twitter and Sina Weibo have been...
research
11/11/2020

A Distributed Algorithm for Overlapped Community Detection in Large-Scale Networks

Overlapped community detection in social networks has become an importan...
research
05/12/2020

SMACD: Semi-supervised Multi-Aspect Community Detection

Community detection in real-world graphs has been shown to benefit from ...
research
01/16/2021

A multilevel clustering technique for community detection

A network is a composition of many communities, i.e., sets of nodes and ...
research
04/29/2020

A Large-Scale Semi-Supervised Dataset for Offensive Language Identification

The use of offensive language is a major problem in social media which h...
research
04/17/2020

Structuring Communities for Sharing Human Digital Memories in a Social P2P Network

A community is sub-network inside P2P networks that partition the networ...

Please sign up or login with your details

Forgot password? Click here to reset