Subdivisions and Crossroads: Identifying Hidden Community Structures in a Data Archive's Citation Network

05/17/2022
by Sara Lafia, et al.

Data archives are an important source of high-quality data in many fields, making them ideal sites to study data reuse. By studying data reuse through citation networks, we are able to learn how hidden research communities (those that use the same scientific datasets) are organized. This paper analyzes the community structure of an authoritative network of datasets cited in academic publications, which have been collected by a large social science data archive: the Inter-university Consortium for Political and Social Research (ICPSR). Through network analysis, we identified communities of social science datasets and fields of research connected through shared data use. We argue that communities of exclusive data reuse form "subdivisions" that contain valuable disciplinary resources, while datasets at a "crossroads" broadly connect research communities. Our research reveals the hidden structure of data reuse and demonstrates how interdisciplinary research communities organize around datasets as shared scientific inputs. These findings contribute new ways of describing scientific communities in order to understand the impacts of research data reuse.
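
As a rough illustration of the idea in the abstract, the sketch below builds a dataset co-citation network from a toy citation table and separates "crossroads" datasets from "subdivisions". The abstract does not say which community detection method the authors used, so everything here is an assumption made for illustration: the toy data, the function names, the rule that a crossroads dataset is one cited from more than one field, and the use of plain connected components as a stand-in for community detection.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy data: publication id -> (field, datasets it cites).
CITATIONS = {
    "pub1": ("sociology", {"A", "B"}),
    "pub2": ("sociology", {"B", "C"}),
    "pub3": ("economics", {"D", "E"}),
    "pub4": ("economics", {"C", "E"}),
    "pub5": ("public health", {"F"}),
}


def co_citation_graph(citations):
    """Link two datasets whenever a single publication cites both."""
    graph = defaultdict(set)
    for _field, datasets in citations.values():
        for dataset in datasets:
            graph.setdefault(dataset, set())  # keep datasets with no co-citations
        for a, b in combinations(sorted(datasets), 2):
            graph[a].add(b)
            graph[b].add(a)
    return graph


def fields_per_dataset(citations):
    """Collect the set of fields whose publications cite each dataset."""
    fields = defaultdict(set)
    for field, datasets in citations.values():
        for dataset in datasets:
            fields[dataset].add(field)
    return fields


def components(graph, excluded=frozenset()):
    """Connected components of the graph, skipping the excluded nodes."""
    seen, result = set(), []
    for start in sorted(graph):
        if start in seen or start in excluded:
            continue
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(
                n for n in graph[node] if n not in component and n not in excluded
            )
        seen |= component
        result.append(component)
    return result


graph = co_citation_graph(CITATIONS)
fields = fields_per_dataset(CITATIONS)

# Assumed rule: a "crossroads" dataset is cited from more than one field.
crossroads = {d for d, cited_by in fields.items() if len(cited_by) > 1}

# Assumed rule: "subdivisions" are the groups left once crossroads are set aside.
subdivisions = components(graph, excluded=crossroads)

print("crossroads:", sorted(crossroads))
print("subdivisions:", [sorted(group) for group in subdivisions])
```

On this toy table, dataset C is the only one cited from two fields, so it is flagged as a crossroads, and the remaining datasets fall into groups that are each used within a single field. A real analysis would likely replace the connected-components step with a proper community detection algorithm and use the archive's actual citation records.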


