Normalization of zero-inflated data: An empirical analysis of a new indicator family and its use with altmetrics data

12/06/2017
by Lutz Bornmann, et al.

Recently, two new indicators (Equalized Mean-based Normalized Proportion Cited, EMNPC; Mean-based Normalized Proportion Cited, MNPC) were proposed for sparse scientometrics data. Both compare the proportion of mentioned papers (e.g., on Facebook) of a unit (e.g., a researcher or institution) with the proportion of mentioned papers in the corresponding fields and publication years (the expected values). In this study, we propose a third indicator from the same family, the Mantel-Haenszel quotient (MHq). The MHq is based on MH analysis, an established statistical method for comparing proportions. Using citations and assessments by peers (i.e., F1000Prime recommendations), we test whether the three indicators can distinguish between different quality levels as defined by the peer assessments; that is, we test their convergent validity. We find that the MHq distinguishes between the quality levels in most cases, while MNPC and EMNPC do not. Since the MHq is shown in this study to be a valid indicator, we apply it to six types of zero-inflated altmetrics data and test whether different altmetrics sources are related to quality. The results show that the relationship between altmetrics (Wikipedia, Facebook, blogs, and news data) and assessments by peers is not as strong as the relationship between citations and assessments by peers: the relationship between citations and peer assessments is about two to three times stronger than the association between altmetrics and peer assessments.
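To make the idea of comparing proportions across fields and publication years concrete, the following is a minimal sketch of the classic Mantel-Haenszel pooled odds ratio, the textbook estimator from MH analysis. It is an illustration only: the abstract does not give the exact definition of the MHq, so this should not be read as the authors' formula. The `Stratum` class, its field names, and the example counts are hypothetical, with one stratum standing for one combination of field and publication year.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class Stratum:
    """One 2x2 table for a single field/publication-year combination (hypothetical layout)."""
    unit_mentioned: int    # papers of the unit with at least one mention
    unit_unmentioned: int  # papers of the unit with no mention
    ref_mentioned: int     # reference-set papers with at least one mention
    ref_unmentioned: int   # reference-set papers with no mention


def mantel_haenszel_odds_ratio(strata: Iterable[Stratum]) -> float:
    """Pool the stratum-level 2x2 tables into one Mantel-Haenszel odds ratio.

    Values above 1 indicate that the unit's papers are mentioned more often
    than the reference papers in the same strata; values below 1, less often.
    """
    numerator = 0.0
    denominator = 0.0
    for s in strata:
        n = s.unit_mentioned + s.unit_unmentioned + s.ref_mentioned + s.ref_unmentioned
        if n == 0:
            continue  # an empty stratum carries no information
        numerator += s.unit_mentioned * s.ref_unmentioned / n
        denominator += s.unit_unmentioned * s.ref_mentioned / n
    if denominator == 0:
        raise ValueError("pooled odds ratio is undefined for these counts")
    return numerator / denominator


# Made-up counts for two strata, purely for illustration.
example = [
    Stratum(unit_mentioned=2, unit_unmentioned=8, ref_mentioned=1, ref_unmentioned=9),
    Stratum(unit_mentioned=10, unit_unmentioned=10, ref_mentioned=10, ref_unmentioned=10),
]
print(mantel_haenszel_odds_ratio(example))
```

Pooling within strata, rather than collapsing all papers into one table, is what keeps the comparison field- and time-normalized: each paper set is only contrasted with reference papers from the same field and year.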


