Bayesian Propagation of Record Linkage Uncertainty into Population Size Estimation of Human Rights Violations

12/22/2018
by   Mauricio Sadinle, et al.
0

Multiple-systems or capture-recapture estimation are common techniques for population size estimation, particularly in the quantitative study of human rights violations. These methods rely on multiple samples from the population, along with the information of which individuals appear in which samples. The goal of record linkage techniques is to identify unique individuals across samples based on the information collected on them. Linkage decisions are subject to uncertainty when such information contains errors and missingness, and when different individuals have very similar characteristics. Uncertainty in the linkage should be propagated into the stage of population size estimation. We propose an approach called linkage-averaging to propagate linkage uncertainty, as quantified by some Bayesian record linkage methodologies, into a subsequent stage of population size estimation. Linkage-averaging is a two-stage approach in which the results from the record linkage stage are fed into the population size estimation stage. We show that under some conditions the results of this approach correspond to those of a proper Bayesian joint model for both record linkage and population size estimation. The two-stage nature of linkage-averaging allows us to combine different record linkage models with different capture-recapture models, which facilitates model exploration. We present a case study from the Salvadoran civil war, where we are interested in estimating the total number of civilian killings using lists of witnesses' reports collected by different organizations. These lists contain duplicates, typographical and spelling errors, missingness, and other inaccuracies that lead to uncertainty in the linkage. We show how linkage-averaging can be used for transferring the uncertainty in the linkage of these lists into different models for population size estimation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/13/2021

drpop: Efficient and Doubly Robust Population Size Estimation in R

This paper introduces the R package drpop to flexibly estimate total pop...
research
09/01/2020

Invited Discussion of "A Unified Framework for De-Duplication and Population Size Estimation"

Invited Discussion of "A Unified Framework for De-Duplication and Popula...
research
01/25/2016

Bayesian Estimation of Bipartite Matchings for Record Linkage

The bipartite record linkage task consists of merging two disparate data...
research
04/29/2021

Doubly robust capture-recapture methods for estimating population size

Estimation of population size using incomplete lists (also called the ca...
research
01/23/2019

A new integrated likelihood for estimating population size in dependent dual-record system

Efficient estimation of population size from dependent dual-record syste...
research
10/15/2022

Fisher's Noncentral Hypergeometric Distribution for Population Size Estimation

We introduce a method to make inference on the subgroups' sizes of a het...

Please sign up or login with your details

Forgot password? Click here to reset