Estimation of population size based on capture recapture designs and evaluation of the estimation reliability

05/12/2021
by Yue You, et al.

We propose a modern method to estimate population size based on capture-recapture designs of K samples. The observed data are formulated as a sample of n i.i.d. K-dimensional vectors of binary indicators, where the k-th component of each vector indicates whether the subject was caught by the k-th sample, and only subjects with nonzero capture vectors are observed. The target quantity is the unconditional probability that the vector is nonzero, taken across both observed and unobserved subjects. We cover models that assume a single constraint (the identification assumption) on the K-dimensional distribution, such that the target quantity is identified and the statistical model is unrestricted. We present solutions for linear and non-linear constraints commonly assumed to identify capture-recapture models, including no K-way interaction in linear and log-linear models, independence, and conditional independence. We demonstrate that the choice of constraint has a dramatic impact on the value of the estimand, showing that it is crucial that the constraint is known to hold by design. For the commonly assumed constraint of no K-way interaction in a log-linear model, the statistical target parameter is only defined when each of the 2^K - 1 observable capture patterns is present, and it therefore suffers from the curse of dimensionality. We propose a targeted MLE based on an undersmoothed lasso model to smooth across the cells while targeting the fit towards the single-valued target parameter of interest. For each identification assumption, we provide simulated inference and confidence intervals to assess the performance of the estimator under correct and incorrect identifying assumptions. We apply the proposed method, alongside existing estimators, to estimate the prevalence of a parasitic infection using multi-source surveillance data from a region in southwestern China, under the four identification assumptions.
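
To make the log-linear constraint concrete, the following is a minimal sketch of a naive plug-in estimator under the assumption of no K-way interaction. It is not the targeted MLE proposed in the paper; the function name and the input layout (a dict from capture patterns to counts) are assumptions made for illustration. Setting the highest-order log-linear term to zero gives P(B = 0) = prod over odd patterns of P(b) divided by prod over even nonzero patterns of P(b), so the probability of being observed is psi = 1 / (1 + R), where R is the same ratio computed from the observed cell proportions.

```python
from itertools import product
from math import exp, log


def plugin_no_kway_interaction(counts):
    """Naive plug-in estimate under the no-K-way-interaction constraint.

    counts: dict mapping each observed capture pattern (a K-tuple of 0/1,
            not all zero) to the number of subjects with that pattern.
            This input layout is an assumption made for illustration.
    Returns (psi_hat, N_hat): the estimated probability of being captured
    at least once and the implied population size n / psi_hat.
    """
    K = len(next(iter(counts)))
    # All 2^K - 1 observable (nonzero) capture patterns.
    patterns = [b for b in product((0, 1), repeat=K) if any(b)]

    # The estimand is undefined if any observable cell is empty.
    missing = [b for b in patterns if counts.get(b, 0) <= 0]
    if missing:
        raise ValueError(f"empty capture patterns, estimand undefined: {missing}")

    n = sum(counts[b] for b in patterns)

    # log R = sum over patterns of (+1 if odd number of captures, else -1)
    #         times log of the observed cell proportion.
    log_ratio = sum(
        (1 if sum(b) % 2 == 1 else -1) * log(counts[b] / n) for b in patterns
    )
    psi_hat = 1.0 / (1.0 + exp(log_ratio))
    return psi_hat, n / psi_hat
```

For K = 2 this reduces to the classical Lincoln-Petersen estimator. The explicit check for empty cells mirrors the point made in the abstract: with 2^K - 1 cells to fill, the plug-in quickly becomes undefined as K grows, which is the motivation for smoothing across cells.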

research
07/31/2020

Partial identification and dependence-robust confidence intervals for capture-recapture surveys

Capture-recapture (CRC) surveys are widely used to estimate the size of ...
research
01/22/2021

Revisiting Identifying Assumptions for Population Size Estimation

The problem of estimating the size of a population based on a subset of ...
research
06/19/2023

On some pitfalls of the log-linear modeling framework for capture-recapture studies in disease surveillance

In epidemiological studies, the capture-recapture (CRC) method is a powe...
research
12/15/2017

Efficient Principally Stratified Treatment Effect Estimation in Crossover Studies with Absorbent Binary Endpoints

Suppose one wishes to estimate the effect of a binary treatment on a bin...
research
09/25/2020

Parameter Restrictions for the Sake of Identification: Is there Utility in Asserting that Perhaps a Restriction Holds?

Statistical modeling can involve a tension between assumptions and stati...
research
04/29/2021

Doubly robust capture-recapture methods for estimating population size

Estimation of population size using incomplete lists (also called the ca...
research
03/10/2019

Stackelberg Independence

The standard model of sequential capacity choices is the Stackelberg qua...
