Better understanding of the multivariate hypergeometric distribution with implications in design-based survey sampling

01/03/2021
by   X. G. Duan, et al.
0

Multivariate hypergeometric distribution arises frequently in elementary statistics and probability courses, for simultaneously studying the occurence law of specified events, when sampling without replacement from a finite population with fixed number of classification. Covariance matrix of this distribution is well known to be identical to its multinomial counterpart multiplied by 1-(n-1)/(N-1), with N and n being population and sample sizes, respectively. It appears to however, have been less discussed in the literature about the meaning of this relationship, especially regarding the specific form of the multiplier. Based on an augmenting argument together with probabilistic symmetry, we present a more transparent understanding for the covariance structure of the multivariate hypergeometric distribution. We discuss implications of these combined techniques and provide a unified description about the relative efficiency for estimating population mean based on simple random sampling, probability proportional-to-size sampling and adaptive cluster sampling, with versus without replacement. We also provide insight into the classic random group method for variance estimation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/01/2022

Some observations on the distribution of order statistics under simple-random-sampling without replacement

This paper examines the distribution of order statistics from simple-ran...
research
08/09/2023

Deficiency bounds for the multivariate inverse hypergeometric distribution

The multivariate inverse hypergeometric (MIH) distribution is an extensi...
research
07/22/2023

Survey Design and Estimating Equations when Combining Big Data with Probability Samples

The use of big data in official statistics and the applied sciences is a...
research
04/24/2020

Variance Reduction for Better Sampling in Continuous Domains

Design of experiments, random search, initialization of population-based...
research
03/24/2019

Cost Issue in Estimation of Proportion in a Finite Population Divided Among Two Strata

The problem of estimation of the proportion of units with a given attrib...
research
01/27/2023

Sampling without replacement from a high-dimensional finite population

It is well known that most of the existing theoretical results in statis...
research
11/29/2017

Bayesian analysis of finite population sampling in multivariate co-exchangeable structures with separable covariance matric

We explore the effect of finite population sampling in design problems w...

Please sign up or login with your details

Forgot password? Click here to reset