Some New Results for Poisson Binomial Models

07/21/2019
by   Evan Rosenman, et al.

We consider a problem of ecological inference, in which individual-level covariates are known but labeled data are available only at the aggregate level. The intended application is modeling voter preferences in elections. In Rosenman and Viswanathan (2018), we proposed modeling individual voter probabilities via logistic regression and posing the problem as maximum likelihood estimation of the parameter vector beta. The likelihood is a Poisson binomial, the distribution of a sum of independent but not identically distributed Bernoulli variables, though we approximate it with a heteroscedastic Gaussian for computational efficiency. Here, we extend the prior work by proving results about the existence of the MLE and the curvature of this likelihood, which is not log-concave in general. We further demonstrate the utility of our method on a real data example. Using data on voters in Morris County, NJ, we show that our approach outperforms other ecological inference methods in predicting a related but known outcome: whether an individual votes.
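To make the setup concrete, the following is a minimal sketch of the Gaussian-approximated likelihood described above, not the authors' implementation. It assumes each aggregate unit (for example, a precinct) supplies a covariate matrix and an observed count, and that the Gaussian approximation matches the Poisson binomial mean (the sum of the p_i) and variance (the sum of p_i(1 - p_i)). The function names `gaussian_approx_nll` and `poisson_binomial_pmf` and the argument layout are hypothetical, chosen only for illustration.

```python
import numpy as np


def sigmoid(z):
    """Logistic function mapping linear scores to probabilities."""
    return 1.0 / (1.0 + np.exp(-z))


def gaussian_approx_nll(beta, X_groups, counts):
    """Negative log-likelihood under a Gaussian approximation.

    Hypothetical layout: X_groups is a list of (n_g, k) covariate
    matrices, one per aggregate unit, and counts holds the observed
    total for each unit. Each total is treated as Gaussian with the
    Poisson binomial mean and variance, so the variance differs by
    unit (heteroscedastic).
    """
    nll = 0.0
    for X, d in zip(X_groups, counts):
        p = sigmoid(X @ beta)             # individual probabilities
        mu = p.sum()                      # Poisson binomial mean
        var = (p * (1.0 - p)).sum()       # Poisson binomial variance
        nll += 0.5 * np.log(2.0 * np.pi * var) + (d - mu) ** 2 / (2.0 * var)
    return nll


def poisson_binomial_pmf(p):
    """Exact Poisson binomial pmf by repeated convolution (O(n^2)).

    Included only to show the distribution being approximated.
    """
    pmf = np.array([1.0])
    for p_i in p:
        pmf = np.convolve(pmf, [1.0 - p_i, p_i])
    return pmf
```

A function like `gaussian_approx_nll` could be passed to a general-purpose optimizer to estimate beta. Because the likelihood is not log-concave in general, a local optimizer may be sensitive to its starting point.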


research
02/04/2018

Using Poisson Binomial GLMs to Reveal Voter Preferences

We present a new modeling technique for solving the problem of ecologica...
research
07/13/2016

Multiple-Instance Logistic Regression with LASSO Penalty

In this work, we consider a manufactory process which can be described b...
research
06/13/2019

Efficiency of maximum likelihood estimation for a multinomial distribution with known probability sums

For a multinomial distribution, suppose that we have prior knowledge of ...
research
06/18/2021

On the benefits of maximum likelihood estimation for Regression and Forecasting

We advocate for a practical Maximum Likelihood Estimation (MLE) approach...
research
06/03/2019

Multiplicative Effect Modeling: The General Case

Generalized linear models, such as logistic regression, are widely used ...
research
05/21/2020

Optimal Distributed Subsampling for Maximum Quasi-Likelihood Estimators with Massive Data

Nonuniform subsampling methods are effective to reduce computational bur...
research
12/19/2022

Improving Estimation Efficiency for Two-Phase, Outcome-Dependent Sampling Studies

Two-phase outcome dependent sampling (ODS) is widely used in many fields...
