Using Poisson Binomial GLMs to Reveal Voter Preferences

02/04/2018
by Evan Rosenman, et al.

We present a new modeling technique for ecological inference, the problem of inferring individual-level associations from labeled data available only at the aggregate level. We model aggregate count data as arising from the Poisson binomial, the distribution of the sum of independent but not identically distributed Bernoulli random variables. We relate individual-level probabilities to individual covariates using both a logistic regression and a neural network. A normal approximation is derived via the Lyapunov Central Limit Theorem, allowing us to efficiently fit these models on large datasets. We apply this technique to the problem of revealing voter preferences in the 2016 presidential election, fitting a model to a sample of over four million voters from the highly contested swing state of Pennsylvania. We validate the model at the precinct level via a holdout set, and at the individual level using weak labels, finding that the model is predictive and learns intuitively reasonable associations.
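The idea in the abstract can be illustrated with a minimal sketch. If voter i in a precinct has probability p_i of supporting a candidate, the precinct count is Poisson binomial with mean Σ p_i and variance Σ p_i(1 − p_i), and a normal distribution with those two moments can stand in for the exact likelihood. The Python below assumes a logistic link and synthetic data; the function name `precinct_normal_loglik`, the variable names, and the simulated covariates are all hypothetical and are not taken from the paper's implementation.

```python
import numpy as np

def sigmoid(z):
    """Logistic function mapping real scores to probabilities."""
    return 1.0 / (1.0 + np.exp(-z))

def precinct_normal_loglik(beta, X, precinct_ids, counts):
    """Normal-approximation log-likelihood of aggregate counts (hypothetical helper).

    beta         : (d,) logistic regression coefficients
    X            : (n, d) individual-level covariates
    precinct_ids : (n,) integer precinct index of each individual
    counts       : (m,) observed count of "successes" in each precinct
    """
    p = sigmoid(X @ beta)                      # individual-level probabilities
    m = counts.shape[0]
    # Poisson binomial moments per precinct: sum of p and sum of p(1 - p).
    mu = np.bincount(precinct_ids, weights=p, minlength=m)
    var = np.bincount(precinct_ids, weights=p * (1.0 - p), minlength=m)
    # Gaussian log-density of each observed count, summed over precincts.
    return np.sum(-0.5 * np.log(2.0 * np.pi * var)
                  - 0.5 * (counts - mu) ** 2 / var)

# Synthetic data (an assumption for illustration, not the Pennsylvania data).
rng = np.random.default_rng(0)
n_voters, n_features, n_precincts = 5000, 3, 50
precinct_ids = rng.integers(0, n_precincts, size=n_voters)
# Covariates vary both across precincts and across voters within a precinct.
precinct_means = rng.normal(size=(n_precincts, n_features))
X = precinct_means[precinct_ids] + rng.normal(size=(n_voters, n_features))
true_beta = np.array([1.0, -0.5, 0.25])
votes = (rng.random(n_voters) < sigmoid(X @ true_beta)).astype(float)
counts = np.bincount(precinct_ids, weights=votes, minlength=n_precincts)

ll_true = precinct_normal_loglik(true_beta, X, precinct_ids, counts)
ll_zero = precinct_normal_loglik(np.zeros(n_features), X, precinct_ids, counts)
print(ll_true > ll_zero)  # the generating coefficients should fit better
```

Only the precinct totals in `counts` enter the objective; individual votes are used solely to simulate them. In practice the negative of this function could be handed to a gradient-based optimizer, and the logistic link could be swapped for a neural network that outputs p_i, as the abstract describes.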


research
07/21/2019

Some New Results for Poisson Binomial Models

We consider a problem of ecological inference, in which individual-level...
research
01/16/2019

The median of a jittered Poisson distribution

Let N_λ and U be two independent random variables respectively distribut...
research
03/09/2021

Fractional Poisson random sum and its associated normal variance mixture

In this work, we study the partial sums of independent and identically d...
research
12/14/2021

Data-driven chimney fire risk prediction using machine learning and point process tools

Chimney fires constitute one of the most commonly occurring fire types. ...
research
12/02/2007

Summarization and Classification of Non-Poisson Point Processes

Fitting models for non-Poisson point processes is complicated by the lac...
research
09/16/2018

A Data Analytics Framework for Aggregate Data Analysis

In many contexts, we have access to aggregate data, but individual level...
research
06/30/2020

Bucking the Trend: An Agentive Perspective of Managerial Influence on Blogs Attractiveness

Blog management is central to the digitalization of work. However, exist...
