Partition Decomposition for Roll Call Data

08/13/2011
by   Greg Leibon, et al.
0

In this paper we bring to bear some new tools from statistical learning on the analysis of roll call data. We present a new data-driven model for roll call voting that is geometric in nature. We construct the model by adapting the "Partition Decoupling Method," an unsupervised learning technique originally developed for the analysis of families of time series, to produce a multiscale geometric description of a weighted network associated to a set of roll call votes. Central to this approach is the quantitative notion of a "motivation," a cluster-based and learned basis element that serves as a building block in the representation of roll call data. Motivations enable the formulation of a quantitative description of ideology and their data-dependent nature makes possible a quantitative analysis of the evolution of ideological factors. This approach is generally applicable to roll call data and we apply it in particular to the historical roll call voting of the U.S. House and Senate. This methodology provides a mechanism for estimating the dimension of the underlying action space. We determine that the dominant factors form a low- (one- or two-) dimensional representation with secondary factors adding higher-dimensional features. In this way our work supports and extends the findings of both Poole-Rosenthal and Heckman-Snyder concerning the dimensionality of the action space. We give a detailed analysis of several individual Senates and use the AdaBoost technique from statistical learning to determine those votes with the most powerful discriminatory value. When used as a predictive model, this geometric view significantly outperforms spatial models such as the Poole-Rosenthal DW-NOMINATE model and the Heckman-Snyder 6-factor model, both in raw accuracy as well as Aggregate Proportional Reduced Error (APRE).

READ FULL TEXT
research
01/06/2021

Factor Modelling for Clustering High-dimensional Time Series

We propose a new unsupervised learning method for clustering a large num...
research
12/03/2021

Combining Embeddings and Fuzzy Time Series for High-Dimensional Time Series Forecasting in Internet of Energy Applications

The prediction of residential power usage is essential in assisting a sm...
research
08/03/2020

Conditional Latent Block Model: a Multivariate Time Series Clustering Approach for Autonomous Driving Validation

Autonomous driving systems validation remains one of the biggest challen...
research
05/15/2020

PrimiTect: Fast Continuous Hough Voting for Primitive Detection

This paper tackles the problem of data abstraction in the context of 3D ...
research
12/18/2019

Geometric Considerations of a Good Dictionary for Koopman Analysis of Dynamical Systems

Representation of a dynamical system in terms of simplifying modes is a ...
research
12/24/2017

A Data-driven Approach to Multi-event Analytics in Large-scale Power Systems Using Factor Model

Multi-event detection and recognition in real time is of challenge for a...
research
05/17/2022

Shape complexity in cluster analysis

In cluster analysis, a common first step is to scale the data aiming to ...

Please sign up or login with your details

Forgot password? Click here to reset