Understanding Political Polarisation using Language Models: A dataset and method

by Samiran Gode, et al.

Our paper aims to analyze political polarization in the US political system using language models, and thereby help candidates make an informed decision. The availability of this information will help voters understand their candidates' views on the economy, healthcare, education, and other social issues. Our main contributions are a dataset extracted from Wikipedia that spans the past 120 years and a language-model-based method that helps analyze how polarized a candidate is. Our data is divided into two parts, background information and political information about a candidate, since our hypothesis is that the political views of a candidate should be based on reason and be independent of factors such as birthplace, alma mater, etc. We further split this data into four phases chronologically, to help understand if and how the polarization amongst candidates changes. This data has been cleaned to remove biases. To understand the polarization, we begin by showing results from classical language models, namely Word2Vec and Doc2Vec. We then use more powerful techniques such as the Longformer, a transformer-based encoder, to assimilate more information and find the nearest neighbors of each candidate based on their political views and their background.
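The nearest-neighbor step described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a toy word-count vector in place of the Doc2Vec or Longformer embeddings, and the candidate names, texts, and function names (embed, cosine, nearest_neighbors) are hypothetical and chosen only for illustration. The idea it shows is that neighbors are computed separately for the political text and the background text, so the two views can be compared.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for a document embedding: lowercase word counts.

    Assumption: a real pipeline would use Doc2Vec or Longformer vectors here.
    """
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def nearest_neighbors(name, docs, k=1):
    """Return the k candidates whose text is most similar to `name`'s text."""
    query = embed(docs[name])
    scored = [
        (cosine(query, embed(text)), other)
        for other, text in docs.items()
        if other != name
    ]
    scored.sort(reverse=True)  # highest similarity first
    return [other for _, other in scored[:k]]


# Hypothetical, made-up texts; not taken from the paper's dataset.
political = {
    "candidate_a": "supports lower taxes and reduced federal spending",
    "candidate_b": "supports lower taxes and school choice",
    "candidate_c": "supports universal healthcare and higher minimum wage",
}
background = {
    "candidate_a": "born in ohio studied law at state university",
    "candidate_b": "born in texas studied business",
    "candidate_c": "born in ohio studied law at state university",
}

political_nn = nearest_neighbors("candidate_a", political)
background_nn = nearest_neighbors("candidate_a", background)
print("political neighbor:", political_nn)
print("background neighbor:", background_nn)
```

In this toy data the political neighbor and the background neighbor of candidate_a differ, which is the kind of contrast the two-part split of the dataset is meant to expose.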


