DeepAI AI Chat
Log In Sign Up

Using NLP to measure democracy

by   Thiago Marzagão, et al.
Conselho Administrativo de Defesa Econômica

This paper uses natural language processing to create the first machine-coded democracy index, which I call Automated Democracy Scores (ADS). The ADS are based on 42 million news articles from 6,043 different sources and cover all independent countries in the 1993-2012 period. Unlike the democracy indices we have today the ADS are replicable and have standard errors small enough to actually distinguish between cases. The ADS are produced with supervised learning. Three approaches are tried: a) a combination of Latent Semantic Analysis and tree-based regression methods; b) a combination of Latent Dirichlet Allocation and tree-based regression methods; and c) the Wordscores algorithm. The Wordscores algorithm outperforms the alternatives, so it is the one on which the ADS are based. There is a web application where anyone can change the training set and see how the results change:


page 1

page 2

page 3

page 4


Latent Dirichlet Allocation Models for World Trade Analysis

The international trade is one of the classic areas of study in economic...

Radar de Parité: An NLP system to measure gender representation in French news stories

We present the Radar de Parité, an automated Natural Language Processing...

Automatic Detection of Industry Sectors in Legal Articles Using Machine Learning Approaches

The ability to automatically identify industry sector coverage in articl...

Modeling the dynamics of domain specific terminology in diachronic corpora

In terminology work, natural language processing, and digital humanities...

Automated Code Extraction from Discussion Board Text Dataset

This study introduces and investigates the capabilities of three differe...

Web Robot Detection in Academic Publishing

Recent industry reports assure the rise of web robots which comprise mor...