Councils in Action: Automating the Curation of Municipal Governance Data for Research

04/19/2022
by Eva Maxfield Brown, et al.

Large-scale comparative research into municipal governance is often prohibitively difficult due to a lack of high-quality data. However, recent advances in speech-to-text algorithms and natural language processing have made it easier to collect and analyze data about municipal governments. In this paper, we introduce an open-source platform, the Council Data Project (CDP), to curate novel datasets for research into municipal governance. The contribution of this work is two-fold: (1) we demonstrate that CDP, as an infrastructure, can be used to assemble reliable comparative data on municipal governance; and (2) we provide an exploratory analysis of three municipalities to show how CDP data can be used to gain insight into how municipal governments perform over time. We conclude by describing future directions for research on and with CDP, such as the development of machine learning models for speaker annotation, outline generation, and named entity recognition for improved linked data.


