BasqueParl: A Bilingual Corpus of Basque Parliamentary Transcriptions

05/03/2022
by   Nayla Escribano, et al.
0

Parliamentary transcripts provide a valuable resource to understand the reality and know about the most important facts that occur over time in our societies. Furthermore, the political debates captured in these transcripts facilitate research on political discourse from a computational social science perspective. In this paper we release the first version of a newly compiled corpus from Basque parliamentary transcripts. The corpus is characterized by heavy Basque-Spanish code-switching, and represents an interesting resource to study political discourse in contrasting languages such as Basque and Spanish. We enrich the corpus with metadata related to relevant attributes of the speakers and speeches (language, gender, party...) and process the text to obtain named entities and lemmas. The obtained metadata is then used to perform a detailed corpus analysis which provides interesting insights about the language use of the Basque political representatives across time, parties and gender.

READ FULL TEXT

page 6

page 7

research
10/23/2022

A Greek Parliament Proceedings Dataset for Computational Linguistics and Political Analysis

Large, diachronic datasets of political discourse are hard to come acros...
research
05/17/2017

Political Footprints: Political Discourse Analysis using Pre-Trained Word Vectors

In this paper, we discuss how machine learning could be used to produce ...
research
06/22/2022

Gaining Insights on U.S. Senate Speeches Using a Time Varying Text Based Ideal Point Model

Estimating political positions of lawmakers has a long tradition in poli...
research
08/28/2018

WikiAtomicEdits: A Multilingual Corpus of Wikipedia Edits for Modeling Language and Discourse

We release a corpus of 43 million atomic edits across 8 languages. These...
research
12/29/2022

Examining Political Rhetoric with Epistemic Stance Detection

Participants in political discourse employ rhetorical strategies – such ...
research
06/26/2023

Uncovering Political Hate Speech During Indian Election Campaign: A New Low-Resource Dataset and Baselines

The detection of hate speech in political discourse is a critical issue,...
research
02/22/2019

Topology and dynamics of narratives on Brexit propagated by UK press during 2016 and 2017

This article identifies and characterises political narratives regarding...

Please sign up or login with your details

Forgot password? Click here to reset