NASS-AI: Towards Digitization of Parliamentary Bills using Document Level Embedding and Bidirectional Long Short-Term Memory

10/02/2019
by Adewale Akinfaderin, et al.

There have been several reports in the Nigerian and international media that the Senators and House of Representatives members of the Nigerian National Assembly (NASS) are the highest paid in the world. Despite this high level of parliamentary compensation and a lack of oversight, most legislative activities, such as bills introduced and voting proceedings, remain opaque, with no open and annotated corpus available. In this paper, we present results from ongoing research on the categorization of bills introduced in the Nigerian parliament since the start of the Fourth Republic (1999-2018). For this task, we employed a multi-step approach that involves extracting text from scanned and embedded PDFs of low to medium quality using Optical Character Recognition (OCR) tools and labeling the bills into eight categories. We investigate the performance of document-level embedding for feature representation of the extracted texts, followed by a Bidirectional Long Short-Term Memory (Bi-LSTM) network as the classifier. The performance was further compared with other feature representations and machine learning techniques. We believe that these results are well positioned to have a substantial impact on the quest to meet the basic principles of the Open Data Charter.
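As a rough illustration of the classification step described above, the following is a minimal sketch of a Bi-LSTM forward pass over a sequence of embedding vectors, ending in a softmax over eight categories. It is not the authors' implementation: the abstract does not say how the document-level embeddings are fed to the network, so the input format (a short sequence of vectors per bill), the dimensions, the random untrained weights, and all function names here are assumptions made purely for illustration.

```python
import math
import random

NUM_CATEGORIES = 8   # the abstract mentions eight bill categories
EMBED_DIM = 6        # assumed embedding size, for illustration only
HIDDEN_DIM = 4       # assumed LSTM hidden size, for illustration only


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(weights, vec):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


def init_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def init_lstm(input_dim, hidden_dim, rng):
    # One (weights, bias) pair per gate: input, forget, output, candidate.
    # Each weight matrix acts on the concatenation [x_t ; h_{t-1}].
    return {
        gate: (init_matrix(hidden_dim, input_dim + hidden_dim, rng),
               [0.0] * hidden_dim)
        for gate in "ifog"
    }


def lstm_pass(seq, params, hidden_dim):
    # Run a single-direction LSTM over seq and return the final hidden state.
    h = [0.0] * hidden_dim
    c = [0.0] * hidden_dim
    for x in seq:
        z = x + h  # list concatenation: [x_t ; h_{t-1}]
        pre = {
            gate: [a + b for a, b in zip(matvec(w, z), bias)]
            for gate, (w, bias) in params.items()
        }
        i_gate = [sigmoid(v) for v in pre["i"]]
        f_gate = [sigmoid(v) for v in pre["f"]]
        o_gate = [sigmoid(v) for v in pre["o"]]
        cand = [math.tanh(v) for v in pre["g"]]
        c = [f * cv + i * g for f, cv, i, g in zip(f_gate, c, i_gate, cand)]
        h = [o * math.tanh(cv) for o, cv in zip(o_gate, c)]
    return h


def softmax(logits):
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def bilstm_classify(seq, fwd_params, bwd_params, out_weights, hidden_dim):
    # Forward direction reads the sequence as given, backward reads it reversed;
    # the two final hidden states are concatenated before the output layer.
    h_fwd = lstm_pass(seq, fwd_params, hidden_dim)
    h_bwd = lstm_pass(list(reversed(seq)), bwd_params, hidden_dim)
    return softmax(matvec(out_weights, h_fwd + h_bwd))


rng = random.Random(0)
fwd = init_lstm(EMBED_DIM, HIDDEN_DIM, rng)
bwd = init_lstm(EMBED_DIM, HIDDEN_DIM, rng)
w_out = init_matrix(NUM_CATEGORIES, 2 * HIDDEN_DIM, rng)

# Hypothetical input: one bill represented as five embedding vectors.
bill = [[rng.uniform(-1, 1) for _ in range(EMBED_DIM)] for _ in range(5)]
probs = bilstm_classify(bill, fwd, bwd, w_out, HIDDEN_DIM)
predicted_category = probs.index(max(probs))
```

Because the weights are random and untrained, the predicted category is meaningless here; the sketch only shows how the forward and backward passes are combined into a distribution over the eight labels. In practice a deep learning library would supply the trained layers.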

