Unsupervised Bias Detection in College Student Newspapers

09/11/2023
by Adam M. Lehavi, et al.

This paper presents a pipeline with minimal human influence for scraping college newspaper archives and detecting bias in them. It introduces a framework for scraping complex archive sites that automated tools fail to extract data from, and uses it to generate a dataset of 14 student papers with 23,154 entries. This data can then be queried by keyword to calculate bias by comparing the sentiment of a large language model summary to that of the original article. The advantages of this approach are that it is less comparative than reconstruction bias and requires less labelled data than generating keyword sentiment. Results are calculated on politically charged words as well as control words to show how conclusions can be drawn. The complete method facilitates the extraction of nuanced insights with minimal assumptions and categorizations, paving the way for a more objective understanding of bias within student newspaper sources.
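The keyword query and the summary-versus-article comparison can be illustrated with a minimal sketch. This is not the authors' implementation. The toy word lists, the `sentiment`, `bias_score`, and `keyword_bias` names, the entry fields `article` and `summary`, and the choice to score bias as article sentiment minus summary sentiment are all assumptions made for illustration. The summaries are supplied as plain strings, standing in for the output of a large language model, and a tiny lexicon stands in for a real sentiment model.

```python
from statistics import mean

# Hypothetical toy lexicon; a real pipeline would use a trained sentiment model.
POSITIVE = {"good", "great", "praised", "success", "benefit"}
NEGATIVE = {"bad", "failed", "criticized", "harm", "crisis"}


def sentiment(text: str) -> float:
    """Score text in [-1, 1] as (positive - negative) / matched words.

    Returns 0.0 when no lexicon word appears in the text.
    """
    tokens = [t.strip(".,;:!?\"'()").lower() for t in text.split()]
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    matched = pos + neg
    return 0.0 if matched == 0 else (pos - neg) / matched


def bias_score(article: str, summary: str) -> float:
    """Assumed definition: article sentiment minus summary sentiment."""
    return sentiment(article) - sentiment(summary)


def keyword_bias(entries, keyword: str):
    """Average bias score over entries whose article mentions the keyword.

    Returns None when no entry matches.
    """
    needle = keyword.lower()
    scores = [
        bias_score(e["article"], e["summary"])
        for e in entries
        if needle in e["article"].lower()
    ]
    return mean(scores) if scores else None


# Made-up example entries; the summaries stand in for LLM output.
entries = [
    {
        "article": "The council praised the tuition plan as a great success.",
        "summary": "The council discussed the tuition plan.",
    },
    {
        "article": "Students criticized the tuition increase as a bad decision.",
        "summary": "Students criticized the tuition increase.",
    },
]

tuition_bias = keyword_bias(entries, "tuition")
parking_bias = keyword_bias(entries, "parking")
```

Under these assumptions, a score near zero means the summary carries about the same sentiment as the article, while a large positive or negative score means the article is more charged than its summary. Running the same query on control words gives a baseline to compare politically charged words against.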
