What's happening in your neighborhood? A Weakly Supervised Approach to Detect Local News

01/15/2023
by   Deven Santosh Shah, et al.
0

Local news articles are a subset of news that impact users in a geographical area, such as a city, county, or state. Detecting local news (Step 1) and subsequently deciding its geographical location as well as radius of impact (Step 2) are two important steps towards accurate local news recommendation. Naive rule-based methods, such as detecting city names from the news title, tend to give erroneous results due to lack of understanding of the news content. Empowered by the latest development in natural language processing, we develop an integrated pipeline that enables automatic local news detection and content-based local news recommendations. In this paper, we focus on Step 1 of the pipeline, which highlights: (1) a weakly supervised framework incorporated with domain knowledge and auto data processing, and (2) scalability to multi-lingual settings. Compared with Stanford CoreNLP NER model, our pipeline has higher precision and recall evaluated on a real-world and human-labeled dataset. This pipeline has potential to more precise local news to users, helps local businesses get more exposure, and gives people more information about their neighborhood safety.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/11/2023

Local Life: Stay Informed Around You, A Scalable Geoparsing and Geotagging Approach to Serve Local News Worldwide

Local news has become increasingly important in the news industry due to...
research
07/13/2023

Going Beyond Local: Global Graph-Enhanced Personalized News Recommendations

Precisely recommending candidate news articles to users has always been ...
research
01/08/2022

Decoupling Makes Weakly Supervised Local Feature Better

Weakly supervised learning can help local feature methods to overcome th...
research
02/15/2022

Media Slant is Contagious

This paper analyzes the influence of partisan content from national cabl...
research
09/25/2020

PerKey: A Persian News Corpus for Keyphrase Extraction and Generation

Keyphrases provide an extremely dense summary of a text. Such informatio...
research
04/17/2023

Do you MIND? Reflections on the MIND dataset for research on diversity in news recommendations

The MIND dataset is at the moment of writing the most extensive dataset ...
research
06/30/2023

A New Task and Dataset on Detecting Attacks on Human Rights Defenders

The ability to conduct retrospective analyses of attacks on human rights...

Please sign up or login with your details

Forgot password? Click here to reset