DeepAI AI Chat
Log In Sign Up

What's happening in your neighborhood? A Weakly Supervised Approach to Detect Local News

by   Deven Santosh Shah, et al.

Local news articles are a subset of news that impact users in a geographical area, such as a city, county, or state. Detecting local news (Step 1) and subsequently deciding its geographical location as well as radius of impact (Step 2) are two important steps towards accurate local news recommendation. Naive rule-based methods, such as detecting city names from the news title, tend to give erroneous results due to lack of understanding of the news content. Empowered by the latest development in natural language processing, we develop an integrated pipeline that enables automatic local news detection and content-based local news recommendations. In this paper, we focus on Step 1 of the pipeline, which highlights: (1) a weakly supervised framework incorporated with domain knowledge and auto data processing, and (2) scalability to multi-lingual settings. Compared with Stanford CoreNLP NER model, our pipeline has higher precision and recall evaluated on a real-world and human-labeled dataset. This pipeline has potential to more precise local news to users, helps local businesses get more exposure, and gives people more information about their neighborhood safety.


page 1

page 2

page 3

page 4


Decoupling Makes Weakly Supervised Local Feature Better

Weakly supervised learning can help local feature methods to overcome th...

Media Slant is Contagious

This paper analyzes the influence of partisan content from national cabl...

Empowering News Recommendation with Pre-trained Language Models

Personalized news recommendation is an essential technique for online ne...

Using user's local context to support local news

American local newspapers have been experiencing a large loss of reader ...

PerKey: A Persian News Corpus for Keyphrase Extraction and Generation

Keyphrases provide an extremely dense summary of a text. Such informatio...

A Cross-lingual Natural Language Processing Framework for Infodemic Management

The COVID-19 pandemic has put immense pressure on health systems which a...

Combining Lexical and Syntactic Features for Detecting Content-dense Texts in News

Content-dense news report important factual information about an event i...