Replication of the Keyword Extraction part of the paper "'Without the Clutter of Unimportant Words': Descriptive Keyphrases for Text Visualization"

08/15/2019
by   Shibamouli Lahiri, et al.
0

"Keyword Extraction" refers to the task of automatically identifying the most relevant and informative phrases in natural language text. As we are deluged with large amounts of text data in many different forms and content - emails, blogs, tweets, Facebook posts, academic papers, news articles - the task of "making sense" of all this text by somehow summarizing them into a coherent structure assumes paramount importance. Keyword extraction - a well-established problem in Natural Language Processing - can help us here. In this report, we construct and test three different hypotheses (all related to the task of keyword extraction) that take us one step closer to understanding how to meaningfully identify and extract "descriptive" keyphrases. The work reported here was done as part of replicating the study by Chuang et al. [3].

READ FULL TEXT

page 19

page 24

page 25

research
09/25/2020

PerKey: A Persian News Corpus for Keyphrase Extraction and Generation

Keyphrases provide an extremely dense summary of a text. Such informatio...
research
07/14/2018

Generating Synthetic Data for Neural Keyword-to-Question Models

Search typically relies on keyword queries, but these are often semantic...
research
09/28/2022

Keyword Extraction from Short Texts with a Text-To-Text Transfer Transformer

The paper explores the relevance of the Text-To-Text Transfer Transforme...
research
07/15/2019

RaKUn: Rank-based Keyword extraction via Unsupervised learning and Meta vertex aggregation

Keyword extraction is used for summarizing the content of a document and...
research
09/11/2023

Unsupervised Bias Detection in College Student Newspapers

This paper presents a pipeline with minimal human influence for scraping...
research
10/21/2020

Using the Full-text Content of Academic Articles to Identify and Evaluate Algorithm Entities in the Domain of Natural Language Processing

In the era of big data, the advancement, improvement, and application of...
research
07/13/2018

New/s/leak 2.0 - Multilingual Information Extraction and Visualization for Investigative Journalism

Investigative journalism in recent years is confronted with two major ch...

Please sign up or login with your details

Forgot password? Click here to reset