Which techniques does your application use?: An information extraction framework for scientific articles

08/23/2016
by   Soham Dan, et al.
0

Every field of research consists of multiple application areas with various techniques routinely used to solve problems in these wide range of application areas. With the exponential growth in research volumes, it has become difficult to keep track of the ever-growing number of application areas as well as the corresponding problem solving techniques. In this paper, we consider the computational linguistics domain and present a novel information extraction system that automatically constructs a pool of all application areas in this domain and appropriately links them with corresponding problem solving techniques. Further, we categorize individual research articles based on their application area and the techniques proposed/used in the article. k-gram based discounting method along with handwritten rules and bootstrapped pattern learning is employed to extract application areas. Subsequently, a language modeling approach is proposed to characterize each article based on its application area. Similarly, regular expressions and high-scoring noun phrases are used for the extraction of the problem solving techniques. We propose a greedy approach to characterize each article based on the techniques. Towards the end, we present a table representing the most frequent techniques adopted for a particular application area. Finally, we propose three use cases presenting an extensive temporal analysis of the usage of techniques and application areas.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/10/2017

AppTechMiner: Mining Applications and Techniques from Scientific Articles

This paper presents AppTechMiner, a rule-based information extraction fr...
research
02/16/2023

A Survey on Event-based News Narrative Extraction

Narratives are fundamental to our understanding of the world, providing ...
research
08/05/2019

Backronym

The field of Machine Learning research is divided into subject areas, wh...
research
03/17/2011

Extraction of handwritten areas from colored image of bank checks by an hybrid method

One of the first step in the realization of an automatic system of check...
research
10/30/2018

An architecture of open-source tools to combine textual information extraction, faceted search and information visualisation

This article presents our steps to integrate complex and partly unstruct...
research
12/21/2012

Topic Extraction and Bundling of Related Scientific Articles

Automatic classification of scientific articles based on common characte...

Please sign up or login with your details

Forgot password? Click here to reset