Automatic Recognition and Classification of Future Work Sentences from Academic Articles in a Specific Domain

12/28/2022
by   Chengzhi Zhang, et al.

Future work sentences (FWS) are the sentences in academic papers in which authors describe their proposed follow-up research directions. This paper presents methods to automatically extract FWS from academic papers and classify them according to the future directions expressed in their content. FWS recognition methods will enable subsequent researchers to locate future work sentences more accurately and quickly and reduce the time and cost of acquiring the corpus. Existing work on the automatic identification of FWS is relatively scarce, and existing methods cannot accurately identify FWS from academic papers, which prevents data mining on a large scale. Furthermore, future work content covers many aspects, and subdividing it is conducive to analyzing specific development directions. In this paper, Natural Language Processing (NLP) is used as a case study, and FWS are extracted from academic papers and classified into different types. We manually build an annotated corpus with six different types of FWS. Then, automatic recognition and classification of FWS are implemented using machine learning models, and the performance of these models is compared based on the evaluation metrics. The results show that the Bernoulli Bayesian model has the best performance in the automatic recognition task, with the Macro F1 reaching 90.73%, [...] model has the best performance in the automatic classification task, with the weighted average F1 reaching 72.63%. [...] gain a deep understanding of the key content described in FWS, and we also demonstrate that content determination in FWS will be reflected in the subsequent research work by measuring the similarity between future work sentences and the abstracts.
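The abstract reports a Macro F1 for recognition and a weighted average F1 for classification. The two averages can differ noticeably when classes are imbalanced. The following is a minimal sketch of how each is computed, not the authors' evaluation code; the function names, the labels "fws" and "other", and the toy predictions are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def per_class_f1(y_true, y_pred):
    """Return {label: (f1, support)} for every label seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    support = Counter(y_true)  # number of true instances per label
    pairs = list(zip(y_true, y_pred))
    scores = {}
    for label in labels:
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0.
        f1 = 2 * tp / denom if denom else 0.0
        scores[label] = (f1, support[label])
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1: every class counts equally."""
    scores = per_class_f1(y_true, y_pred)
    return sum(f1 for f1, _ in scores.values()) / len(scores)


def weighted_f1(y_true, y_pred):
    """Mean of per-class F1 weighted by each class's number of true instances."""
    scores = per_class_f1(y_true, y_pred)
    total = sum(n for _, n in scores.values())
    return sum(f1 * n for f1, n in scores.values()) / total


# Hypothetical toy data: 2 future work sentences, 4 other sentences.
y_true = ["fws", "fws", "other", "other", "other", "other"]
y_pred = ["fws", "other", "other", "other", "other", "other"]

macro = macro_f1(y_true, y_pred)        # (2/3 + 8/9) / 2 = 7/9
weighted = weighted_f1(y_true, y_pred)  # (2/3 * 2 + 8/9 * 4) / 6 = 22/27
print(f"macro F1 = {macro:.4f}, weighted F1 = {weighted:.4f}")
```

In this toy case the weighted average is higher than the macro average because the majority class is predicted better than the minority class, so the two figures are not directly comparable across tasks.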


