Utilizing Out-Domain Datasets to Enhance Multi-Task Citation Analysis

by Dominique Mercier, et al.

Citations are generally analyzed using only quantitative measures, excluding qualitative aspects such as sentiment and intent. However, qualitative aspects provide deeper insights into the impact of a scientific research artifact and make it possible to focus on relevant literature free from the bias associated with quantitative aspects. It is therefore possible to rank and categorize papers based on their sentiment and intent. For this purpose, larger citation sentiment datasets are required. However, curating a large citation sentiment dataset is challenging in terms of both time and cost: citation sentiment analysis suffers from data scarcity and high annotation costs. To overcome the bottleneck of data scarcity in the citation analysis domain, we explore the impact of out-domain data during training on model performance. Our results show that the appropriate data scheduling method depends on the use case. We found empirically that a model trained using sequential data scheduling is more suitable for domain-specific use cases. Conversely, shuffled data feeding achieves better performance on a cross-domain task. Based on our findings, we propose an end-to-end trainable multi-task model that covers sentiment and intent analysis and utilizes out-domain datasets to overcome data scarcity.
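The abstract contrasts two ways of feeding out-domain and in-domain data to the model but does not spell out either procedure. The sketch below shows one plausible reading, not the authors' implementation: it assumes "sequential" means all out-domain examples are presented before the in-domain ones, and "shuffled" means both sources are mixed into a single random order. The function names, the `(text, label)` example format, and the sample sentences are all hypothetical.

```python
import random
from typing import List, Sequence, Tuple

# Hypothetical example format: (text, label).
Example = Tuple[str, str]


def sequential_schedule(
    out_domain: Sequence[Example], in_domain: Sequence[Example]
) -> List[Example]:
    """Assumed reading of 'sequential': out-domain data first, in-domain data last."""
    return list(out_domain) + list(in_domain)


def shuffled_schedule(
    out_domain: Sequence[Example], in_domain: Sequence[Example], seed: int = 0
) -> List[Example]:
    """Assumed reading of 'shuffled': both sources mixed into one random order."""
    mixed = list(out_domain) + list(in_domain)
    random.Random(seed).shuffle(mixed)  # seeded for reproducibility
    return mixed


# Made-up toy data, for illustration only.
out_domain = [("great movie", "positive"), ("terrible service", "negative")]
in_domain = [("this method outperforms [3]", "positive")]

seq = sequential_schedule(out_domain, in_domain)
mix = shuffled_schedule(out_domain, in_domain, seed=42)
```

Under this reading, the sequential order ends training on in-domain citation data, whereas the shuffled order exposes the model to both domains throughout.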




ImpactCite: An XLNet-based method for Citation Impact Analysis

Citations play a vital role in understanding the impact of scientific li...

A quantitative and qualitative open citation analysis of retracted articles in the humanities

In this article, we show and discuss the results of a quantitative and q...

Structural Scaffolds for Citation Intent Classification in Scientific Publications

Identifying the intent of a citation in scientific papers (e.g., backgro...

SentiCite: An Approach for Publication Sentiment Analysis

With the rapid growth in the number of scientific publications, year aft...

Citation Sentiment Changes Analysis

Metrics for measuring the citation sentiment changes were introduced. Ci...

Article citation study: Context enhanced citation sentiment detection

Citation sentiment analysis is one of the little studied tasks for scient...

Multitask Learning for Citation Purpose Classification

We present our entry into the 2021 3C Shared Task Citation Context Class...
