Incidental or influential? - Challenges in automatically detecting citation importance using publication full texts

07/13/2017
by David Pride, et al.

This work examines in depth several studies that have attempted to automate citation importance classification based on a publication's full text. We analyse a range of features that have previously been used in this task. Our experimental results confirm that the number of in-text references is highly predictive of influence. Contrary to the work of Valenzuela et al., we find abstract similarity to be one of the most predictive features. Overall, we show that many of the features previously described in the literature are not particularly predictive. Consequently, we discuss challenges and potential improvements in the classification pipeline, provide a critical review of the performance of individual features, and address the importance of constructing a large-scale gold standard reference dataset.
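
To make the two features highlighted in the abstract concrete, here is a minimal sketch of how they might be computed. It is an illustration under stated assumptions, not the authors' implementation: the function names, the bag-of-words cosine similarity, and the exact-string matching of a citation marker are all hypothetical choices that the abstract does not specify.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (assumed scheme)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def abstract_similarity(abstract_a, abstract_b):
    """Cosine similarity between raw term-frequency vectors of two abstracts.

    Assumption: plain term counts; a real pipeline might use TF-IDF or
    another weighting instead.
    """
    a, b = Counter(tokenize(abstract_a)), Counter(tokenize(abstract_b))
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def count_in_text_references(full_text, marker):
    """Count how often a citation marker such as "[3]" occurs in the full text.

    Assumption: the marker appears verbatim; grouped citations such as
    "[3, 4]" would need proper citation parsing and are not handled here.
    """
    return full_text.count(marker)


if __name__ == "__main__":
    citing = "We classify citation importance using full text features."
    cited = "Full text features for citation importance classification."
    print(abstract_similarity(citing, cited))
    print(count_in_text_references("See [3]. As shown in [3], and also [4].", "[3]"))
```

A classifier would take values like these as inputs alongside other features; how they are weighted against each other is exactly what the study evaluates.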


research
03/27/2019

Highly cited references in PLOS ONE and their in-text usage over time

In this article, we describe highly cited publications in a PLOS ONE ful...
research
05/23/2022

A Natural Language Processing Pipeline for Detecting Informal Data References in Academic Literature

Discovering authoritative links between publications and the datasets th...
research
05/23/2017

Reference String Extraction Using Line-Based Conditional Random Fields

The extraction of individual reference strings from the reference sectio...
research
03/23/2020

Interdisciplinarity metric based on the co-citation network

Quantifying the interdisciplinarity of a research is a relevant problem ...
research
03/27/2023

unarXive 2022: All arXiv Publications Pre-Processed for NLP, Including Structured Full-Text and Citation Network

Large-scale data sets on scholarly publications are the basis for a vari...
research
12/02/2021

Towards Generating Citation Sentences for Multiple References with Intent Control

Machine-generated citation sentences can aid automated scientific litera...
research
11/05/2018

Identifying influential patents in citation networks using enhanced VoteRank centrality

This study proposes the usage of a method called VoteRank, created by Zh...
