EXmatcher: Combining Features Based on Reference Strings and Segments to Enhance Citation Matching

06/11/2019
by Behnam Ghavimi, et al.

Citation matching is a challenging task because of problems such as the variety of citation styles, mistakes in reference strings, and the varying quality of identified reference segments. The classic citation matching configuration used in this paper combines a blocking technique with a binary classifier. Three possible inputs (reference strings, reference segments, and a combination of reference strings and segments) were tested to find the most effective strategy for citation matching. In the classification step, we describe the effect that the probabilities of reference segments can have on citation matching. Our evaluation on a manually curated gold standard showed that input data combining reference segments and reference strings leads to the best result. In addition, using the segmentation probabilities slightly improves the result.
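To make the blocking-plus-classifier configuration concrete, here is a minimal sketch in Python. It is not the authors' implementation: the blocking key (publication year), the feature set, the `difflib` similarity measure, the 0.6 threshold used in place of a trained binary classifier, and all record data are assumptions chosen only for illustration.

```python
from collections import defaultdict
from difflib import SequenceMatcher


def similarity(a, b):
    """Character-level similarity in [0, 1] between two strings."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def build_blocks(records):
    """Blocking step (assumed key: publication year) to limit comparisons."""
    blocks = defaultdict(list)
    for record in records:
        blocks[record["year"]].append(record)
    return blocks


def features(reference, record):
    """One whole-string feature plus one probability-weighted feature per segment."""
    record_string = f'{record["author"]} ({record["year"]}). {record["title"]}'
    feats = [similarity(reference["string"], record_string)]
    for field, (value, prob) in reference["segments"].items():
        # Down-weight segments the (hypothetical) segmenter was unsure about.
        feats.append(prob * similarity(value, record[field]))
    return feats


def is_match(feats, threshold=0.6):
    """Stand-in for a trained binary classifier: mean feature score vs. threshold."""
    return sum(feats) / len(feats) >= threshold


def match(reference, records):
    """Return ids of records in the reference's block that are classified as matches."""
    blocks = build_blocks(records)
    year = reference["segments"]["year"][0]
    return [r["id"] for r in blocks.get(year, []) if is_match(features(reference, r))]


# Toy bibliographic records; r2 and r3 are invented for this example.
records = [
    {"id": "r1", "author": "Ghavimi", "year": "2019",
     "title": "EXmatcher: Combining Features Based on Reference Strings "
              "and Segments to Enhance Citation Matching"},
    {"id": "r2", "author": "Doe", "year": "2019",
     "title": "An Unrelated Toy Title About Graph Databases"},
    {"id": "r3", "author": "Doe", "year": "2018",
     "title": "Another Toy Title"},
]

# A reference with its raw string and its segments as (value, probability) pairs.
reference = {
    "string": "Ghavimi, B. (2019). EXmatcher: combining features based on "
              "reference strings and segments to enhance citation matching.",
    "segments": {
        "author": ("Ghavimi, B.", 0.9),
        "year": ("2019", 0.95),
        "title": ("EXmatcher: combining features based on reference strings "
                  "and segments to enhance citation matching", 0.9),
    },
}

print(match(reference, records))
```

With this toy data, blocking discards the 2018 record before any pairwise comparison takes place, and only the two 2019 records are scored. In a real system the threshold rule would be replaced by a classifier trained on labeled match and non-match pairs.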


