An Integrated, Conditional Model of Information Extraction and Coreference with Applications to Citation Matching

07/11/2012
by   Ben Wellner, et al.

Although information extraction and coreference resolution appear together in many applications, most current systems perform them as independent steps. This paper describes an approach to integrated inference for extraction and coreference based on conditionally trained undirected graphical models. We discuss the advantages of conditional probability training, and of a coreference model structure based on graph partitioning. On a data set of research paper citations, we show a significant reduction in error by using extraction uncertainty to improve the accuracy of coreference-based citation matching, and by using coreference to improve the accuracy of the extracted fields.
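To make the graph-partitioning view of coreference concrete, here is a minimal sketch, not the authors' implementation. It treats each citation mention as a node, assumes a signed pairwise score for every pair of mentions (positive suggests the two refer to the same paper, negative suggests they do not), and greedily merges clusters while doing so increases the total within-cluster score. The function name `greedy_partition`, the example weights, and the greedy merging heuristic are all assumptions for illustration; in the model the abstract describes, such scores would come from conditionally trained parameters rather than being set by hand.

```python
from itertools import combinations


def greedy_partition(n, weight):
    """Greedily partition nodes 0..n-1 into coreference clusters.

    `weight` maps frozenset({i, j}) to a signed compatibility score;
    pairs that are missing from the mapping count as 0.
    This is a simple heuristic, assumed here for illustration only.
    """
    clusters = [{i} for i in range(n)]

    def link(a, b):
        # Total score of all edges running between clusters a and b.
        return sum(weight.get(frozenset((i, j)), 0.0) for i in a for j in b)

    while len(clusters) > 1:
        best_score, best_pair = 0.0, None
        for x, y in combinations(range(len(clusters)), 2):
            score = link(clusters[x], clusters[y])
            if score > best_score:
                best_score, best_pair = score, (x, y)
        if best_pair is None:
            break  # no merge would raise the within-cluster score
        x, y = best_pair
        clusters[x] |= clusters[y]
        del clusters[y]  # y > x, so index x is unaffected
    return sorted(sorted(c) for c in clusters)


# Hypothetical scores for four citation mentions (made up, not from the paper).
weights = {
    frozenset((0, 1)): 2.0,
    frozenset((0, 2)): -1.5,
    frozenset((0, 3)): -1.0,
    frozenset((1, 2)): -0.5,
    frozenset((1, 3)): -0.2,
    frozenset((2, 3)): 1.0,
}
partition = greedy_partition(4, weights)
print(partition)
```

With these made-up scores, mentions 0 and 1 end up in one cluster and mentions 2 and 3 in another, because merging the two resulting clusters would add only negative edges.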
