Occurrence Statistics of Entities, Relations and Types on the Web

05/14/2016
by Aman Madaan, et al.

This report addresses the problem of collecting reliable estimates of how often entities occur on the open web. Models learned for tagging entities cannot be expected to perform well when deployed on the web, because the distribution of entities on the web differs severely from their distribution in the comparatively small training data. We build the case for using maximum mean discrepancy to estimate occurrence statistics of entities on the web, reviewing named entity disambiguation techniques and related concepts along the way.

research
08/31/2019

Open Named Entity Modeling from Embedding Distribution

In this paper, we report our discovery on named entity distribution in g...
research
07/18/2016

Joint Event Detection and Entity Resolution: a Virtuous Cycle

Clustering web documents has numerous applications, such as aggregating ...
research
04/08/2020

Entity-Switched Datasets: An Approach to Auditing the In-Domain Robustness of Named Entity Recognition Models

Named entity recognition systems perform well on standard datasets compr...
research
04/25/2023

Hypernymization of named entity-rich captions for grounding-based multi-modal pretraining

Named entities are ubiquitous in text that naturally accompanies images,...
research
01/01/2021

How Do Your Biomedical Named Entity Models Generalize to Novel Entities?

The number of biomedical literature on new biomedical concepts is rapidl...
research
03/26/2018

Empirical Analysis of Foundational Distinctions in the Web of Data

A main difference between pre-Web artificial intelligence and the curren...
research
12/19/2013

Using Web Co-occurrence Statistics for Improving Image Categorization

Object recognition and localization are important tasks in computer visi...
