Instilling Type Knowledge in Language Models via Multi-Task QA

04/28/2022
by   Shuyang Li, et al.
6

Understanding human language often necessitates understanding entities and their place in a taxonomy of knowledge – their types. Previous methods to learn entity types rely on training classifiers on datasets with coarse, noisy, and incomplete labels. We introduce a method to instill fine-grained type knowledge in language models with text-to-text pre-training on type-centric questions leveraging knowledge base documents and knowledge graphs. We create the WikiWiki dataset: entities and passages from 10M Wikipedia articles linked to the Wikidata knowledge graph with 41K types. Models trained on WikiWiki achieve state-of-the-art performance in zero-shot dialog state tracking benchmarks, accurately infer entity types in Wikipedia articles, and can discover new types deemed useful by human judges.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/29/2020

Exploiting Structured Knowledge in Text via Graph-Guided Representation Learning

In this work, we aim at equipping pre-trained language models with struc...
research
07/07/2019

Zero-Shot Open Entity Typing as Type-Compatible Grounding

The problem of entity-typing has been studied predominantly in supervise...
research
05/21/2023

OntoType: Ontology-Guided Zero-Shot Fine-Grained Entity Typing with Weak Supervision from Pre-Trained Language Models

Fine-grained entity typing (FET), which assigns entities in text with co...
research
06/03/2023

Table and Image Generation for Investigating Knowledge of Entities in Pre-trained Vision and Language Models

In this paper, we propose a table and image generation task to verify ho...
research
09/21/2023

SLHCat: Mapping Wikipedia Categories and Lists to DBpedia by Leveraging Semantic, Lexical, and Hierarchical Features

Wikipedia articles are hierarchically organized through categories and l...
research
06/15/2023

Neural models for Factual Inconsistency Classification with Explanations

Factual consistency is one of the most important requirements when editi...
research
05/05/2022

Entity Cloze By Date: What LMs Know About Unseen Entities

Language models (LMs) are typically trained once on a large-scale corpus...

Please sign up or login with your details

Forgot password? Click here to reset