KILT: a Benchmark for Knowledge Intensive Language Tasks

09/04/2020
by   Fabio Petroni, et al.
6

Challenging problems such as open-domain question answering, fact checking, slot filling and entity linking require access to large, external knowledge sources. While some models do well on individual tasks, developing general models is difficult as each task might require computationally expensive indexing of custom knowledge sources, in addition to dedicated infrastructure. To catalyze research on models that condition on specific information in large textual resources, we present a benchmark for knowledge-intensive language tasks (KILT). All tasks in KILT are grounded in the same snapshot of Wikipedia, reducing engineering turnaround through the re-use of components, as well as accelerating research into task-agnostic memory architectures. We test both task-specific and general baselines, evaluating downstream performance in addition to the ability of the models to provide provenance. We find that a shared dense vector index coupled with a seq2seq model is a strong baseline, outperforming more tailor-made approaches for fact checking, open-domain question answering and dialogue, and yielding competitive results on entity linking and slot filling, by generating disambiguated text. KILT data and code are available at https://github.com/facebookresearch/KILT.

READ FULL TEXT
research
10/12/2021

Mention Memory: incorporating textual knowledge into Transformers through entity mention attention

Natural language understanding tasks such as open-domain question answer...
research
06/02/2021

Efficient Passage Retrieval with Hashing for Open-domain Question Answering

Most state-of-the-art open-domain question answering systems use a neura...
research
12/16/2019

Improving Knowledge-aware Dialogue Generation via Knowledge Base Question Answering

Neural network models usually suffer from the challenge of incorporating...
research
04/16/2021

Editing Factual Knowledge in Language Models

The factual knowledge acquired during pretraining and stored in the para...
research
12/15/2021

Event Linking: Grounding Event Mentions to Wikipedia

Comprehending an article requires understanding its constituent events. ...
research
06/12/2021

Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP

Retrieval is a core component for open-domain NLP tasks. In open-domain ...
research
05/21/2023

TheoremQA: A Theorem-driven Question Answering dataset

The recent LLMs like GPT-4 and PaLM-2 have made tremendous progress in s...

Please sign up or login with your details

Forgot password? Click here to reset