Rethinking Search: Making Experts out of Dilettantes

05/05/2021
by   Donald Metzler, et al.
6

When experiencing an information need, users want to engage with an expert, but often turn to an information retrieval system, such as a search engine, instead. Classical information retrieval systems do not answer information needs directly, but instead provide references to (hopefully authoritative) answers. Successful question answering systems offer a limited corpus created on-demand by human experts, which is neither timely nor scalable. Large pre-trained language models, by contrast, are capable of directly generating prose that may be responsive to an information need, but at present they are dilettantes rather than experts - they do not have a true understanding of the world, they are prone to hallucinating, and crucially they are incapable of justifying their utterances by referring to supporting documents in the corpus they were trained over. This paper examines how ideas from classical information retrieval and large pre-trained language models can be synthesized and evolved into systems that truly deliver on the promise of expert advice.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/03/2020

On-The-Fly Information Retrieval Augmentation for Language Models

Here we experiment with the use of information retrieval as an augmentat...
research
05/11/2023

WebCPM: Interactive Web Search for Chinese Long-form Question Answering

Long-form question answering (LFQA) aims at answering complex, open-ende...
research
05/10/2020

How Context Affects Language Models' Factual Predictions

When pre-trained on large unsupervised textual corpora, language models ...
research
07/10/2019

ReQA: An Evaluation for End-to-End Answer Retrieval Models

Popular QA benchmarks like SQuAD have driven progress on the task of ide...
research
02/23/2023

Dr ChatGPT, tell me what I want to hear: How prompt knowledge impacts health answer correctness

Generative pre-trained language models (GPLMs) like ChatGPT encode in th...
research
06/16/2012

Information Retrieval in Intelligent Systems: Current Scenario & Issues

Web space is the huge repository of data. Everyday lots of new informati...
research
10/19/2022

Revision Transformers: Getting RiT of No-Nos

Current transformer language models (LM) are large-scale models with bil...

Please sign up or login with your details

Forgot password? Click here to reset