Investigating Answerability of LLMs for Long-Form Question Answering

09/15/2023
by Meghana Moorthy Bhat, et al.

As we embark on a new era of LLMs, it becomes increasingly crucial to understand their capabilities, limitations, and differences. Toward further progress in this direction, we strive to build a deeper understanding of the gaps between massive LLMs (e.g., ChatGPT) and smaller yet effective open-source LLMs and their distilled counterparts. To this end, we focus on long-form question answering (LFQA) because it has several practical and impactful applications (e.g., troubleshooting, customer service) yet is still understudied and challenging for LLMs. We propose a method for generating questions from abstractive summaries and show that generating follow-up questions from summaries of long documents creates a challenging setting in which LLMs must reason and infer from long contexts. Our experimental results confirm that: (1) our proposed method of generating questions from abstractive summaries poses a challenging setup for LLMs and reveals performance gaps between LLMs like ChatGPT and open-source LLMs (Alpaca, Llama); and (2) open-source LLMs exhibit decreased reliance on context for questions generated from the original document, but their generation capabilities drop significantly on questions generated from summaries, especially for longer contexts (>1024 tokens).

