Large Language Models are few(1)-shot Table Reasoners

10/13/2022
by   Wenhu Chen, et al.
0

Recent literature has shown that large language models (LLMs) are generally excellent few-shot reasoners to solve text reasoning tasks. However, the capability of LLMs on table reasoning tasks is yet to be explored. In this paper, we aim at understanding how well LLMs can perform on these table tasks with few-shot in-context learning. Specifically, we evaluate LLMs on popular table QA and fact verification datasets like WikiTableQuestion, FetaQA, TabFact, and FEVEROUS and found that LLMs are really competent at complex reasoning over table structures. When combined with `chain of thoughts' prompting, GPT-3 is able to achieve very strong performance with only a 1-shot demonstration. We further manually study the reasoning chains elicited from LLMs and found that these reasoning chains are highly consistent with the `ground truth' semantic form. We believe that our study opens new possibilities to employ LLMs on different table-based reasoning tasks under few-shot scenario.

READ FULL TEXT
research
11/22/2022

Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks

Recently, there has been significant progress in teaching language model...
research
01/31/2023

Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based Reasoning

Table-based reasoning has shown remarkable progress in combining deep mo...
research
07/21/2023

How to Tidy Up a Table: Fusing Visual and Semantic Commonsense Reasoning for Robotic Tasks with Vague Objectives

Vague objectives in many real-life scenarios pose long-standing challeng...
research
09/16/2022

Text and Patterns: For Effective Chain of Thought, It Takes Two to Tango

Reasoning is a key pillar of human cognition and intelligence. In the pa...
research
06/25/2023

RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations

Despite significant progress having been made in question answering on t...
research
05/24/2023

Calc-X: Enriching Arithmetical Chain-of-Thoughts Datasets by Interaction with Symbolic Systems

This report overviews our ongoing work in enriching chain-of-thoughts da...
research
05/14/2018

Wearable Audio and IMU Based Shot Detection in Racquet Sports

Wearables like smartwatches which are embedded with sensors and powerful...

Please sign up or login with your details

Forgot password? Click here to reset