Generate, Transform, Answer: Question Specific Tool Synthesis for Tabular Data

03/17/2023
by   Carlos Gemmell, et al.

Tabular question answering (TQA) presents a challenging setting for neural systems by requiring joint reasoning over natural language and large amounts of semi-structured data. Unlike humans, who use programmatic tools such as filters to transform data before processing it, language models in TQA process tables directly, which leads to information loss as table size increases. In this paper, we propose ToolWriter, which generates query-specific programs and detects when to apply them, transforming tables to align them with the TQA model's capabilities. Focusing ToolWriter on generating row-filtering tools improves the state of the art on WikiTableQuestions and WikiSQL, with the largest gains on long tables. By investigating headroom, our work highlights the broader potential for programmatic tools combined with neural components to manipulate large amounts of structured data.
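The row-filtering idea can be illustrated with a minimal Python sketch. This is not the authors' implementation: the table, the question, the `keep_2004` predicate, and the `apply_row_filter` helper are all hypothetical, and the fallback rule (keep the original table when the filter crashes or returns nothing) is only one assumed way to "detect when to apply" a generated tool.

```python
from typing import Callable, Dict, List

Row = Dict[str, str]
Table = List[Row]


def apply_row_filter(table: Table, keep: Callable[[Row], bool]) -> Table:
    """Return the filtered rows, or the original table if the filter is unusable."""
    try:
        filtered = [row for row in table if keep(row)]
    except Exception:
        # A generated filter may crash on unexpected cells; fall back (assumed rule).
        return table
    # An empty result would discard all evidence, so fall back to the full table.
    return filtered if filtered else table


# Hypothetical table and question: "Which 2004 event had the highest attendance?"
table: Table = [
    {"year": "2003", "event": "Spring Open", "attendance": "1200"},
    {"year": "2004", "event": "City Cup", "attendance": "900"},
    {"year": "2004", "event": "Autumn Classic", "attendance": "1500"},
    {"year": "2005", "event": "Winter Gala", "attendance": "700"},
]


def keep_2004(row: Row) -> bool:
    """Stand-in for a generated, question-specific row filter."""
    return row["year"] == "2004"


# The shorter table is what would be handed to a downstream TQA model.
smaller = apply_row_filter(table, keep_2004)
```

Here `smaller` holds only the two 2004 rows, so a downstream model would read a shorter table. A filter that raises an error or matches no rows leaves the table unchanged.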


