TAPAS at SemEval-2021 Task 9: Reasoning over tables with intermediate pre-training

04/02/2021
by   Thomas Müller, et al.
6

We present the TAPAS contribution to the Shared Task on Statement Verification and Evidence Finding with Tables (SemEval 2021 Task 9, Wang et al. (2021)). SEM TAB FACT Task A is a classification task of recognizing if a statement is entailed, neutral or refuted by the content of a given table. We adopt the binary TAPAS model of Eisenschlos et al. (2020) to this task. We learn two binary classification models: A first model to predict if a statement is neutral or non-neutral and a second one to predict if it is entailed or refuted. As the shared task training set contains only entailed or refuted examples, we generate artificial neutral examples to train the first model. Both models are pre-trained using a MASKLM objective, intermediate counter-factual and synthetic data (Eisenschlos et al., 2020) and TABFACT (Chen et al., 2020), a large table entailment dataset. We find that the artificial neutral examples are somewhat effective at training the first model, achieving 68.03 test F1 versus the 60.47 of a majority baseline. For the second stage, we find that the pre-training on the intermediate data and TABFACT improves the results over MASKLM pre-training (68.03 vs 57.01).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/01/2020

Understanding tables with intermediate pre-training

Table entailment, the binary classification task of finding if a sentenc...
research
01/17/2013

Knowledge Matters: Importance of Prior Information for Optimization

We explore the effect of introducing prior information into the intermed...
research
12/31/2018

Multilingual Constituency Parsing with Self-Attention and Pre-Training

We extend our previous work on constituency parsing (Kitaev and Klein, 2...
research
05/16/2023

Generative Table Pre-training Empowers Models for Tabular Prediction

Recently, the topic of table pre-training has attracted considerable res...
research
06/02/2023

Theoretical Behavior of XAI Methods in the Presence of Suppressor Variables

In recent years, the community of 'explainable artificial intelligence' ...
research
01/28/2019

Using Pre-Training Can Improve Model Robustness and Uncertainty

Tuning a pre-trained network is commonly thought to improve data efficie...
research
05/03/2023

Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models

Sharding a large machine learning model across multiple devices to balan...

Please sign up or login with your details

Forgot password? Click here to reset