Multi-modal Retrieval of Tables and Texts Using Tri-encoder Models

08/09/2021
by   Bogdan Kostić, et al.
0

Open-domain extractive question answering works well on textual data by first retrieving candidate texts and then extracting the answer from those candidates. However, some questions cannot be answered by text alone but require information stored in tables. In this paper, we present an approach for retrieving both texts and tables relevant to a question by jointly encoding texts, tables and questions into a single vector space. To this end, we create a new multi-modal dataset based on text and table datasets from related work and compare the retrieval performance of different encoding schemata. We find that dense vector embeddings of transformer models outperform sparse embeddings on four out of six evaluation datasets. Comparing different dense embedding models, tri-encoders, with one encoder for each question, text and table, increase retrieval performance compared to bi-encoders with one encoder for the question and one for both text and tables. We release the newly created multi-modal dataset to the community so that it can be used for training and evaluation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/19/2023

Enhancing Open-Domain Table Question Answering via Syntax- and Structure-aware Dense Retrieval

Open-domain table question answering aims to provide answers to a questi...
research
04/26/2023

Towards Multi-Modal DBMSs for Seamless Querying of Texts and Tables

In this paper, we propose Multi-Modal Databases (MMDBs), which is a new ...
research
06/29/2023

Unified Language Representation for Question Answering over Text, Tables, and Images

When trying to answer complex questions, people often rely on multiple s...
research
07/18/2021

A Discriminative Semantic Ranker for Question Retrieval

Similar question retrieval is a core task in community-based question an...
research
05/19/2022

Table Retrieval May Not Necessitate Table-specific Model Design

Tables are an important form of structured data for both human and machi...
research
05/09/2021

Passage Retrieval for Outside-Knowledge Visual Question Answering

In this work, we address multi-modal information needs that contain text...
research
07/06/2022

BioTABQA: Instruction Learning for Biomedical Table Question Answering

Table Question Answering (TQA) is an important but under-explored task. ...

Please sign up or login with your details

Forgot password? Click here to reset