Medical Scientific Table-to-Text Generation with Human-in-the-Loop under the Data Sparsity Constraint

05/24/2022
by Heng-Yi Wu, et al.

Structured (tabular) data in the preclinical and clinical domains contain valuable information about individuals, and an efficient table-to-text summarization system can drastically reduce the manual effort needed to condense these data into reports. In practice, however, the problem is heavily impeded by data paucity, data sparsity, and the inability of state-of-the-art natural language generation models (including T5, PEGASUS and GPT-Neo) to produce accurate and reliable outputs. In this paper, we tackle these problems with a novel two-step table-to-text architecture enhanced by auto-correction, a copy mechanism, and synthetic data augmentation. The study shows that the proposed approach selects salient biomedical entities and values from structured data and copies tabular values with improved precision (up to a 0.13 absolute increase), generating coherent and accurate text for assay validation reports and toxicology reports. We also demonstrate a lightweight adaptation of the proposed system to new datasets by fine-tuning with as little as 40% of the training examples. The outputs of our model are validated by human experts in a Human-in-the-Loop scenario.
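To make the notion of copying precision concrete, the following is a minimal sketch of one way such a score could be computed: the fraction of numeric values in a generated sentence that also appear in the source table row. The function name `copy_precision`, the regular expression, and the example row are illustrative assumptions; the abstract does not specify how the reported precision is defined, so this should not be read as the authors' metric.

```python
import re

# Hypothetical helper: matches integers and decimals such as "4.2" or "98".
NUMBER = re.compile(r"-?\d+(?:\.\d+)?")


def copy_precision(table, text):
    """Fraction of numbers in `text` that also occur as values in `table`.

    `table` is a dict mapping column names to cell values (an assumed,
    simplified stand-in for one row of tabular data).
    """
    table_values = {
        match for cell in table.values() for match in NUMBER.findall(str(cell))
    }
    generated = NUMBER.findall(text)
    if not generated:
        return 0.0  # nothing was copied, so no copied value can be correct
    hits = sum(1 for value in generated if value in table_values)
    return hits / len(generated)


# Invented example row and outputs, for illustration only.
row = {"assay": "ELISA", "accuracy_pct": "98.5", "cv_pct": "4.2"}
good = "The ELISA assay reached 98.5% accuracy with a CV of 4.2%."
bad = "The ELISA assay reached 95.8% accuracy with a CV of 4.2%."

print(copy_precision(row, good))
print(copy_precision(row, bad))
```

A string-level check like this only catches values that are copied incorrectly; it says nothing about whether the right value is attached to the right entity, which is one reason expert validation remains useful.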

research
05/23/2023

QTSumm: A New Benchmark for Query-Focused Table Summarization

People primarily consult tables to conduct data analysis or answer speci...
research
03/01/2022

Attend, Memorize and Generate: Towards Faithful Table-to-Text Generation in Few Shots

Few-shot table-to-text generation is a task of composing fluent and fait...
research
05/22/2022

Diversity Enhanced Table-to-Text Generation via Type Control

Generating natural language statements to convey information from tabula...
research
04/29/2020

ToTTo: A Controlled Table-To-Text Generation Dataset

We present ToTTo, an open-domain English table-to-text dataset with over...
research
08/27/2021

Few-Shot Table-to-Text Generation with Prototype Memory

Neural table-to-text generation models have achieved remarkable progress...
research
02/20/2023

Improving User Controlled Table-To-Text Generation Robustness

In this work we study user controlled table-to-text generation where use...
research
05/20/2020

Creative Artificial Intelligence – Algorithms vs. humans in an incentivized writing competition

The release of openly available, robust text generation algorithms has s...