A Comparative Study of Transformer-Based Language Models on Extractive Question Answering

10/07/2021
by Kate Pearce, et al.

Question Answering (QA) is a task in natural language processing that has seen considerable growth since the advent of transformers. There has been a surge of QA datasets proposed to challenge natural language processing models and to improve upon existing model and human performance. Many pre-trained language models have proven to be highly effective at extractive question answering. However, generalizability remains a challenge for most of these models: some datasets demand more reasoning than others. In this paper, we train various pre-trained language models and fine-tune them on multiple question answering datasets of varying difficulty to determine which models generalize most comprehensively across datasets. Further, we propose a new architecture, BERT-BiLSTM, and compare it with other language models to determine whether adding more bidirectionality can improve model performance. Using the F1-score as our metric, we find that the RoBERTa and BART pre-trained models perform best across all datasets and that our BERT-BiLSTM model outperforms the baseline BERT model.
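The page does not describe the BERT-BiLSTM architecture beyond its name, but a common way to realize the idea is to run BERT's final token representations through a bidirectional LSTM before predicting answer-span start and end positions. The PyTorch sketch below is a minimal illustration under that assumption; the class name, hidden size, and checkpoint (`BertBiLSTMForQA`, `lstm_hidden=384`, `bert-base-uncased`) are illustrative choices, not the authors' exact configuration.

```python
import torch.nn as nn
from transformers import BertModel

class BertBiLSTMForQA(nn.Module):
    """BERT encoder followed by a BiLSTM and a span-prediction head."""

    def __init__(self, model_name="bert-base-uncased", lstm_hidden=384):
        super().__init__()
        self.bert = BertModel.from_pretrained(model_name)
        hidden = self.bert.config.hidden_size  # 768 for bert-base
        # Second bidirectional pass over BERT's token representations.
        self.bilstm = nn.LSTM(
            input_size=hidden,
            hidden_size=lstm_hidden,
            batch_first=True,
            bidirectional=True,
        )
        # Two logits per token: answer-span start and end.
        self.qa_outputs = nn.Linear(2 * lstm_hidden, 2)

    def forward(self, input_ids, attention_mask=None, token_type_ids=None):
        encoded = self.bert(
            input_ids,
            attention_mask=attention_mask,
            token_type_ids=token_type_ids,
        ).last_hidden_state                   # (batch, seq_len, hidden)
        lstm_out, _ = self.bilstm(encoded)    # (batch, seq_len, 2 * lstm_hidden)
        logits = self.qa_outputs(lstm_out)    # (batch, seq_len, 2)
        start_logits, end_logits = logits.split(1, dim=-1)
        return start_logits.squeeze(-1), end_logits.squeeze(-1)
```

The F1-score for extractive QA is conventionally the SQuAD-style token-overlap F1 between the predicted and gold answer strings; the abstract does not spell out the computation, so the sketch below simply follows that standard definition.

```python
from collections import Counter

def qa_f1(prediction: str, ground_truth: str) -> float:
    """Token-overlap F1 between a predicted and a gold answer string."""
    pred_tokens = prediction.lower().split()
    gold_tokens = ground_truth.lower().split()
    common = Counter(pred_tokens) & Counter(gold_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)
```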

Related research

11/14/2020: Utilizing Bidirectional Encoder Representations from Transformers for Answer Selection
10/31/2022: Leveraging Pre-trained Models for Failure Analysis Triplets Generation
06/09/2019: Gendered Pronoun Resolution using BERT and an extractive question answering formulation
04/15/2021: Time-Stamped Language Model: Teaching Language Models to Understand the Flow of Events
04/13/2021: Structural analysis of an all-purpose question answering model
05/14/2023: Learning to Generalize for Cross-domain QA
09/15/2021: Topic Transferable Table Question Answering
