Grid Search Hyperparameter Benchmarking of BERT, ALBERT, and LongFormer on DuoRC

01/15/2021
by   Alex John Quijano, et al.
0

The purpose of this project is to evaluate three language models named BERT, ALBERT, and LongFormer on the Question Answering dataset called DuoRC. The language model task has two inputs, a question, and a context. The context is a paragraph or an entire document while the output is the answer based on the context. The goal is to perform grid search hyperparameter fine-tuning using DuoRC. Pretrained weights of the models are taken from the Huggingface library. Different sets of hyperparameters are used to fine-tune the models using two versions of DuoRC which are the SelfRC and the ParaphraseRC. The results show that the ALBERT (pretrained using the SQuAD1 dataset) has an F1 score of 76.4 and an accuracy score of 68.52 after fine-tuning on the SelfRC dataset. The Longformer model (pretrained using the SQuAD and SelfRC datasets) has an F1 score of 52.58 and an accuracy score of 46.60 after fine-tuning on the ParaphraseRC dataset. The current results outperformed the results from the previous model by DuoRC.

READ FULL TEXT
research
11/13/2022

Xu at SemEval-2022 Task 4: Pre-BERT Neural Network Methods vs Post-BERT RoBERTa Approach for Patronizing and Condescending Language Detection

This paper describes my participation in the SemEval-2022 Task 4: Patron...
research
11/01/2022

Preserving In-Context Learning ability in Large Language Model Fine-tuning

Pretrained large language models (LLMs) are strong in-context learners t...
research
08/08/2023

In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning

In this note, we explore inference-time alignment through in-context lea...
research
09/14/2023

PerPLM: Personalized Fine-tuning of Pretrained Language Models via Writer-specific Intermediate Learning and Prompts

The meanings of words and phrases depend not only on where they are used...
research
06/09/2023

Understanding Telecom Language Through Large Language Models

The recent progress of artificial intelligence (AI) opens up new frontie...
research
06/29/2022

GPTs at Factify 2022: Prompt Aided Fact-Verification

One of the most pressing societal issues is the fight against false news...
research
08/27/2021

An Adaptive Clustering Approach for Accident Prediction

Traffic accident prediction is a crucial task in the mobility domain. St...

Please sign up or login with your details

Forgot password? Click here to reset