Comparative Study of Language Models on Cross-Domain Data with Model Agnostic Explainability

09/09/2020
by   Mayank Chhipa, et al.
0

With the recent influx of bidirectional contextualized transformer language models in the NLP, it becomes a necessity to have a systematic comparative study of these models on variety of datasets. Also, the performance of these language models has not been explored on non-GLUE datasets. The study presented in paper compares the state-of-the-art language models - BERT, ELECTRA and its derivatives which include RoBERTa, ALBERT and DistilBERT. We conducted experiments by finetuning these models for cross domain and disparate data and penned an in-depth analysis of model's performances. Moreover, an explainability of language models coherent with pretraining is presented which verifies the context capturing capabilities of these models through a model agnostic approach. The experimental results establish new state-of-the-art for Yelp 2013 rating classification task and Financial Phrasebank sentiment detection task with 69 study conferred here can greatly assist industry researchers in choosing the language model effectively in terms of performance or compute efficiency.

READ FULL TEXT
research
06/13/2023

NoCoLA: The Norwegian Corpus of Linguistic Acceptability

While there has been a surge of large language models for Norwegian in r...
research
02/27/2023

Fluid Transformers and Creative Analogies: Exploring Large Language Models' Capacity for Augmenting Cross-Domain Analogical Creativity

Cross-domain analogical reasoning is a core creative ability that can be...
research
02/04/2023

A New cross-domain strategy based XAI models for fake news detection

In this study, we presented a four-level cross-domain strategy for fake ...
research
05/21/2022

DeepStruct: Pretraining of Language Models for Structure Prediction

We introduce a method for improving the structural understanding abiliti...
research
03/12/2021

Improving Authorship Verification using Linguistic Divergence

We propose an unsupervised solution to the Authorship Verification task ...
research
08/26/2020

Language Models and Word Sense Disambiguation: An Overview and Analysis

Transformer-based language models have taken many fields in NLP by storm...
research
09/01/2023

Large Language Models for Semantic Monitoring of Corporate Disclosures: A Case Study on Korea's Top 50 KOSPI Companies

In the rapidly advancing domain of artificial intelligence, state-of-the...

Please sign up or login with your details

Forgot password? Click here to reset