Analyzing the Generalizability of Deep Contextualized Language Representations For Text Classification

03/22/2023
by   Berfu Buyukoz, et al.
0

This study evaluates the robustness of two state-of-the-art deep contextual language representations, ELMo and DistilBERT, on supervised learning of binary protest news classification and sentiment analysis of product reviews. A "cross-context" setting is enabled using test sets that are distinct from the training data. Specifically, in the news classification task, the models are developed on local news from India and tested on the local news from China. In the sentiment analysis task, the models are trained on movie reviews and tested on customer reviews. This comparison is aimed at exploring the limits of the representative power of today's Natural Language Processing systems on the path to the systems that are generalizable to real-life scenarios. The models are fine-tuned and fed into a Feed-Forward Neural Network and a Bidirectional Long Short Term Memory network. Multinomial Naive Bayes and Linear Support Vector Machine are used as traditional baselines. The results show that, in binary text classification, DistilBERT is significantly better than ELMo on generalizing to the cross-context setting. ELMo is observed to be significantly more robust to the cross-context test data than both baselines. On the other hand, the baselines performed comparably well to ELMo when the training and test data are subsets of the same corpus (no cross-context). DistilBERT is also found to be 30 DistilBERT can transfer generic semantic knowledge to other domains better than ELMo. DistilBERT is also favorable in incorporating into real-life systems for it requires a smaller computational training budget. When generalization is not the utmost preference and test domain is similar to the training domain, the traditional ML algorithms can still be considered as more economic alternatives to deep language representations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/10/2018

Deep Learning for Digital Text Analytics: Sentiment Analysis

In today's scenario, imagining a world without negativity is something v...
research
12/26/2019

Text Classification for Azerbaijani Language Using Machine Learning and Embedding

Text classification systems will help to solve the text clustering probl...
research
10/29/2019

An Efficient Model for Sentiment Analysis of Electronic Product Reviews in Vietnamese

In the past few years, the growth of e-commerce and digital marketing in...
research
03/04/2020

SeMemNN: A Semantic Matrix-Based Memory Neural Network for Text Classification

Text categorization is the task of assigning labels to documents written...
research
03/15/2023

Cross-domain Sentiment Classification in Spanish

Sentiment Classification is a fundamental task in the field of Natural L...
research
03/16/2020

Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data

The automatic identification of propaganda has gained significance in re...
research
08/01/2020

Overview of CLEF 2019 Lab ProtestNews: Extracting Protests from News in a Cross-context Setting

We present an overview of the CLEF-2019 Lab ProtestNews on Extracting Pr...

Please sign up or login with your details

Forgot password? Click here to reset