A Side-by-side Comparison of Transformers for English Implicit Discourse Relation Classification

07/07/2023
by   Bruce W. Lee, et al.
0

Though discourse parsing can help multiple NLP fields, there has been no wide language model search done on implicit discourse relation classification. This hinders researchers from fully utilizing public-available models in discourse analysis. This work is a straightforward, fine-tuned discourse performance comparison of seven pre-trained language models. We use PDTB-3, a popular discourse relation annotated dataset. Through our model search, we raise SOTA to 0.671 ACC and obtain novel observations. Some are contrary to what has been reported before (Shi and Demberg, 2019b), that sentence-level pre-training objectives (NSP, SBO, SOP) generally fail to produce the best performing model for implicit discourse relation classification. Counterintuitively, similar-sized PLMs with MLM and full attention led to better performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/08/2022

Towards Understanding Large-Scale Discourse Structures in Pre-Trained and Fine-Tuned Language Models

With a growing number of BERTology work analyzing different components o...
research
10/20/2022

Pre-trained Sentence Embeddings for Implicit Discourse Relation Classification

Implicit discourse relations bind smaller linguistic units into coherent...
research
02/03/2017

Automatic Prediction of Discourse Connectives

Accurate prediction of suitable discourse connectives (however, furtherm...
research
08/30/2018

Acquiring Annotated Data with Cross-lingual Explicitation for Implicit Discourse Relation Classification

Implicit discourse relation classification is one of the most challengin...
research
10/13/2022

Prompt-based Connective Prediction Method for Fine-grained Implicit Discourse Relation Recognition

Due to the absence of connectives, implicit discourse relation recogniti...
research
04/01/2017

Adversarial Connective-exploiting Networks for Implicit Discourse Relation Classification

Implicit discourse relation classification is of great challenge due to ...
research
11/02/2020

IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP

Although the Indonesian language is spoken by almost 200 million people ...

Please sign up or login with your details

Forgot password? Click here to reset