Transformer-based Korean Pretrained Language Models: A Survey on Three Years of Progress

11/25/2021
by Kichang Yang, et al.

Since the introduction of the Transformer for machine translation in 2017, attention-based architectures have drawn increasing interest. With the emergence of BERT, which builds on the Transformer's encoder stack for natural language understanding (NLU), and GPT, which builds on its decoder stack for natural language generation (NLG), a wide range of methodologies, datasets, and models for training Pretrained Language Models (PLMs) have been proposed. Over the past three years, numerous PLMs specialized for Korean have also been released. In this paper, we quantitatively and qualitatively compare and analyze the Korean PLMs that have been released to the public.
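
As a concrete illustration of the encoder/decoder split described above, the following minimal sketch loads one publicly released encoder-style (BERT-like) Korean PLM and one decoder-style (GPT-like) Korean PLM with the Hugging Face transformers library. The checkpoint names klue/bert-base and skt/kogpt2-base-v2 are illustrative assumptions on our part, not models singled out by this survey.

# Minimal sketch (assumed checkpoints): one NLU-oriented encoder model and
# one NLG-oriented decoder model, loaded via Hugging Face transformers.
from transformers import AutoModel, AutoModelForCausalLM, AutoTokenizer

# Encoder-style Korean PLM (BERT-like): yields contextual token representations.
nlu_tokenizer = AutoTokenizer.from_pretrained("klue/bert-base")
nlu_model = AutoModel.from_pretrained("klue/bert-base")
inputs = nlu_tokenizer("한국어 사전학습 언어모델을 비교한다.", return_tensors="pt")
hidden_states = nlu_model(**inputs).last_hidden_state  # shape: (1, seq_len, hidden_dim)

# Decoder-style Korean PLM (GPT-like): generates text left to right.
nlg_tokenizer = AutoTokenizer.from_pretrained("skt/kogpt2-base-v2")
nlg_model = AutoModelForCausalLM.from_pretrained("skt/kogpt2-base-v2")
prompt = nlg_tokenizer("한국어 사전학습 언어모델은", return_tensors="pt")
generated = nlg_model.generate(**prompt, max_new_tokens=20)
print(nlg_tokenizer.decode(generated[0], skip_special_tokens=True))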
