An Overview on Language Models: Recent Developments and Outlook

03/10/2023
by   Chengwei Wei, et al.
0

Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine translation, etc. Conventional language models (CLMs) aim to predict the probability of linguistic sequences in a causal manner. In contrast, pre-trained language models (PLMs) cover broader concepts and can be used in both causal sequential modeling and fine-tuning for downstream applications. PLMs have their own training paradigms (usually self-supervised) and serve as foundation models in modern NLP systems. This overview paper provides an introduction to both CLMs and PLMs from five aspects, i.e., linguistic units, structures, training methods, evaluation methods, and applications. Furthermore, we discuss the relationship between CLMs and PLMs and shed light on the future directions of language modeling in the pre-trained era.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/11/2022

A Survey of Knowledge-Enhanced Pre-trained Language Models

Pre-trained Language Models (PLMs) which are trained on large text corpu...
research
02/16/2023

Foundation Models for Natural Language Processing – Pre-trained Language Models Integrating Media

This open access book provides a comprehensive overview of the state of ...
research
12/20/2022

A Measure-Theoretic Characterization of Tight Language Models

Language modeling, a central task in natural language processing, involv...
research
05/31/2023

Pre-Trained Language-Meaning Models for Multilingual Parsing and Generation

Pre-trained language models (PLMs) have achieved great success in NLP an...
research
07/14/2023

MorphPiece : Moving away from Statistical Language Representation

Tokenization is a critical part of modern NLP pipelines. However, contem...
research
07/11/2023

GujiBERT and GujiGPT: Construction of Intelligent Information Processing Foundation Language Models for Ancient Texts

In the context of the rapid development of large language models, we hav...

Please sign up or login with your details

Forgot password? Click here to reset