Language Models are not Models of Language

12/13/2021
by   Csaba Veres, et al.
0

Natural Language Processing (NLP) has become one of the leading application areas in the current Artificial Intelligence boom. Transfer learning has enabled large deep learning neural networks trained on the language modeling task to vastly improve performance in almost all language tasks. Interestingly, when the models are trained with data that includes software code, they demonstrate remarkable abilities in generating functioning computer code from natural language specifications. We argue that this creates a conundrum for claims that neural models provide an alternative theory to generative phrase structure grammars in explaining how language works. Since the syntax of programming languages is determined by phrase structure grammars, successful neural models are apparently uninformative about the theoretical foundations of programming languages, and by extension, natural languages. We argue that the term language model is misleading because deep learning models are not theoretical models of language and propose the adoption of corpus model instead, which better reflects the genesis and contents of the model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2022

A Precis of Language Models are not Models of Language

Natural Language Processing is one of the leading application areas in t...
research
06/22/2020

Exploring Software Naturalness throughNeural Language Models

The Software Naturalness hypothesis argues that programming languages ca...
research
05/03/2022

Neural language models for network configuration: Opportunities and reality check

Boosted by deep learning, natural language processing (NLP) techniques h...
research
10/08/2019

Do People Prefer "Natural" code?

Natural code is known to be very repetitive (much more so than natural l...
research
08/28/2018

Rational Recurrences

Despite the tremendous empirical success of neural models in natural lan...
research
08/15/2023

Anaphoric Structure Emerges Between Neural Networks

Pragmatics is core to natural language, enabling speakers to communicate...

Please sign up or login with your details

Forgot password? Click here to reset