Towards Zero-Shot Functional Compositionality of Language Models

03/06/2023
by Hangyeol Yu, et al.

Large pre-trained language models (PLMs) have become the preferred starting point in NLP, as they are remarkably good at solving many individual tasks. Despite this success, we argue in this paper that current paradigms of working with PLMs neglect a critical aspect of modeling human intelligence: functional compositionality. Functional compositionality, the ability to compose learned tasks, has been a long-standing challenge in AI (and many other fields), as it is considered one of the hallmarks of human intelligence. An illustrative example is cross-lingual summarization, where a bilingual (English-French) person can summarize an English document directly into French without explicitly translating either the English document or its summary into French. We discuss why this is an important open problem that requires further attention from the field. Then, we show that current PLMs (e.g., GPT-2 and T5) do not yet exhibit functional compositionality and remain far from human-level generalizability. Finally, we suggest several research directions that could push the field towards zero-shot functional compositionality of language models.

