Towards Zero-shot Commonsense Reasoning with Self-supervised Refinement of Language Models

09/10/2021
by   Tassilo Klein, et al.
0

Can we get existing language models and refine them for zero-shot commonsense reasoning? This paper presents an initial study exploring the feasibility of zero-shot commonsense reasoning for the Winograd Schema Challenge by formulating the task as self-supervised refinement of a pre-trained language model. In contrast to previous studies that rely on fine-tuning annotated datasets, we seek to boost conceptualization via loss landscape refinement. To this end, we propose a novel self-supervised learning approach that refines the language model utilizing a set of linguistic perturbations of similar concept relationships. Empirical analysis of our conceptually simple framework demonstrates the viability of zero-shot commonsense reasoning on multiple benchmarks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/14/2023

Prompt Engineering and Calibration for Zero-Shot Commonsense Reasoning

Prompt engineering and calibration make large language models excel at r...
research
12/20/2022

Go-tuning: Improving Zero-shot Learning Abilities of Smaller Language Models

With increasing scale, large language models demonstrate both quantitati...
research
04/16/2021

Back to Square One: Bias Detection, Training and Commonsense Disentanglement in the Winograd Schema

The Winograd Schema (WS) has been proposed as a test for measuring commo...
research
10/12/2022

Zero-Shot Prompting for Implicit Intent Prediction and Recommendation with Commonsense Reasoning

Intelligent virtual assistants are currently designed to perform tasks o...
research
03/22/2023

Frozen Language Model Helps ECG Zero-Shot Learning

The electrocardiogram (ECG) is one of the most commonly used non-invasiv...
research
05/02/2020

Contrastive Self-Supervised Learning for Commonsense Reasoning

We propose a self-supervised method to solve Pronoun Disambiguation and ...
research
10/02/2020

MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale

We study the zero-shot transfer capabilities of text matching models on ...

Please sign up or login with your details

Forgot password? Click here to reset