What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment

04/19/2022
by   Matthew Finlayson, et al.
11

The instruction learning paradigm – where a model learns to perform new tasks from task descriptions alone – has become popular in general-purpose model research. The capabilities of large transformer models as instruction learners, however, remain poorly understood. We use a controlled synthetic environment to characterize such capabilities. Specifically, we use the task of deciding whether a given string matches a regular expression (viewed as an instruction) to identify properties of tasks, instructions, and instances that make instruction learning challenging. For instance, we find that our model, a fine-tuned T5-based text2text transformer, struggles with large regular languages, suggesting that less precise instructions are challenging for models. Additionally, instruction executions that require tracking longer contexts of prior steps are also more difficult. We use our findings to systematically construct a challenging instruction learning dataset, which we call Hard RegSet. Fine-tuning on Hard RegSet, our large transformer learns to correctly interpret only 65.6 accuracy), and 11 generalization settings. We propose Hard RegSet as a challenging instruction learning task, and a controlled environment for studying instruction learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/20/2023

Evaluating the Zero-shot Robustness of Instruction-tuned Language Models

Instruction fine-tuning has recently emerged as a promising approach for...
research
02/07/2019

Neural Inverse Knitting: From Images to Manufacturing Instructions

Motivated by the recent potential of mass customization brought by whole...
research
03/17/2022

How Many Data Samples is an Additional Instruction Worth?

Recently introduced instruction-paradigm empowers non-expert users to le...
research
06/16/2023

Differentiable Instruction Optimization for Cross-Task Generalization

Instruction tuning has been attracting much attention to achieve general...
research
03/18/2023

Is Prompt All You Need? No. A Comprehensive and Broader View of Instruction Learning

Task semantics can be expressed by a set of input-to-output examples or ...
research
05/21/2018

A new dataset and model for learning to understand navigational instructions

In this paper, we present a state-of-the-art model and introduce a new d...
research
06/21/2023

Improving Long-Horizon Imitation Through Instruction Prediction

Complex, long-horizon planning and its combinatorial nature pose steep c...

Please sign up or login with your details

Forgot password? Click here to reset