Explanations from Large Language Models Make Small Reasoners Better

10/13/2022
by   Shiyang Li, et al.
12

Integrating free-text explanations to in-context learning of large language models (LLM) is shown to elicit strong reasoning capabilities along with reasonable explanations. In this paper, we consider the problem of leveraging the explanations generated by LLM to improve the training of small reasoners, which are more favorable in real-production deployment due to their low cost. We systematically explore three explanation generation approaches from LLM and utilize a multi-task learning framework to facilitate small models to acquire strong reasoning power together with explanation generation capabilities. Experiments on multiple reasoning tasks show that our method can consistently and significantly outperform finetuning baselines across different settings, and even perform better than finetuning/prompting a 60x larger GPT-3 (175B) model by up to 9.5 shows that our method can generate high-quality explanations to justify its predictions, moving towards the goal of explainable AI.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/29/2023

LMExplainer: a Knowledge-Enhanced Explainer for Language Models

Large language models (LMs) such as GPT-4 are very powerful and can proc...
research
11/25/2022

Complementary Explanations for Effective In-Context Learning

Large language models (LLMs) have exhibited remarkable capabilities in l...
research
05/19/2023

CCGen: Explainable Complementary Concept Generation in E-Commerce

We propose and study Complementary Concept Generation (CCGen): given a c...
research
12/08/2022

Harnessing the Power of Multi-Task Pretraining for Ground-Truth Level Natural Language Explanations

Natural language explanations promise to offer intuitively understandabl...
research
05/07/2023

Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting

Large Language Models (LLMs) can achieve strong performance on many task...
research
01/11/2021

Explain and Predict, and then Predict again

A desirable property of learning systems is to be both effective and int...
research
06/27/2023

REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction

The ability to detect and analyze failed executions automatically is cru...

Please sign up or login with your details

Forgot password? Click here to reset