Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs

05/19/2023
by Pranjal Aggarwal, et al.

A popular approach for improving the correctness of output from large language models (LLMs) is Self-Consistency: poll the LLM multiple times and output the most frequent solution. Existing Self-Consistency techniques always draw a constant number of samples per question, whereas a better approach would be to distribute the available budget non-uniformly based on the amount of agreement in the samples drawn so far. In response, we introduce Adaptive-Consistency, a cost-efficient, model-agnostic technique that dynamically adjusts the number of samples per question using a lightweight stopping criterion. Our experiments over 13 datasets and two LLMs demonstrate that Adaptive-Consistency reduces sample budget by up to 6.0 times with an average accuracy drop of less than 0.1%.
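
The sampling loop described in the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the stopping criterion, so the sketch assumes a simple rule that stops once the posterior probability that the leading answer beats the runner-up (under a uniform Beta prior) exceeds a threshold. The names `sample_fn`, `majority_confidence`, `adaptive_consistency`, and the default values of `max_samples` and `threshold` are all hypothetical.

```python
from collections import Counter
from math import comb


def majority_confidence(counts):
    """Posterior probability that the leading answer is the true majority.

    Assumption: only the top two answers are compared, with a uniform prior,
    so the leader's share follows Beta(v1 + 1, v2 + 1). For integer
    parameters, P(share > 0.5) reduces to a binomial tail sum.
    """
    top = counts.most_common(2)
    v1 = top[0][1]
    v2 = top[1][1] if len(top) > 1 else 0
    n = v1 + v2 + 1
    return sum(comb(n, k) for k in range(v1 + 1)) / 2 ** n


def adaptive_consistency(sample_fn, max_samples=40, threshold=0.95):
    """Draw answers one at a time and stop early once they agree enough.

    sample_fn is a hypothetical zero-argument callable that queries the LLM
    once and returns a hashable answer. max_samples (assumed >= 1) caps the
    budget, which is what fixed-budget Self-Consistency would always spend.
    """
    counts = Counter()
    for drawn in range(1, max_samples + 1):
        counts[sample_fn()] += 1
        if majority_confidence(counts) >= threshold:
            break
    answer = counts.most_common(1)[0][0]
    return answer, drawn


if __name__ == "__main__":
    # A question the model always answers the same way stops after few draws.
    print(adaptive_consistency(lambda: "42"))
```

With these assumed defaults, a question on which every sample agrees stops well before the budget is exhausted, while a question whose samples keep disagreeing consumes the full `max_samples`. That is the non-uniform allocation of budget the abstract argues for.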


research
05/23/2023

Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs

Large language models (LLMs) have achieved widespread success on a varie...
research
01/19/2023

Batch Prompting: Efficient Inference with Large Language Model APIs

Performing inference on hundreds of thousands of samples with large lang...
research
05/24/2023

Discriminator-Guided Multi-step Reasoning with Language Models

In the context of multi-step reasoning, language models (LMs) probabilit...
research
09/06/2012

The Sample Complexity of Search over Multiple Populations

This paper studies the sample complexity of searching over multiple popu...
research
07/11/2023

Self-consistency for open-ended generations

In this paper, we present a novel approach for improving the quality and...
research
02/22/2018

Are Two (Samples) Really Better Than One? On the Non-Asymptotic Performance of Empirical Revenue Maximization

The literature on "mechanism design from samples," which has flourished ...
research
01/15/2022

Sample Summary with Generative Encoding

With increasing sample sizes, all algorithms require longer run times th...
