Billions of Parameters Are Worth More Than In-domain Training Data: A case study in the Legal Case Entailment Task

05/30/2022
by   Guilherme Moraes Rosa, et al.
0

Recent work has shown that language models scaled to billions of parameters, such as GPT-3, perform remarkably well in zero-shot and few-shot scenarios. In this work, we experiment with zero-shot models in the legal case entailment task of the COLIEE 2022 competition. Our experiments show that scaling the number of parameters in a language model improves the F1 score of our previous zero-shot result by more than 6 points, suggesting that stronger zero-shot capability may be a characteristic of larger models, at least for this task. Our 3B-parameter zero-shot model outperforms all models, including ensembles, in the COLIEE 2021 test set and also achieves the best performance of a single model in the COLIEE 2022 competition, second only to the ensemble composed of the 3B model itself and a smaller version of the same model. Despite the challenges posed by large language models, mainly due to latency constraints in real-time applications, we provide a demonstration of our zero-shot monoT5-3b model being used in production as a search engine, including for legal documents. The code for our submission and the demo of our system are available at https://github.com/neuralmind-ai/coliee and https://neuralsearchx.neuralmind.ai, respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/07/2022

To Tune or Not To Tune? Zero-shot Models for Legal Case Entailment

There has been mounting evidence that pretrained language models fine-tu...
research
08/08/2023

Large Language Model Prompt Chaining for Long Legal Document Classification

Prompting is used to guide or steer a language model in generating an ap...
research
07/28/2022

LAD: Language Models as Data for Zero-Shot Dialog

To facilitate zero-shot generalization in taskoriented dialog, this pape...
research
03/25/2022

ZS4IE: A toolkit for Zero-Shot Information Extraction with simple Verbalizations

The current workflow for Information Extraction (IE) analysts involves t...
research
01/11/2023

GPT as Knowledge Worker: A Zero-Shot Evaluation of (AI)CPA Capabilities

The global economy is increasingly dependent on knowledge workers to mee...
research
10/17/2022

Zero-Shot Ranking Socio-Political Texts with Transformer Language Models to Reduce Close Reading Time

We approach the classification problem as an entailment problem and appl...
research
06/27/2022

A Zero-Shot Classification Approach for a Word-Guessing Challenge

The Taboo Challenge competition, a task based on the well-known Taboo ga...

Please sign up or login with your details

Forgot password? Click here to reset