Which Model Shall I Choose? Cost/Quality Trade-offs for Text Classification Tasks

01/17/2023
by   Shi Zong, et al.
0

Industry practitioners always face the problem of choosing the appropriate model for deployment under different considerations, such as to maximize a metric that is crucial for production, or to reduce the total cost given financial concerns. In this work, we focus on the text classification task and present a quantitative analysis for this challenge. Using classification accuracy as the main metric, we evaluate the classifiers' performances for a variety of models, including large language models, along with their associated costs, including the annotation cost, training (fine-tuning) cost, and inference cost. We then discuss the model choices for situations like having a large number of samples needed for inference. We hope our work will help people better understand the cost/quality trade-offs for the text classification task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/21/2022

Performance-Efficiency Trade-Offs in Adapting Language Models to Text Classification Tasks

Pre-trained language models (LMs) obtain state-of-the-art performance wh...
research
05/03/2023

Using Language Models on Low-end Hardware

This paper evaluates the viability of using fixed language models for tr...
research
05/12/2022

On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data

Borrowing ideas from Production functions in micro-economics, in this pa...
research
10/11/2022

Analyzing Text Representations under Tight Annotation Budgets: Measuring Structural Alignment

Annotating large collections of textual data can be time consuming and e...
research
03/29/2023

Did You Mean...? Confidence-based Trade-offs in Semantic Parsing

We illustrate how a calibrated model can help balance common trade-offs ...
research
06/08/2023

Privacy- and Utility-Preserving NLP with Anonymized Data: A case study of Pseudonymization

This work investigates the effectiveness of different pseudonymization t...
research
01/14/2022

Model Stability with Continuous Data Updates

In this paper, we study the "stability" of machine learning (ML) models ...

Please sign up or login with your details

Forgot password? Click here to reset