Towards More Fine-grained and Reliable NLP Performance Prediction

02/10/2021
by   Zihuiwen Ye, et al.
0

Performance prediction, the task of estimating a system's performance without performing experiments, allows us to reduce the experimental burden caused by the combinatorial explosion of different datasets, languages, tasks, and models. In this paper, we make two contributions to improving performance prediction for NLP tasks. First, we examine performance predictors not only for holistic measures of accuracy like F1 or BLEU but also fine-grained performance measures such as accuracy over individual classes of examples. Second, we propose methods to understand the reliability of a performance prediction model from two angles: confidence intervals and calibration. We perform an analysis of four types of NLP tasks, and both demonstrate the feasibility of fine-grained performance prediction and the necessity to perform reliability analysis for performance prediction methods in the future. We make our code publicly available: <https://github.com/neulab/Reliable-NLPPP>

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/17/2021

Fine-grained Interpretation and Causation Analysis in Deep NLP Models

This paper is a write-up for the tutorial on "Fine-grained Interpretatio...
research
04/13/2021

EXPLAINABOARD: An Explainable Leaderboard for NLP

With the rapid development of NLP research, leaderboards have emerged as...
research
04/23/2023

Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

The capability of Large Language Models (LLMs) like ChatGPT to comprehen...
research
11/26/2020

Fine-Grained Re-Identification

Research into the task of re-identification (ReID) is picking up momentu...
research
08/20/2020

ImagiFilter: A resource to enable the semi-automatic mining of images at scale

Datasets (semi-)automatically collected from the web can easily scale to...
research
10/11/2022

CHAE: Fine-Grained Controllable Story Generation with Characters, Actions and Emotions

Story generation has emerged as an interesting yet challenging NLP task ...
research
09/20/2022

Data-Centric AI Paradigm Based on Application-Driven Fine-grained Dataset Design

Deep learning has a wide range of applications in industrial scenario, b...

Please sign up or login with your details

Forgot password? Click here to reset