Evaluation for Change

12/20/2022
by   Rishi Bommasani, et al.
0

Evaluation is the central means for assessing, understanding, and communicating about NLP models. In this position paper, we argue evaluation should be more than that: it is a force for driving change, carrying a sociological and political character beyond its technical dimensions. As a force, evaluation's power arises from its adoption: under our view, evaluation succeeds when it achieves the desired change in the field. Further, by framing evaluation as a force, we consider how it competes with other forces. Under our analysis, we conjecture that the current trajectory of NLP suggests evaluation's power is waning, in spite of its potential for realizing more pluralistic ambitions in the field. We conclude by discussing the legitimacy of this power, who acquires this power and how it distributes. Ultimately, we hope the research community will more aggressively harness evaluation for change.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/01/2023

Rethinking Model Evaluation as Narrowing the Socio-Technical Gap

The recent development of generative and large language models (LLMs) po...
research
05/04/2022

Reproducibility Beyond the Research Community: Experience from NLP Beginners

As NLP research attracts public attention and excitement, it becomes inc...
research
05/17/2022

Letters From the Past: Modeling Historical Sound Change Through Diachronic Character Embeddings

While a great deal of work has been done on NLP approaches to lexical se...
research
01/30/2018

A State-of-the-Art of Semantic Change Computation

This paper reviews the state-of-the-art of semantic change computation, ...
research
05/28/2020

What is SemEval evaluating? A Systematic Analysis of Evaluation Campaigns in NLP

SemEval is the primary venue in the NLP community for the proposal of ne...
research
09/14/2021

Just What do You Think You're Doing, Dave?' A Checklist for Responsible Data Use in NLP

A key part of the NLP ethics movement is responsible use of data, but ex...
research
01/13/2022

NLP in Human Rights Research – Extracting Knowledge Graphs About Police and Army Units and Their Commanders

In this working paper we explore the use of an NLP system to assist the ...

Please sign up or login with your details

Forgot password? Click here to reset