Problems and Countermeasures in Natural Language Processing Evaluation

04/20/2021
by   Qingxiu Dong, et al.
0

Evaluation in natural language processing guides and promotes research on models and methods. In recent years, new evalua-tion data sets and evaluation tasks have been continuously proposed. At the same time, a series of problems exposed by ex-isting evaluation have also restricted the progress of natural language processing technology. Starting from the concept, com-position, development and meaning of natural language evaluation, this article classifies and summarizes the tasks and char-acteristics of mainstream natural language evaluation, and then summarizes the problems and causes of natural language pro-cessing evaluation. Finally, this article refers to the human language ability evaluation standard, puts forward the concept of human-like machine language ability evaluation, and proposes a series of basic principles and implementation ideas for hu-man-like machine language ability evaluation from the three aspects of reliability, difficulty and validity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/15/2016

Natural Language Processing using Hadoop and KOSHIK

Natural language processing, as a data analytics related technology, is ...
research
10/06/2020

Is the Best Better? Bayesian Statistical Model Comparison for Natural Language Processing

Recent work raises concerns about the use of standard splits to compare ...
research
10/23/2018

What can AI do for me: Evaluating Machine Learning Interpretations in Cooperative Play

Machine learning is an important tool for decision making, but its ethic...
research
08/28/2019

Language Tasks and Language Games: On Methodology in Current Natural Language Processing Research

"This paper introduces a new task and a new dataset", "we improve the st...
research
05/16/2022

Reasoning about Procedures with Natural Language Processing: A Tutorial

This tutorial provides a comprehensive and in-depth view of the research...
research
03/19/2018

Dynamic Natural Language Processing with Recurrence Quantification Analysis

Writing and reading are dynamic processes. As an author composes a text,...
research
07/08/2022

No Time Like the Present: Effects of Language Change on Automated Comment Moderation

The spread of online hate has become a significant problem for newspaper...

Please sign up or login with your details

Forgot password? Click here to reset