Appendix - Recommended Statistical Significance Tests for NLP Tasks

09/05/2018
by   Rotem Dror, et al.
0

Statistical significance testing plays an important role when drawing conclusions from experimental results in NLP papers. Particularly, it is a valuable tool when one would like to establish the superiority of one algorithm over another. This appendix complements the guide for testing statistical significance in NLP presented in dror2018hitchhiker by proposing valid statistical tests for the common tasks and evaluation measures in the field.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/26/2020

NLPStatTest: A Toolkit for Comparing NLP System Performance

Statistical significance testing centered on p-values is commonly used t...
research
06/14/2022

An Experimental Investigation of Part-Of-Speech Taggers for Vietnamese

Part-of-speech (POS) tagging plays an important role in Natural Language...
research
08/13/2022

Size Matters: The Use and Misuse of Statistical Significance in Discrete Choice Models in the Transportation Academic Literature

In this paper we review the academic transportation literature published...
research
04/01/2021

Statistical significance revisited

Statistical significance measures the reliability of a result obtained f...
research
04/14/2022

deep-significance - Easy and Meaningful Statistical Significance Testing in the Age of Neural Networks

A lot of Machine Learning (ML) and Deep Learning (DL) research is of an ...
research
02/05/2023

Efficient Adaptive Sobel and Joint Significance Tests for Mediation Effects

Mediation analysis is an important statistical tool in many research fie...
research
03/16/2019

The Language of Betting as a Strategy for Statistical and Scientific Communication

The established language for statistical testing --- significance levels...

Please sign up or login with your details

Forgot password? Click here to reset