The risk of sub-optimal use of Open Source NLP Software: UKB is inadvertently state-of-the-art in knowledge-based WSD

05/11/2018
by Eneko Agirre, et al.

UKB is an open source collection of programs for performing, among other tasks, knowledge-based Word Sense Disambiguation (WSD). Since its release in 2009, it has often been used out-of-the-box in sub-optimal settings. We show that, nine years later, it is the state of the art in knowledge-based WSD. This case illustrates the pitfalls of releasing open source NLP software without optimal default settings and precise instructions for reproducibility.
