Visualizing NLP annotations for Crowdsourcing

08/25/2015
by   Hanchuan Li, et al.
0

Visualizing NLP annotation is useful for the collection of training data for the statistical NLP approaches. Existing toolkits either provide limited visual aid, or introduce comprehensive operators to realize sophisticated linguistic rules. Workers must be well trained to use them. Their audience thus can hardly be scaled to large amounts of non-expert crowdsourced workers. In this paper, we present CROWDANNO, a visualization toolkit to allow crowd-sourced workers to annotate two general categories of NLP problems: clustering and parsing. Workers can finish the tasks with simplified operators in an interactive interface, and fix errors conveniently. User studies show our toolkit is very friendly to NLP non-experts, and allow them to produce high quality labels for several sophisticated problems. We release our source code and toolkit to spur future research.

READ FULL TEXT

page 4

page 5

research
09/17/2021

A General-Purpose Crowdsourcing Computational Quality Control Toolkit for Python

Quality control is a crux of crowdsourcing. While most means for quality...
research
02/19/2023

SanskritShala: A Neural Sanskrit NLP Toolkit with Web-Based Interface for Pedagogical and Annotation Purposes

We present a neural Sanskrit Natural Language Processing (NLP) toolkit n...
research
05/26/2021

Quantifying and Avoiding Unfair Qualification Labour in Crowdsourcing

Extensive work has argued in favour of paying crowd workers a wage that ...
research
07/25/2022

DialCrowd 2.0: A Quality-Focused Dialog System Crowdsourcing Toolkit

Dialog system developers need high-quality data to train, fine-tune and ...
research
05/05/2020

CODA-19: Reliably Annotating Research Aspects on 10,000+ CORD-19 Abstracts Using a Non-Expert Crowd

This paper introduces CODA-19, a human-annotated dataset that codes the ...
research
11/26/2020

NLPStatTest: A Toolkit for Comparing NLP System Performance

Statistical significance testing centered on p-values is commonly used t...
research
05/28/2021

Confident in the Crowd: Bayesian Inference to Improve Data Labelling in Crowdsourcing

With the increased interest in machine learning and big data problems, t...

Please sign up or login with your details

Forgot password? Click here to reset