On-the-Job Learning with Bayesian Decision Theory

06/10/2015
by   Keenon Werling, et al.
0

Our goal is to deploy a high-accuracy system starting with zero training examples. We consider an "on-the-job" setting, where as inputs arrive, we use real-time crowdsourcing to resolve uncertainty where needed and output our prediction when confident. As the model improves over time, the reliance on crowdsourcing queries decreases. We cast our setting as a stochastic game based on Bayesian decision theory, which allows us to balance latency, cost, and accuracy objectives in a principled way. Computing the optimal policy is intractable, so we develop an approximation based on Monte Carlo Tree Search. We tested our approach on three datasets---named-entity recognition, sentiment classification, and image classification. On the NER task we obtained more than an order of magnitude reduction in cost compared to full human annotation, while boosting performance relative to the expert provided labels. We also achieve a 8 a 28

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/31/2021

Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition

Crowdsourcing is regarded as one prospective solution for effective supe...
research
02/12/2017

Octopus: A Framework for Cost-Quality-Time Optimization in Crowdsourcing

We present Octopus, an AI agent to jointly balance three conflicting tas...
research
06/28/2023

Sentence-to-Label Generation Framework for Multi-task Learning of Japanese Sentence Classification and Named Entity Recognition

Information extraction(IE) is a crucial subfield within natural language...
research
07/18/2023

Mutual Reinforcement Effects in Japanese Sentence Classification and Named Entity Recognition Tasks

Information extraction(IE) is a crucial subfield within natural language...
research
02/14/2021

Costly Features Classification using Monte Carlo Tree Search

We consider the problem of costly feature classification, where we seque...
research
02/27/2019

F10-SGD: Fast Training of Elastic-net Linear Models for Text Classification and Named-entity Recognition

Voice-assistants text classification and named-entity recognition (NER) ...
research
01/02/2019

A Full Probabilistic Model for Yes/No Type Crowdsourcing in Multi-Class Classification

Crowdsourcing has become widely used in supervised scenarios where train...

Please sign up or login with your details

Forgot password? Click here to reset