Data Valuation using Reinforcement Learning

09/25/2019
by   Jinsung Yoon, et al.

Quantifying the value of data is a fundamental problem in machine learning. Data valuation has several important use cases: (1) building insights about the learning task, (2) domain adaptation, (3) corrupted sample discovery, and (4) robust learning. To adaptively learn data values jointly with the target task predictor model, we propose a meta-learning framework that we name Data Valuation using Reinforcement Learning (DVRL). We employ a data value estimator (modeled by a deep neural network) to learn how likely each datum is to be used in training the predictor model. We train the data value estimator using a reinforcement signal: the reward obtained on a small validation set that reflects performance on the target task. We demonstrate that DVRL yields superior data value estimates compared to alternative methods across different types of datasets and in a diverse set of application scenarios. The corrupted sample discovery performance of DVRL is close to optimal in many regimes (i.e., as if the noisy samples were known a priori), and for domain adaptation and robust learning DVRL significantly outperforms the state of the art by 14.6% and 10.8%, respectively.
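The training loop described in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation. Several simplifications are assumptions made only for illustration: the data value estimator is a logistic model rather than a deep network, the predictor is a ridge regression, the reward is the negative validation mean squared error, and the estimator also sees each sample's residual under a model fitted on the validation set. The names `dvrl_sketch` and `fit_predictor` are hypothetical.

```python
import numpy as np

def sigmoid(z):
    # Clip to avoid overflow in exp for large |z|.
    return 1.0 / (1.0 + np.exp(-np.clip(z, -30.0, 30.0)))

def fit_predictor(x, y, weights, l2=1e-3):
    """Weighted ridge regression; `weights` is the 0/1 selection mask."""
    xw = x * weights[:, None]
    a = xw.T @ x + l2 * np.eye(x.shape[1])
    return np.linalg.solve(a, xw.T @ y)

def dvrl_sketch(x_tr, y_tr, x_val, y_val, steps=200, lr=0.1, seed=0):
    """Return one selection probability (data value) per training sample."""
    rng = np.random.default_rng(seed)
    # Assumption of this sketch: the estimator input is the features, the
    # label, the absolute residual under a validation-fitted model, and a bias.
    w_val = fit_predictor(x_val, y_val, np.ones(len(x_val)))
    resid = np.abs(y_tr - x_tr @ w_val)
    feats = np.column_stack([x_tr, y_tr, resid, np.ones(len(x_tr))])
    theta = np.zeros(feats.shape[1])  # estimator parameters
    baseline = None                   # moving-average reward baseline
    for _ in range(steps):
        probs = sigmoid(feats @ theta)  # selection probabilities
        sel = (rng.random(len(probs)) < probs).astype(float)  # sample a subset
        w = fit_predictor(x_tr, y_tr, sel)  # train predictor on the subset
        reward = -np.mean((x_val @ w - y_val) ** 2)  # validation performance
        if baseline is None:
            baseline = reward
        # REINFORCE: gradient of the Bernoulli log-likelihood of `sel`
        # with respect to theta is feats^T (sel - probs).
        grad = feats.T @ (sel - probs) / len(probs)
        theta += lr * (reward - baseline) * grad
        baseline = 0.9 * baseline + 0.1 * reward
    return sigmoid(feats @ theta)

# Synthetic usage example: 200 training points, the first 40 with noisy labels.
rng = np.random.default_rng(1)
true_w = np.array([1.0, -2.0])
x_train = rng.normal(size=(200, 2))
y_train = x_train @ true_w
y_train[:40] += rng.normal(scale=5.0, size=40)
x_valid = rng.normal(size=(50, 2))
y_valid = x_valid @ true_w
values = dvrl_sketch(x_train, y_train, x_valid, y_valid)
```

Each entry of `values` is the learned probability that the corresponding training sample is selected. Subtracting a moving-average baseline from the reward is a common variance-reduction choice for REINFORCE and is used here for that reason only.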


