The Re-Label Method For Data-Centric Machine Learning

02/09/2023
by   Tong Guo, et al.
0

In industry deep learning application, our manually labeled data has a certain number of noisy data. To solve this problem and achieve more than 90 score in dev dataset, we present a simple method to find the noisy data and re-label the noisy data by human, given the model predictions as references in human labeling. In this paper, we illustrate our idea for a broad set of deep learning tasks, includes classification, sequence tagging, object detection, sequence generation, click-through rate prediction. The experimental results and human evaluation results verify our idea.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/30/2021

Learning From How Human Correct

In industry NLP application, our manually labeled data has a certain num...
research
10/09/2021

X-model: Improving Data Efficiency in Deep Learning with A Minimax Model

To mitigate the burden of data labeling, we aim at improving data effici...
research
06/07/2019

Audio tagging with noisy labels and minimal supervision

This paper introduces Task 2 of the DCASE2019 Challenge, titled "Audio t...
research
10/20/2022

Semi-supervised object detection based on single-stage detector for thighbone fracture localization

The thighbone is the largest bone supporting the lower body. If the thig...
research
10/13/2021

Simple Attention Module based Speaker Verification with Iterative noisy label detection

Recently, the attention mechanism such as squeeze-and-excitation module ...
research
11/30/2022

Rethinking Out-of-Distribution Detection From a Human-Centric Perspective

Out-Of-Distribution (OOD) detection has received broad attention over th...
research
04/29/2019

Mixture of Pre-processing Experts Model for Noise Robust Deep Learning on Resource Constrained Platforms

Deep learning on an edge device requires energy efficient operation due ...

Please sign up or login with your details

Forgot password? Click here to reset