A Survey on Machine Learning Techniques for Auto Labeling of Video, Audio, and Text Data

09/08/2021
by   Shikun Zhang, et al.
0

Machine learning has been utilized to perform tasks in many different domains such as classification, object detection, image segmentation and natural language analysis. Data labeling has always been one of the most important tasks in machine learning. However, labeling large amounts of data increases the monetary cost in machine learning. As a result, researchers started to focus on reducing data annotation and labeling costs. Transfer learning was designed and widely used as an efficient approach that can reasonably reduce the negative impact of limited data, which in turn, reduces the data preparation cost. Even transferring previous knowledge from a source domain reduces the amount of data needed in a target domain. However, large amounts of annotated data are still demanded to build robust models and improve the prediction accuracy of the model. Therefore, researchers started to pay more attention on auto annotation and labeling. In this survey paper, we provide a review of previous techniques that focuses on optimized data annotation and labeling for video, audio, and text data.

READ FULL TEXT
research
11/05/2020

Language Model is All You Need: Natural Language Understanding as Question Answering

Different flavors of transfer learning have shown tremendous impact in a...
research
08/23/2021

Analyzing the Granularity and Cost of Annotation in Clinical Sequence Labeling

Well-annotated datasets, as shown in recent top studies, are becoming mo...
research
09/10/2019

Fine-grained Knowledge Fusion for Sequence Labeling Domain Adaptation

In sequence labeling, previous domain adaptation methods focus on the ad...
research
11/08/2018

A Survey on Data Collection for Machine Learning: a Big Data - AI Integration Perspective

Data collection is a major bottleneck in machine learning and an active ...
research
03/03/2020

Trained Model Fusion for Object Detection using Gating Network

The major approaches of transfer learning in computer vision have tried ...
research
08/20/2022

General-to-Specific Transfer Labeling for Domain Adaptable Keyphrase Generation

Training keyphrase generation (KPG) models requires a large amount of an...
research
08/26/2020

An End-to-End Attack on Text-based CAPTCHAs Based on Cycle-Consistent Generative Adversarial Network

As a widely deployed security scheme, text-based CAPTCHAs have become mo...

Please sign up or login with your details

Forgot password? Click here to reset