A Survey of Data Optimization for Problems in Computer Vision Datasets

10/21/2022
by   Zhijing Wan, et al.

Recent years have witnessed remarkable progress in artificial intelligence (AI), thanks to refined deep network structures, powerful computing devices, and large-scale labeled datasets. However, researchers have invested mainly in optimizing models and computing devices. As a result, good models and powerful computing devices are now readily available, while datasets remain stuck at an early stage: large in scale but low in quality. Data has thus become a major obstacle to AI development. Taking note of this, we dig deeper and find that some work on data optimization exists, but it is unstructured. These works focus on various problems in datasets and attempt to improve dataset quality by optimizing dataset structure in order to facilitate AI development. In this paper, we present the first review of recent advances in this area. First, we summarize and analyze various problems that exist in large-scale computer vision datasets. We then define data optimization and classify data optimization algorithms into three directions according to the form of optimization: data sampling, data subset selection, and active learning. Next, we organize these data optimization works according to the data problems they address, and provide a systematic and comparative description. Finally, we summarize the existing literature and propose some potential future research topics.

