Dataset Condensation with Distribution Matching

10/08/2021
by   Bo Zhao, et al.
4

Computational cost to train state-of-the-art deep models in many learning problems is rapidly increasing due to more sophisticated models and larger datasets. A recent promising direction to reduce training time is dataset condensation that aims to replace the original large training set with a significantly smaller learned synthetic set while preserving its information. While training deep models on the small set of condensed images can be extremely fast, their synthesis remains computationally expensive due to the complex bi-level optimization and second-order derivative computation. In this work, we propose a simple yet effective dataset condensation technique that requires significantly lower training cost with comparable performance by matching feature distributions of the synthetic and original training images in sampled embedding spaces. Thanks to its efficiency, we apply our method to more realistic and larger datasets with sophisticated neural architectures and achieve a significant performance boost while using larger synthetic training set. We also show various practical benefits of our method in continual learning and neural architecture search.

READ FULL TEXT

page 6

page 14

research
06/10/2020

Dataset Condensation with Gradient Matching

Efficient training of deep neural networks is an increasingly important ...
research
06/17/2020

Fine-Grained Stochastic Architecture Search

State-of-the-art deep networks are often too large to deploy on mobile d...
research
02/16/2021

Dataset Condensation with Differentiable Siamese Augmentation

In many machine learning problems, large-scale datasets have become the ...
research
06/15/2022

Condensing Graphs via One-Step Gradient Matching

As training deep learning models on large dataset takes a lot of time an...
research
05/24/2022

Semi-Parametric Deep Neural Networks in Linear Time and Memory

Recent advances in deep learning have been driven by large-scale paramet...
research
05/29/2023

Towards Efficient Deep Hashing Retrieval: Condensing Your Data via Feature-Embedding Matching

The expenses involved in training state-of-the-art deep hashing retrieva...
research
07/19/2023

Improved Distribution Matching for Dataset Condensation

Dataset Condensation aims to condense a large dataset into a smaller one...

Please sign up or login with your details

Forgot password? Click here to reset