Towards Mitigating Architecture Overfitting in Dataset Distillation

09/08/2023
by   Xuyang Zhong, et al.

Dataset distillation methods have demonstrated remarkable performance for neural networks trained with very limited training data. However, a significant challenge arises in the form of architecture overfitting: distilled training data synthesized with a specific network architecture (i.e., the training network) yields poor performance when used to train other network architectures (i.e., test networks). This paper addresses this issue and proposes a series of approaches in both architecture design and training schemes that can be adopted together to boost generalization performance across different network architectures on the distilled training data. We conduct extensive experiments to demonstrate the effectiveness and generality of our methods. In particular, across various scenarios involving different sizes of distilled data, our approaches achieve comparable or superior performance to existing methods when training on the distilled data using networks with larger capacities.


