Architecture, Dataset and Model-Scale Agnostic Data-free Meta-Learning

03/20/2023
by   Zixuan Hu, et al.
0

The goal of data-free meta-learning is to learn useful prior knowledge from a collection of pre-trained models without accessing their training data. However, existing works only solve the problem in parameter space, which (i) ignore the fruitful data knowledge contained in the pre-trained models; (ii) can not scale to large-scale pre-trained models; (iii) can only meta-learn pre-trained models with the same network architecture. To address those issues, we propose a unified framework, dubbed PURER, which contains: (1) ePisode cUrriculum inveRsion (ECI) during data-free meta training; and (2) invErsion calibRation following inner loop (ICFIL) during meta testing. During meta training, we propose ECI to perform pseudo episode training for learning to adapt fast to new unseen tasks. Specifically, we progressively synthesize a sequence of pseudo episodes by distilling the training data from each pre-trained model. The ECI adaptively increases the difficulty level of pseudo episodes according to the real-time feedback of the meta model. We formulate the optimization process of meta training with ECI as an adversarial form in an end-to-end manner. During meta testing, we further propose a simple plug-and-play supplement-ICFIL-only used during meta testing to narrow the gap between meta training and meta testing task distribution. Extensive experiments in various real-world scenarios show the superior performance of ours.

READ FULL TEXT

page 4

page 8

research
05/28/2023

Learning to Learn from APIs: Black-Box Data-Free Meta-Learning

Data-free meta-learning (DFML) aims to enable efficient learning of new ...
research
07/06/2020

Meta-Learning Symmetries by Reparameterization

Many successful deep learning architectures are equivariant to certain t...
research
04/13/2023

Generalizable Deep Learning Method for Suppressing Unseen and Multiple MRI Artifacts Using Meta-learning

Magnetic Resonance (MR) images suffer from various types of artifacts du...
research
04/20/2023

Learning Sample Difficulty from Pre-trained Models for Reliable Prediction

Large-scale pre-trained models have achieved remarkable success in a var...
research
12/20/2022

Robust and Resource-efficient Machine Learning Aided Viewport Prediction in Virtual Reality

360-degree panoramic videos have gained considerable attention in recent...
research
05/19/2021

Learning Graph Meta Embeddings for Cold-Start Ads in Click-Through Rate Prediction

Click-through rate (CTR) prediction is one of the most central tasks in ...
research
07/09/2022

Generating Pseudo-labels Adaptively for Few-shot Model-Agnostic Meta-Learning

Model-Agnostic Meta-Learning (MAML) is a famous few-shot learning method...

Please sign up or login with your details

Forgot password? Click here to reset