Learning to Learn from APIs: Black-Box Data-Free Meta-Learning

05/28/2023
by   Zixuan Hu, et al.

Data-free meta-learning (DFML) aims to enable efficient learning of new tasks by meta-learning from a collection of pre-trained models without access to their training data. Existing DFML work can only meta-learn from (i) white-box and (ii) small-scale pre-trained models (iii) with the same architecture, neglecting the more practical setting in which users have only inference access to APIs whose underlying models may have arbitrary architectures and scales. To address this, we propose a Bi-level Data-free Meta Knowledge Distillation (BiDf-MKD) framework that transfers more general meta knowledge from a collection of black-box APIs to a single meta model. Specifically, using only API queries, we invert each API to recover its training data via a zero-order gradient estimator and then perform meta-learning via a novel bi-level meta knowledge distillation structure, in which we design a boundary query set recovery technique to recover a more informative query set near the decision boundary. In addition, to encourage better generalization under limited API budgets, we propose task memory replay, which diversifies the underlying task distribution by covering more interpolated tasks. Extensive experiments in various real-world scenarios show the superior performance of our BiDf-MKD framework.
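To make the idea of a zero-order gradient estimator concrete, the following is a minimal sketch of one common variant (averaged random-direction finite differences), not the estimator used in the paper, whose exact form the abstract does not specify. All names here (`zeroth_order_grad`, `black_box_loss`, `recover_input`) and all hyperparameters are hypothetical, and the toy quadratic loss merely stands in for a loss computed from the outputs of a real query-only API.

```python
import random


def zeroth_order_grad(f, x, num_directions=20, mu=1e-3, rng=None):
    """Estimate the gradient of a black-box scalar function f at x.

    Only function values are used (no backpropagation): f is probed
    along random Gaussian directions and the finite-difference slopes
    are averaged. Illustrative variant, assumed for this sketch.
    """
    if rng is None:
        rng = random.Random(0)
    d = len(x)
    fx = f(x)  # one query at the current point
    grad = [0.0] * d
    for _ in range(num_directions):
        u = [rng.gauss(0.0, 1.0) for _ in range(d)]
        x_pert = [xi + mu * ui for xi, ui in zip(x, u)]
        slope = (f(x_pert) - fx) / mu  # directional derivative estimate
        for i in range(d):
            grad[i] += slope * u[i] / num_directions
    return grad


def black_box_loss(x):
    """Hypothetical stand-in for a loss built from API outputs.

    The caller can evaluate it but cannot differentiate through it.
    """
    target = [1.0, -2.0, 0.5]
    return sum((xi - ti) ** 2 for xi, ti in zip(x, target))


def recover_input(f, x0, steps=200, lr=0.05, seed=0):
    """Gradient descent on the input, using only queries to f."""
    rng = random.Random(seed)
    x = list(x0)
    for _ in range(steps):
        g = zeroth_order_grad(f, x, rng=rng)
        x = [xi - lr * gi for xi, gi in zip(x, g)]
    return x


x_init = [0.0, 0.0, 0.0]
x_rec = recover_input(black_box_loss, x_init)
```

Each gradient estimate costs `num_directions + 1` queries, which illustrates why a limited API budget matters in this setting: more directions reduce the variance of the estimate but consume more queries per update.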


research
03/20/2023

Architecture, Dataset and Model-Scale Agnostic Data-free Meta-Learning

The goal of data-free meta-learning is to learn useful prior knowledge f...
research
12/08/2022

General-Purpose In-Context Learning by Meta-Learning Transformers

Modern machine learning requires system designers to specify aspects of ...
research
07/16/2019

Meta-Learning for Black-box Optimization

Recently, neural networks trained as optimizers under the "learning to l...
research
07/07/2023

MALIBO: Meta-learning for Likelihood-free Bayesian Optimization

Bayesian optimization (BO) is a popular method to optimize costly black-...
research
08/03/2021

DeepFreeze: Cold Boot Attacks and High Fidelity Model Recovery on Commercial EdgeML Device

EdgeML accelerators like Intel Neural Compute Stick 2 (NCS) can enable e...
research
06/03/2023

Deep Classifier Mimicry without Data Access

Access to pre-trained models has recently emerged as a standard across n...
research
03/06/2023

Knowledge-embedded meta-learning model for lift coefficient prediction of airfoils

Aerodynamic performance evaluation is an important part of the aircraft ...
