Data-Efficient Learning via Minimizing Hyperspherical Energy

06/30/2022
by   Xiaofeng Cao, et al.
0

Deep learning on large-scale data is dominant nowadays. The unprecedented scale of data has been arguably one of the most important driving forces for the success of deep learning. However, there still exist scenarios where collecting data or labels could be extremely expensive, e.g., medical imaging and robotics. To fill up this gap, this paper considers the problem of data-efficient learning from scratch using a small amount of representative data. First, we characterize this problem by active learning on homeomorphic tubes of spherical manifolds. This naturally generates feasible hypothesis class. With homologous topological properties, we identify an important connection – finding tube manifolds is equivalent to minimizing hyperspherical energy (MHE) in physical geometry. Inspired by this connection, we propose a MHE-based active learning (MHEAL) algorithm, and provide comprehensive theoretical guarantees for MHEAL, covering convergence and generalization analysis. Finally, we demonstrate the empirical performance of MHEAL in a wide range of applications on data-efficient learning, including deep clustering, distribution matching, version space sampling and deep active learning.

READ FULL TEXT

page 3

page 15

research
10/16/2021

Deep Active Learning by Leveraging Training Dynamics

Active learning theories and methods have been extensively studied in cl...
research
10/29/2021

Convergence of Uncertainty Sampling for Active Learning

Uncertainty sampling in active learning is heavily used in practice to r...
research
02/03/2022

GALAXY: Graph-based Active Learning at the Extreme

Active learning is a label-efficient approach to train highly effective ...
research
01/12/2023

Forgetful Active Learning with Switch Events: Efficient Sampling for Out-of-Distribution Data

This paper considers deep out-of-distribution active learning. In practi...
research
03/25/2023

Deep Kernel Methods Learn Better: From Cards to Process Optimization

The ability of deep learning methods to perform classification and regre...
research
07/29/2022

A Survey of Learning on Small Data

Learning on big data brings success for artificial intelligence (AI), bu...
research
05/13/2023

An Active Learning-based Approach for Hosting Capacity Analysis in Distribution Systems

With the increasing amount of distributed energy resources (DERs) integr...

Please sign up or login with your details

Forgot password? Click here to reset