RD-DPP: Rate-Distortion Theory Meets Determinantal Point Process to Diversify Learning Data Samples

04/09/2023
by   Xiwen Chen, et al.
0

In some practical learning tasks, such as traffic video analysis, the number of available training samples is restricted by different factors, such as limited communication bandwidth and computation power; therefore, it is imperative to select diverse data samples that contribute the most to the quality of the learning system. One popular approach to selecting diverse samples is Determinantal Point Process (DPP). However, it suffers from a few known drawbacks, such as restriction of the number of samples to the rank of the similarity matrix, and not being customizable for specific learning tasks (e.g., multi-level classification tasks). In this paper, we propose a new way of measuring task-oriented diversity based on the Rate-Distortion (RD) theory, appropriate for multi-level classification. To this end, we establish a fundamental relationship between DPP and RD theory, which led to designing RD-DPP, an RD-based value function to evaluate the diversity gain of data samples. We also observe that the upper bound of the diversity of data selected by DPP has a universal trend of phase transition that quickly approaches its maximum point, then slowly converges to its final limits, meaning that DPP is beneficial only at the beginning of sample accumulation. We use this fact to design a bi-modal approach for sequential data selection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/13/2019

Multi-level Similarity Learning for Low-Shot Recognition

Low-shot learning indicates the ability to recognize unseen objects base...
research
06/04/2023

Learning on Bandwidth Constrained Multi-Source Data with MIMO-inspired DPP MAP Inference

This paper proposes a distributed version of Determinant Point Processin...
research
06/15/2021

A Value-Function-based Interior-point Method for Non-convex Bi-level Optimization

Bi-level optimization model is able to capture a wide range of complex l...
research
02/28/2022

Rate-Distortion Problems of the Poisson Process based on a Group-Theoretic Approach

We study rate-distortion problems of a Poisson process using a group the...
research
01/07/2019

A New Perspective on Machine Learning: How to do Perfect Supervised Learning

In this work, we introduce the concept of bandlimiting into the theory o...
research
12/02/2021

Trap of Feature Diversity in the Learning of MLPs

In this paper, we discover a two-phase phenomenon in the learning of mul...
research
02/14/2021

Multi-Level Fine-Tuning: Closing Generalization Gaps in Approximation of Solution Maps under a Limited Budget for Training Data

In scientific machine learning, regression networks have been recently a...

Please sign up or login with your details

Forgot password? Click here to reset