Coresets For Monotonic Functions with Applications to Deep Learning

02/21/2018
by Elad Tolochinsky, et al.

A coreset (or core-set) in this paper is a small weighted subset Q of the input set P with respect to a given monotonic function f:R→R that provably approximates its fitting loss ∑_{p∈P} f(p·x) for any given x∈R^d. Using Q we can obtain an approximation to the x^* that minimizes this loss, by running existing optimization algorithms on Q. We provide: (i) a lower bound that proves that there are sets with no coresets smaller than n=|P|, (ii) a proof that a small coreset of size near-logarithmic in n exists for any input P, under a natural assumption that holds e.g. for logistic regression and the sigmoid activation function, (iii) a generic algorithm that computes Q in O(nd + n log n) expected time, (iv) a novel technique for improving existing deep networks using such coresets, (v) extensive experimental results with open code.
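
To make the weighted-subset idea concrete, here is a minimal sketch in Python that compares the full fitting loss ∑_{p∈P} f(p·x) with its weighted estimate on a sampled subset Q. It is not the algorithm from the paper. It uses plain importance sampling with inverse-probability weights, and the sampling distribution (half uniform, half proportional to the point norms) is a hypothetical choice made only for illustration. The function names `fitting_loss` and `sample_weighted_subset` are likewise assumptions, and f is taken to be the logistic loss log(1 + e^z).

```python
import numpy as np


def logistic_loss(z):
    # f(z) = log(1 + e^z), a monotonic function, computed stably.
    return np.logaddexp(0.0, z)


def fitting_loss(P, x, weights=None, f=logistic_loss):
    # Sum of f(p . x) over the rows p of P, optionally weighted.
    values = f(P @ x)
    if weights is None:
        return float(values.sum())
    return float(weights @ values)


def sample_weighted_subset(P, m, rng):
    # Hypothetical sampling distribution (NOT the paper's bound):
    # half uniform, half proportional to the Euclidean norm of each point.
    n = P.shape[0]
    norms = np.linalg.norm(P, axis=1)
    total = norms.sum()
    if total > 0:
        prob = 0.5 / n + 0.5 * norms / total
    else:
        prob = np.full(n, 1.0 / n)
    idx = rng.choice(n, size=m, replace=True, p=prob)
    # Inverse-probability weights make the weighted loss an unbiased
    # estimate of the full loss for every fixed x.
    weights = 1.0 / (m * prob[idx])
    return P[idx], weights


rng = np.random.default_rng(0)
P = rng.normal(size=(5000, 5))   # n = 5000 points in R^5
x = rng.normal(size=5)           # an arbitrary query

Q, w = sample_weighted_subset(P, m=1000, rng=rng)
full = fitting_loss(P, x)
approx = fitting_loss(Q, x, w)
rel_err = abs(full - approx) / full
print(f"full={full:.2f} approx={approx:.2f} relative error={rel_err:.4f}")
```

A sample like this only approximates the loss well for a fixed x with high probability. The guarantee described in the abstract is stronger, since it must hold for any given x, which is why the paper needs a dedicated construction and an assumption on the input.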

research
06/09/2020

Coresets for Near-Convex Functions

Coreset is usually a small weighted subset of n input points in R^d, tha...
research
03/09/2020

Sets Clustering

The input to the sets-k-means problem is an integer k≥ 1 and a set P={P_...
research
04/06/2018

A comparison of deep networks with ReLU activation function and linear spline-type methods

Deep neural networks (DNNs) generate much richer function spaces than sh...
research
11/30/2015

Coresets for Kinematic Data: From Theorems to Real-Time Systems

A coreset (or core-set) of a dataset is its semantic compression with re...
research
11/04/2021

A Unified Approach to Coreset Learning

Coreset of a given dataset and loss function is usually a small weighted ...
research
01/28/2023

On the Lipschitz Constant of Deep Networks and Double Descent

Existing bounds on the generalization error of deep networks assume some...
research
03/06/2022

Coresets for Data Discretization and Sine Wave Fitting

In the monitoring problem, the input is an unbounded stream P=p_1,p_2⋯ o...
