Feature Selection with Annealing for Computer Vision and Big Data Learning

10/10/2013
by   Adrian Barbu, et al.
0

Many computer vision and medical imaging problems are faced with learning from large-scale datasets, with millions of observations and features. In this paper we propose a novel efficient learning scheme that tightens a sparsity constraint by gradually removing variables based on a criterion and a schedule. The attractive fact that the problem size keeps dropping throughout the iterations makes it particularly suitable for big data learning. Our approach applies generically to the optimization of any differentiable loss function, and finds applications in regression, classification and ranking. The resultant algorithms build variable screening into estimation and are extremely simple to implement. We provide theoretical guarantees of convergence and selection consistency. In addition, one dimensional piecewise linear response functions are used to account for nonlinearity and a second order prior is imposed on these functions to avoid overfitting. Experiments on real and synthetic data show that the proposed method compares very well with other state of the art methods in regression, classification and ranking while being computationally very efficient and scalable.

READ FULL TEXT

page 12

page 14

research
05/02/2023

Slow Kill for Big Data Learning

Big-data applications often involve a vast number of observations and fe...
research
03/12/2018

Scalable Algorithms for Learning High-Dimensional Linear Mixed Models

Linear mixed models (LMMs) are used extensively to model dependecies of ...
research
05/16/2016

Classification of Big Data with Application to Imaging Genetics

Big data applications, such as medical imaging and genetics, typically g...
research
09/23/2016

Efficient Feature Selection With Large and High-dimensional Data

Driven by the advances in technology, large and high-dimensional data ha...
research
02/11/2020

Training Efficient Network Architecture and Weights via Direct Sparsity Control

Artificial neural networks (ANNs) especially deep convolutional networks...
research
09/02/2019

Consistency of Ranking Estimators

The ranking problem is to order a collection of units by some unobserved...
research
05/12/2023

Extended ADMM for general penalized quantile regression with linear constraints in big data

Quantile regression (QR) can be used to describe the comprehensive relat...

Please sign up or login with your details

Forgot password? Click here to reset