Tricks and Plugins to GBM on Images and Sequences

03/01/2022
by Biyi Fang, et al.

Convolutional neural networks (CNNs) and transformers, which stack multiple processing layers and blocks to learn representations of data at multiple levels of abstraction, have been the most successful machine learning models in recent years. However, their millions of parameters and many blocks make them difficult to train, and finding an ideal architecture or tuning the parameters can take several days or weeks. In this paper, we propose a new algorithm for boosting deep convolutional neural networks (BoostCNN) that combines the merits of dynamic feature selection and BoostCNN, along with a new family of algorithms that combine boosting and transformers. To learn these new models, we introduce subgrid selection and importance sampling strategies, and we propose a set of algorithms that incorporate boosting weights into a deep learning architecture based on a least squares objective function. These algorithms not only reduce the manual effort required to find an appropriate network architecture but also yield superior performance and lower running time. Experiments show that the proposed methods outperform benchmarks on several fine-grained classification tasks.
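The abstract names three ingredients: a least squares boosting objective, subgrid selection, and importance sampling. The sketch below only illustrates how these ideas can fit together in general and is not the paper's BoostCNN algorithm. It assumes linear weak learners in place of CNNs, treats a random subset of feature columns as a stand-in for a subgrid, and samples columns with probability proportional to their correlation with the current residual. The names `fit_ls_boosting`, `predict`, `subgrid_size`, and `shrinkage` are hypothetical.

```python
import numpy as np


def fit_ls_boosting(X, y, n_rounds=30, subgrid_size=4, shrinkage=0.5, seed=0):
    """Least-squares boosting with importance-sampled feature subsets.

    Assumption: each weak learner is a linear model on a few columns of X,
    standing in for a CNN trained on a selected subgrid of the image.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    k = min(subgrid_size, d)
    prediction = np.zeros(n)
    ensemble = []  # list of (column indices, scaled coefficients incl. bias)
    for _ in range(n_rounds):
        # For the loss 0.5 * (y - f)^2 the negative gradient is the residual.
        residual = y - prediction
        # Importance scores: columns more correlated with the residual are
        # sampled more often. The epsilon keeps every probability positive.
        scores = np.abs(X.T @ residual) + 1e-12
        cols = rng.choice(d, size=k, replace=False, p=scores / scores.sum())
        # Fit the weak learner to the residual by ordinary least squares.
        A = np.column_stack([X[:, cols], np.ones(n)])
        coef, *_ = np.linalg.lstsq(A, residual, rcond=None)
        prediction += shrinkage * (A @ coef)
        ensemble.append((cols, shrinkage * coef))
    return ensemble


def predict(ensemble, X):
    """Sum the contributions of all weak learners."""
    out = np.zeros(X.shape[0])
    for cols, coef in ensemble:
        out += X[:, cols] @ coef[:-1] + coef[-1]
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.normal(size=(200, 16))   # synthetic features, not image data
    y = X @ rng.normal(size=16)      # synthetic regression target
    model = fit_ls_boosting(X, y)
    print("baseline MSE:", np.mean(y ** 2))
    print("boosted MSE: ", np.mean((y - predict(model, X)) ** 2))
```

Because each round projects the residual onto the span of the selected columns and takes a shrunken step along that projection, the training squared error cannot increase from one round to the next. In a deep learning setting, the least squares fit would be replaced by training a network against the same residual targets.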

research
02/22/2023

A Gradient Boosting Approach for Training Convolutional and Deep Neural Networks

Deep learning has revolutionized the computer vision and image classific...
research
08/19/2021

Do Vision Transformers See Like Convolutional Neural Networks?

Convolutional neural networks (CNNs) have so far been the de-facto model...
research
05/21/2020

SpotFast Networks with Memory Augmented Lateral Transformers for Lipreading

This paper presents a novel deep learning architecture for word-level li...
research
11/25/2022

Learning General Audio Representations with Large-Scale Training of Patchout Audio Transformers

The success of supervised deep learning methods is largely due to their ...
research
11/12/2021

Convolutional Nets Versus Vision Transformers for Diabetic Foot Ulcer Classification

This paper compares well-established Convolutional Neural Networks (CNNs...
research
04/30/2022

Combined Learning of Neural Network Weights for Privacy in Collaborative Tasks

We introduce CoLN, Combined Learning of Neural network weights, a novel ...
research
01/07/2020

Inferring Convolutional Neural Networks' accuracies from their architectural characterizations

Convolutional Neural Networks (CNNs) have shown strong promise for analy...
