Joint Training of Neural Network Ensembles

02/12/2019
by Andrew M. Webb, et al.

We examine the practice of joint training for neural network ensembles, in which a multi-branch architecture is trained via a single loss. This approach has recently gained traction, with claims of greater accuracy per parameter along with increased parallelism. We introduce a family of novel loss functions that generalizes multiple previously proposed approaches, and we use it to study the theoretical and empirical properties of joint training. These losses interpolate smoothly between independent and joint training of predictors, and they reveal that joint training has several disadvantages not observed in prior work. However, with appropriate regularization via our proposed loss, the method shows new promise in resource-limited scenarios and fault-tolerant systems, e.g., IoT and edge devices. Finally, we discuss how these results may have implications for general multi-branch architectures such as ResNeXt and Inception.
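To make the idea of interpolating between independent and joint training concrete, here is a minimal sketch of one plausible form such a loss could take. It is an illustration under stated assumptions, not necessarily the exact loss defined in the paper: the mixing weight `lam`, the function names, the choice of cross-entropy, and the averaging of member probabilities to form the ensemble output are all assumptions made for this example.

```python
import numpy as np

def cross_entropy(probs, labels):
    """Mean negative log-likelihood of integer labels under row-wise probabilities."""
    n = labels.shape[0]
    return float(-np.mean(np.log(probs[np.arange(n), labels] + 1e-12)))

def interpolated_loss(member_probs, labels, lam):
    """Hypothetical interpolated ensemble loss (assumed form, for illustration).

    member_probs: array of shape (M, N, C) with class probabilities
                  from M branches for N examples and C classes.
    lam:          mixing weight in [0, 1]; lam = 0 scores each branch
                  separately (independent training), lam = 1 scores only
                  the averaged ensemble output (joint training).
    """
    # Assumption: the ensemble output is the mean of member probabilities.
    ensemble_probs = member_probs.mean(axis=0)
    joint = cross_entropy(ensemble_probs, labels)
    independent = float(np.mean([cross_entropy(p, labels) for p in member_probs]))
    return lam * joint + (1.0 - lam) * independent
```

With `lam` strictly between 0 and 1, the per-branch term acts as a regularizer on the joint objective: each branch is still pushed to be a reasonable predictor on its own, which is the property one would want if individual branches may be dropped or fail at deployment time.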

