Optimisation of Overparametrized Sum-Product Networks

05/20/2019
by   Martin Trapp, et al.
0

It seems to be a pearl of conventional wisdom that parameter learning in deep sum-product networks is surprisingly fast compared to shallow mixture models. This paper examines the effects of overparameterization in sum-product networks on the speed of parameter optimisation. Using theoretical analysis and empirical experiments, we show that deep sum-product networks exhibit an implicit acceleration compared to their shallow counterpart. In fact, gradient-based optimisation in deep sum-product networks is equal to gradient ascend with adaptive and time-varying learning rates and additional momentum terms.

READ FULL TEXT
research
10/10/2021

Fitting large mixture models using stochastic component selection

Traditional methods for unsupervised learning of finite mixture models r...
research
10/19/2018

Leveraging Product as an Activation Function in Deep Networks

Product unit neural networks (PUNNs) are powerful representational model...
research
11/09/2018

Deep Compression of Sum-Product Networks on Tensor Networks

Sum-product networks (SPNs) represent an emerging class of neural networ...
research
01/19/2017

Online Structure Learning for Sum-Product Networks with Gaussian Leaves

Sum-product networks have recently emerged as an attractive representati...
research
01/03/2016

A Unified Approach for Learning the Parameters of Sum-Product Networks

We present a unified approach for learning the parameters of Sum-Product...
research
02/02/2023

Over-parameterised Shallow Neural Networks with Asymmetrical Node Scaling: Global Convergence Guarantees and Feature Learning

We consider the optimisation of large and shallow neural networks via gr...
research
04/15/2021

Fast Private Parameter Learning and Evaluation for Sum-Product Networks

A sum-product network (SPN) is a graphical model that allows several typ...

Please sign up or login with your details

Forgot password? Click here to reset