Deep Unfolding: Model-Based Inspiration of Novel Deep Architectures

09/09/2014
by   John R. Hershey, et al.
0

Model-based methods and deep neural networks have both been tremendously successful paradigms in machine learning. In model-based methods, problem domain knowledge can be built into the constraints of the model, typically at the expense of difficulties during inference. In contrast, deterministic deep neural networks are constructed in such a way that inference is straightforward, but their architectures are generic and it is unclear how to incorporate knowledge. This work aims to obtain the advantages of both approaches. To do so, we start with a model-based approach and an associated inference algorithm, and unfold the inference iterations as layers in a deep network. Rather than optimizing the original model, we untie the model parameters across layers, in order to create a more powerful network. The resulting architecture can be trained discriminatively to perform accurate inference within a fixed network size. We show how this framework allows us to interpret conventional networks as mean-field inference in Markov random fields, and to obtain new architectures by instead using belief propagation as the inference algorithm. We then show its application to a non-negative matrix factorization model that incorporates the problem-domain knowledge that sound sources are additive. Deep unfolding of this model yields a new kind of non-negative deep neural network, that can be trained using a multiplicative backpropagation-style update algorithm. We present speech enhancement experiments showing that our approach is competitive with conventional neural networks despite using far fewer parameters.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/06/2018

A Peek Into the Hidden Layers of a Convolutional Neural Network Through a Factorization Lens

Despite their increasing popularity and success in a variety of supervis...
research
02/08/2019

Speech enhancement with variational autoencoders and alpha-stable distributions

This paper focuses on single-channel semi-supervised speech enhancement....
research
11/16/2022

Learning unfolded networks with a cyclic group structure

Deep neural networks lack straightforward ways to incorporate domain kno...
research
05/18/2018

Knowledge Discovery from Layered Neural Networks based on Non-negative Task Decomposition

Interpretability has become an important issue in the machine learning f...
research
02/22/2019

Probabilistic Inference of Binary Markov Random Fields in Spiking Neural Networks through Mean-field Approximation

Recent studies have suggested that the cognitive process of the human br...
research
05/15/2017

Learning Probabilistic Programs Using Backpropagation

Probabilistic modeling enables combining domain knowledge with learning ...
research
12/13/2016

Information Extraction with Character-level Neural Networks and Free Noisy Supervision

We present an architecture for information extraction from text that aug...

Please sign up or login with your details

Forgot password? Click here to reset