A differentiable Gaussian Prototype Layer for explainable Segmentation

06/25/2023
by Michael Gerstenberger, et al.

We introduce a Gaussian Prototype Layer for gradient-based prototype learning and demonstrate two novel network architectures for explainable segmentation, one of which relies on region proposals. Both models are evaluated on agricultural datasets. While Gaussian Mixture Models (GMMs) have been used before to model the latent distributions of neural networks, they are typically fitted using the EM algorithm. The proposed prototype layer instead relies on gradient-based optimization and hence allows for end-to-end training. This facilitates development and makes it possible to exploit the full potential of a trainable deep feature extractor. We show that the layer can be used as a novel building block for explainable neural networks. We employ our Gaussian Prototype Layer in (1) a model where prototypes are detected in the latent grid and (2) a model inspired by Fast-RCNN with SLIC superpixels as region proposals. The former achieves performance similar to the state of the art, while the latter offers more precise prototype localization at the cost of slightly lower accuracy. By introducing a gradient-based GMM layer, we combine the benefits of end-to-end training with the simplicity and theoretical foundation of GMMs, which will allow existing semi-supervised learning strategies to be adapted for prototypical part models in the future.
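To illustrate what fitting a GMM by gradients rather than by EM can look like, here is a minimal sketch in plain Python. It is not the authors' implementation. The diagonal covariances, the logit parameterization of the mixture weights, and the choice to update only the means are assumptions made for brevity, and all function names are hypothetical.

```python
import math

def gaussian_log_density(x, mean, log_var):
    """Log-density of a diagonal Gaussian at feature vector x."""
    total = 0.0
    for xi, mi, lv in zip(x, mean, log_var):
        total += -0.5 * (math.log(2.0 * math.pi) + lv + (xi - mi) ** 2 / math.exp(lv))
    return total

def responsibilities(x, means, log_vars, logits):
    """Return (posterior over prototypes, log p(x)) for one feature vector."""
    # Mixture weights are a softmax over unconstrained logits (assumption).
    m = max(logits)
    log_z = m + math.log(sum(math.exp(l - m) for l in logits))
    log_joint = [
        l - log_z + gaussian_log_density(x, mu, lv)
        for l, mu, lv in zip(logits, means, log_vars)
    ]
    # Log-sum-exp for a numerically stable mixture log-likelihood.
    top = max(log_joint)
    log_px = top + math.log(sum(math.exp(v - top) for v in log_joint))
    return [math.exp(v - log_px) for v in log_joint], log_px

def negative_log_likelihood(xs, means, log_vars, logits):
    """Mean negative log-likelihood of the data under the mixture."""
    return -sum(responsibilities(x, means, log_vars, logits)[1] for x in xs) / len(xs)

def mean_gradient_step(xs, means, log_vars, logits, lr=0.1):
    """One gradient-descent step on the mean NLL with respect to the means only."""
    grads = [[0.0] * len(mu) for mu in means]
    for x in xs:
        resp, _ = responsibilities(x, means, log_vars, logits)
        for k, (mu, lv) in enumerate(zip(means, log_vars)):
            for d in range(len(mu)):
                # d(-log p(x)) / d mu_kd = -r_k * (x_d - mu_kd) / var_kd
                grads[k][d] -= resp[k] * (x[d] - mu[d]) / math.exp(lv[d]) / len(xs)
    return [[m_d - lr * g for m_d, g in zip(mu, gk)] for mu, gk in zip(means, grads)]

# Toy 1-D "latent features" forming two clusters, and two prototypes.
xs = [[0.0], [0.2], [4.0], [4.2]]
means = [[1.0], [3.0]]
log_vars = [[0.0], [0.0]]
logits = [0.0, 0.0]

nll_before = negative_log_likelihood(xs, means, log_vars, logits)
new_means = mean_gradient_step(xs, means, log_vars, logits)
nll_after = negative_log_likelihood(xs, new_means, log_vars, logits)
```

In a real network the gradients would come from automatic differentiation and would also flow into the feature extractor, which is what makes end-to-end training possible. The hand-written gradient here only makes the update rule explicit.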

