MIST: Multiple Instance Spatial Transformer Network

11/26/2018
by   Baptiste Angles, et al.
0

We propose a deep network that can be trained to tackle image reconstruction and classification problems that involve detection of multiple object instances, without any supervision regarding their whereabouts. The network learns to extract the most significant top-K patches, and feeds these patches to a task-specific network -- e.g., auto-encoder or classifier -- to solve a domain specific problem. The challenge in training such a network is the non-differentiable top-K selection process. To address this issue, we lift the training optimization problem by treating the result of top-K selection as a slack variable, resulting in a simple, yet effective, multi-stage training. Our method is able to learn to detect recurrent structures in the training dataset by learning to reconstruct images. It can also learn to localize structures when only knowledge on the occurrence of the object is provided, and in doing so it outperforms the state-of-the-art.

READ FULL TEXT

page 1

page 2

page 6

page 7

page 8

research
09/19/2022

Attentive Symmetric Autoencoder for Brain MRI Segmentation

Self-supervised learning methods based on image patch reconstruction hav...
research
01/09/2022

Auto-Encoder based Co-Training Multi-View Representation Learning

Multi-view learning is a learning problem that utilizes the various repr...
research
02/04/2022

Image-to-Image MLP-mixer for Image Reconstruction

Neural networks are highly effective tools for image reconstruction prob...
research
04/14/2023

Masked Pre-Training of Transformers for Histology Image Analysis

In digital pathology, whole slide images (WSIs) are widely used for appl...
research
04/10/2017

Weakly-Supervised Spatial Context Networks

We explore the power of spatial context as a self-supervisory signal for...
research
07/13/2018

Domain-Specific Human-Inspired Binarized Statistical Image Features for Iris Recognition

Binarized statistical image features (BSIF) have been successfully used ...
research
05/22/2019

Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

In this work, we consider one challenging training time attack by modify...

Please sign up or login with your details

Forgot password? Click here to reset