Chained Predictions Using Convolutional Neural Networks

05/08/2016
by Georgia Gkioxari, et al.

In this paper, we present an adaptation of the sequence-to-sequence model for structured output prediction in vision tasks. In this model, the output variables for a given input are predicted sequentially using neural networks. The prediction for each output variable depends not only on the input but also on the previously predicted output variables. The model is applied to spatial localization tasks and uses convolutional neural networks (CNNs) for processing input images and a multi-scale deconvolutional architecture for making spatial predictions at each time step. We explore the impact of weight sharing with a recurrent connection matrix between consecutive predictions, and compare it to a formulation where these weights are not tied. Untied weights are particularly suited to problems with a fixed-size structure, where different classes of output are predicted in different steps. We show that chained predictions achieve top-performing results on human pose estimation from single images and videos.
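The chaining idea in the abstract can be illustrated with a small sketch. This is not the authors' architecture: a flat feature vector stands in for CNN image features, a class index stands in for a spatial prediction, and every name here (`chained_predict`, `make_step_params`, `w_in`, `w_rec`, `w_prev`, `w_out`) is hypothetical. It only shows how each step can condition on the input and on earlier predictions, and how tied and untied weights differ.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * v for w, v in zip(row, vector)) for row in matrix]


def random_matrix(rng, rows, cols):
    """Random weights; a trained model would learn these instead."""
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def make_step_params(rng, d_in, d_hidden, n_classes):
    """Hypothetical parameter set for one prediction step."""
    return {
        "w_in": random_matrix(rng, d_hidden, d_in),        # input features -> hidden
        "w_rec": random_matrix(rng, d_hidden, d_hidden),   # previous hidden -> hidden
        "w_prev": random_matrix(rng, d_hidden, n_classes), # previous output -> hidden
        "w_out": random_matrix(rng, n_classes, d_hidden),  # hidden -> scores
    }


def chained_predict(features, step_params):
    """Predict one output per entry of step_params, in order.

    Each step sees the input features, the running hidden state, and a
    one-hot encoding of the previous step's prediction.
    """
    d_hidden = len(step_params[0]["w_in"])
    n_classes = len(step_params[0]["w_out"])
    hidden = [0.0] * d_hidden
    prev_onehot = [0.0] * n_classes  # nothing has been predicted before step 0
    predictions = []
    for p in step_params:
        from_input = matvec(p["w_in"], features)
        from_state = matvec(p["w_rec"], hidden)
        from_prev = matvec(p["w_prev"], prev_onehot)
        hidden = [math.tanh(a + b + c)
                  for a, b, c in zip(from_input, from_state, from_prev)]
        scores = matvec(p["w_out"], hidden)
        y = max(range(n_classes), key=scores.__getitem__)  # greedy choice
        predictions.append(y)
        prev_onehot = [1.0 if k == y else 0.0 for k in range(n_classes)]
    return predictions


rng = random.Random(0)
d_in, d_hidden, n_classes, n_steps = 8, 16, 5, 4
features = [rng.gauss(0.0, 1.0) for _ in range(d_in)]

# Tied weights: the same parameters are reused at every step.
tied = [make_step_params(rng, d_in, d_hidden, n_classes)] * n_steps
# Untied weights: each step has its own parameters.
untied = [make_step_params(rng, d_in, d_hidden, n_classes) for _ in range(n_steps)]

tied_predictions = chained_predict(features, tied)
untied_predictions = chained_predict(features, untied)
print(tied_predictions, untied_predictions)
```

In the untied variant each step can specialize, which matches the abstract's remark about fixed-size structures where each step predicts a different class of output (for example, one body joint per step in pose estimation).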


