DeepAI AI Chat
Log In Sign Up

Associative Embedding: End-to-End Learning for Joint Detection and Grouping

by   Alejandro Newell, et al.
University of Michigan
Tsinghua University

We introduce associative embedding, a novel method for supervising convolutional neural networks for the task of detection and grouping. A number of computer vision problems can be framed in this manner including multi-person pose estimation, instance segmentation, and multi-object tracking. Usually the grouping of detections is achieved with multi-stage pipelines, instead we propose an approach that teaches a network to simultaneously output detections and group assignments. This technique can be easily integrated into any state-of-the-art network architecture that produces pixel-wise predictions. We show how to apply this method to both multi-person pose estimation and instance segmentation and report state-of-the-art performance for multi-person pose on the MPII and MS-COCO datasets.


page 1

page 4

page 5

page 6

page 7

page 9


Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation

Multi-person pose estimation is challenging because it localizes body ke...

Multi-person Articulated Tracking with Spatial and Temporal Embeddings

We propose a unified framework for multi-person pose estimation and trac...

Neural Scene Decomposition for Multi-Person Motion Capture

Learning general image representations has proven key to the success of ...

A Global to Local Double Embedding Method for Multi-person Pose Estimation

Multi-person pose estimation is a fundamental and challenging problem to...

Recurrent Pixel Embedding for Instance Grouping

We introduce a differentiable, end-to-end trainable framework for solvin...

Code Repositories


face keypoints deteciton based on stackedhourglass

view repo