
The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation

by Guillem Brasó, et al.
Technische Universität München

We introduce CenterGroup, an attention-based framework to estimate human poses from a set of identity-agnostic keypoints and person center predictions in an image. Our approach uses a transformer to obtain context-aware embeddings for all detected keypoints and centers and then applies multi-head attention to directly group joints into their corresponding person centers. While most bottom-up methods rely on non-learnable clustering at inference, CenterGroup uses a fully differentiable attention mechanism that we train end-to-end together with our keypoint detector. As a result, our method obtains state-of-the-art performance with up to 2.5x faster inference time than competing bottom-up methods. Our code is available at .
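The grouping idea in the abstract can be illustrated with a minimal sketch. This is not the CenterGroup implementation: it assumes the context-aware embeddings have already been computed, uses a single attention head instead of multi-head attention, and omits the transformer encoder and all training. The names `group_keypoints`, `center_emb`, `kp_emb`, `kp_xy`, and `kp_type` are hypothetical. Each person center acts as a query over all detected keypoints of a given joint type, and the joint location is read out as the attention-weighted average of the candidate coordinates.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def group_keypoints(center_emb, kp_emb, kp_xy, kp_type, num_types):
    """Assign keypoints to person centers with scaled dot-product attention.

    Illustrative assumption, not the authors' code: one attention head,
    embeddings given as plain lists of floats.
    Returns poses[c][t] = (x, y) for center c and joint type t,
    or None if no keypoint of type t was detected.
    """
    d = len(center_emb[0])
    poses = []
    for query in center_emb:
        pose = []
        for t in range(num_types):
            # Candidate keypoints of joint type t (identity-agnostic).
            idx = [i for i, ty in enumerate(kp_type) if ty == t]
            if not idx:
                pose.append(None)
                continue
            # Scaled dot product between the center query and each keypoint key.
            scores = [
                sum(q * k for q, k in zip(query, kp_emb[i])) / math.sqrt(d)
                for i in idx
            ]
            weights = softmax(scores)
            # Attention-weighted average of the candidate coordinates.
            x = sum(w * kp_xy[i][0] for w, i in zip(weights, idx))
            y = sum(w * kp_xy[i][1] for w, i in zip(weights, idx))
            pose.append((x, y))
        poses.append(pose)
    return poses


# Toy data: two people, two joint types, hand-made embeddings chosen so that
# each keypoint clearly matches one center.
center_emb = [[10.0, 0.0], [0.0, 10.0]]
kp_emb = [[10.0, 0.0], [0.0, 10.0], [10.0, 0.0], [0.0, 10.0]]
kp_xy = [(1.0, 1.0), (5.0, 5.0), (1.0, 2.0), (5.0, 6.0)]
kp_type = [0, 0, 1, 1]

poses = group_keypoints(center_emb, kp_emb, kp_xy, kp_type, num_types=2)
```

Because every step is a differentiable operation (dot products, softmax, weighted sums), a grouping of this kind can in principle be trained jointly with the keypoint detector, which is the contrast the abstract draws with non-learnable clustering at inference.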




MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network

In this paper, we present MultiPoseNet, a novel bottom-up multi-person p...

QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query

We propose a sparse end-to-end multi-person pose regression framework, t...

DRMC: A Generalist Model with Dynamic Routing for Multi-Center PET Image Synthesis

Multi-center positron emission tomography (PET) image synthesis aims at ...

YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss

We introduce YOLO-pose, a novel heatmap-free approach for joint detectio...

Learning Spatial Context with Graph Neural Network for Multi-Person Pose Grouping

Bottom-up approaches for image-based multi-person pose estimation consis...

Greedy Offset-Guided Keypoint Grouping for Human Pose Estimation

We propose a simple yet reliable bottom-up approach with a good trade-of...

Can WiFi Estimate Person Pose?

WiFi human sensing has achieved great progress in indoor localization, a...