YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss

04/14/2022
by   Debapriya Maji, et al.
0

We introduce YOLO-pose, a novel heatmap-free approach for joint detection, and 2D multi-person pose estimation in an image based on the popular YOLO object detection framework. Existing heatmap based two-stage approaches are sub-optimal as they are not end-to-end trainable and training relies on a surrogate L1 loss that is not equivalent to maximizing the evaluation metric, i.e. Object Keypoint Similarity (OKS). Our framework allows us to train the model end-to-end and optimize the OKS metric itself. The proposed model learns to jointly detect bounding boxes for multiple persons and their corresponding 2D poses in a single forward pass and thus bringing in the best of both top-down and bottom-up approaches. Proposed approach doesn't require the postprocessing of bottom-up approaches to group detected keypoints into a skeleton as each bounding box has an associated pose, resulting in an inherent grouping of the keypoints. Unlike top-down approaches, multiple forward passes are done away with since all persons are localized along with their pose in a single inference. YOLO-pose achieves new state-of-the-art results on COCO validation (90.2 bottom-up approaches in a single forward pass without flip test, multi-scale testing, or any other test time augmentation. All experiments and results reported in this paper are without any test time augmentation, unlike traditional approaches that use flip-test and multi-scale testing to boost performance. Our training codes will be made publicly available at https://github.com/TexasInstruments/edgeai-yolov5 and https://github.com/TexasInstruments/edgeai-yolox

READ FULL TEXT

page 1

page 4

page 5

research
11/09/2020

EfficientPose – An efficient, accurate and scalable end-to-end 6D multi object pose estimation approach

In this paper we introduce EfficientPose, a new approach for 6D object p...
research
11/18/2019

DirectPose: Direct End-to-End Multi-Person Pose Estimation

We propose the first direct end-to-end multi-person pose estimation fram...
research
10/27/2022

Joint Multi-Person Body Detection and Orientation Estimation via One Unified Embedding

Human body orientation estimation (HBOE) is widely applied into various ...
research
08/25/2022

Bottom-Up 2D Pose Estimation via Dual Anatomical Centers for Small-Scale Persons

In multi-person 2D pose estimation, the bottom-up methods simultaneously...
research
10/11/2021

The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation

We introduce CenterGroup, an attention-based framework to estimate human...
research
12/20/2021

BAPose: Bottom-Up Pose Estimation with Disentangled Waterfall Representations

We propose BAPose, a novel bottom-up approach that achieves state-of-the...
research
12/15/2022

QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query

We propose a sparse end-to-end multi-person pose regression framework, t...

Please sign up or login with your details

Forgot password? Click here to reset