AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation

05/11/2022
by   Xu Cao, et al.
0

Movement and pose assessment of newborns lets experienced pediatricians predict neurodevelopmental disorders, allowing early intervention for related diseases. However, most of the newest AI approaches for human pose estimation methods focus on adults, lacking publicly benchmark for infant pose estimation. In this paper, we fill this gap by proposing infant pose dataset and Deep Aggregation Vision Transformer for human pose estimation, which introduces a fast trained full transformer framework without using convolution operations to extract features in the early stages. It generalizes Transformer + MLP to high-resolution deep layer aggregation within feature maps, thus enabling information fusion between different vision levels. We pre-train AggPose on COCO pose dataset and apply it on our newly released large-scale infant pose estimation dataset. The results show that AggPose could effectively learn the multi-scale features among different resolutions and significantly improve the performance of infant pose estimation. We show that AggPose outperforms hybrid model HRFormer and TokenPose in the infant pose estimation dataset. Moreover, our AggPose outperforms HRFormer by 0.7 average. Our code is available at github.com/SZAR-LAB/AggPose.

READ FULL TEXT
research
02/25/2019

Deep High-Resolution Representation Learning for Human Pose Estimation

This is an official pytorch implementation of Deep High-Resolution Repre...
research
05/03/2022

Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation

Pose estimation plays a critical role in human-centered vision applicati...
research
02/09/2023

HybrIK-Transformer

HybrIK relies on a combination of analytical inverse kinematics and deep...
research
09/17/2021

GraFormer: Graph Convolution Transformer for 3D Pose Estimation

Exploiting relations among 2D joints plays a crucial role yet remains se...
research
03/30/2015

Globally Tuned Cascade Pose Regression via Back Propagation with Application in 2D Face Pose Estimation and Heart Segmentation in 3D CT Images

Recently, a successful pose estimation algorithm, called Cascade Pose Re...
research
03/20/2020

Multi-Person Pose Estimation with Enhanced Feature Aggregation and Selection

We propose a novel Enhanced Feature Aggregation and Selection network (E...
research
06/21/2023

LPFormer: LiDAR Pose Estimation Transformer with Multi-Task Network

In this technical report, we present the 1st place solution for the 2023...

Please sign up or login with your details

Forgot password? Click here to reset