A Video-based End-to-end Pipeline for Non-nutritive Sucking Action Recognition and Segmentation in Young Infants

03/29/2023
by   Shaotong Zhu, et al.

We present an end-to-end computer vision pipeline to detect non-nutritive sucking (NNS), an infant sucking pattern with no nutrition delivered, as a potential biomarker for developmental delays, using off-the-shelf baby monitor video footage. One barrier to clinical (or algorithmic) assessment of NNS is its sparsity: experts must wade through hours of footage to find minutes of relevant activity. Our NNS activity segmentation algorithm addresses this problem by identifying periods of NNS with high certainty: up to 94.0% average precision and 84.9% average recall across 30 heterogeneous 60 s clips, drawn from our manually annotated NNS clinical in-crib dataset of 183 hours of overnight baby monitor footage from 19 infants. The method builds on an underlying NNS action recognition algorithm, which uses spatiotemporal deep learning networks and infant-specific pose estimation and achieves 94.9% accuracy in binary classification of a balanced set of 960 2.5 s NNS vs. non-NNS clips. On our second, independent, and public NNS in-the-wild dataset, NNS action recognition reaches 92.3% accuracy, and NNS segmentation achieves 90.8% precision and 84.2% recall.
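The abstract does not spell out how window-level recognition is turned into segments, so the following is only a minimal sketch of one plausible approach, not the authors' method. It assumes non-overlapping 2.5 s windows, a 0.5 decision threshold, and window-level precision and recall; the function names `windows_to_segments` and `precision_recall` are hypothetical, and the scores are made up for illustration.

```python
from typing import List, Tuple

WINDOW_S = 2.5  # window length in seconds, matching the 2.5 s clips in the abstract


def windows_to_segments(
    probs: List[float], threshold: float = 0.5, window_s: float = WINDOW_S
) -> List[Tuple[float, float]]:
    """Merge consecutive positive windows into (start_s, end_s) segments.

    `probs` holds one classifier score per non-overlapping window
    (an assumption; the paper may use a different windowing scheme).
    """
    segments: List[Tuple[float, float]] = []
    start = None
    for i, p in enumerate(probs):
        if p >= threshold and start is None:
            start = i * window_s  # a new NNS segment begins here
        elif p < threshold and start is not None:
            segments.append((start, i * window_s))  # segment ended at this window
            start = None
    if start is not None:  # close a segment that runs to the end of the clip
        segments.append((start, len(probs) * window_s))
    return segments


def precision_recall(pred: List[bool], truth: List[bool]) -> Tuple[float, float]:
    """Window-level precision and recall for binary NNS labels."""
    tp = sum(p and t for p, t in zip(pred, truth))
    fp = sum(p and not t for p, t in zip(pred, truth))
    fn = sum(t and not p for p, t in zip(pred, truth))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Illustrative scores for six windows (15 s of video); values are invented.
scores = [0.1, 0.9, 0.8, 0.2, 0.7, 0.6]
truth = [False, True, True, True, True, False]
pred = [s >= 0.5 for s in scores]

print(windows_to_segments(scores))    # [(2.5, 7.5), (10.0, 15.0)]
print(precision_recall(pred, truth))  # (0.75, 0.75)
```

Under these assumptions, a 60 s clip yields 24 windows, and per-clip precision and recall can then be averaged across clips to give figures comparable in form to those quoted above.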


