Automated Temporal Segmentation of Orofacial Assessment Videos

08/22/2022
by   Saeid Alavi Naeini, et al.
6

Computer vision techniques can help automate or partially automate clinical examination of orofacial impairments to provide accurate and objective assessments. Towards the development of such automated systems, we evaluated two approaches to detect and temporally segment (parse) repetitions in orofacial assessment videos. Recorded videos of participants with amyotrophic lateral sclerosis (ALS) and healthy control (HC) individuals were obtained from the Toronto NeuroFace Dataset. Two approaches for repetition detection and parsing were examined: one based on engineered features from tracked facial landmarks and peak detection in the distance between the vermilion-cutaneous junction of the upper and lower lips (baseline analysis), and another using a pre-trained transformer-based deep learning model called RepNet (Dwibedi et al, 2020), which automatically detects periodicity, and parses periodic and semi-periodic repetitions in video data. In experimental evaluation of two orofacial assessments tasks, - repeating maximum mouth opening (OPEN) and repeating the sentence "Buy Bobby a Puppy" (BBP) - RepNet provided better parsing than the landmark-based approach, quantified by higher mean intersection-over-union (IoU) with respect to ground truth manual parsing. Automated parsing using RepNet also clearly separated HC and ALS participants based on the duration of BBP repetitions, whereas the landmark-based method could not.

READ FULL TEXT
research
10/25/2019

Toward an Automatic System for Computer-Aided Assessment in Facial Palsy

Importance: Machine learning (ML) approaches to facial landmark localiza...
research
08/06/2021

Deep Learning-based Biological Anatomical Landmark Detection in Colonoscopy Videos

Colonoscopy is a standard imaging tool for visualizing the entire gastro...
research
02/08/2023

Neonatal Face and Facial Landmark Detection from Video Recordings

This paper explores automated face and facial landmark detection of neon...
research
04/07/2021

Pretrained equivariant features improve unsupervised landmark discovery

Locating semantically meaningful landmark points is a crucial component ...
research
12/13/2020

Using Computer Vision to Automate Hand Detection and Tracking of Surgeon Movements in Videos of Open Surgery

Open, or non-laparoscopic surgery, represents the vast majority of all o...
research
09/19/2023

Fully automated landmarking and facial segmentation on 3D photographs

Three-dimensional facial stereophotogrammetry provides a detailed repres...
research
12/02/2021

TBN-ViT: Temporal Bilateral Network with Vision Transformer for Video Scene Parsing

Video scene parsing in the wild with diverse scenarios is a challenging ...

Please sign up or login with your details

Forgot password? Click here to reset