The Benefit of Distraction: Denoising Remote Vitals Measurements using Inverse Attention

by Ewa Nowara, et al.

Attention is a powerful concept in computer vision. End-to-end networks that learn to focus selectively on regions of an image or video often perform strongly. However, other image regions, while not necessarily containing the signal of interest, may contain useful context. We present an approach that exploits the idea that statistics of noise may be shared between the regions that contain the signal of interest and those that do not. Our technique uses the inverse of an attention mask to generate a noise estimate that is then used to denoise temporal observations. We apply this to the task of camera-based physiological measurement. A convolutional attention network is used to learn which regions of a video contain the physiological signal and generate a preliminary estimate. A noise estimate is obtained from the pixel intensities in the inverse regions of the learned attention mask; this in turn is used to refine the estimate of the physiological signal. We perform experiments on two large benchmark datasets and show that this approach produces state-of-the-art results, increasing the signal-to-noise ratio by up to 5.8 dB, reducing heart rate and breathing rate estimation error by as much as 30%, and generalizing to NIR videos without retraining.
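
The inverse-attention idea described above can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `inverse_attention_noise` and `denoise` are hypothetical, the attention mask is assumed to be given as weights in [0, 1] rather than learned by a convolutional attention network, and the abstract does not specify how the noise estimate refines the signal, so a simple least-squares projection is used here as a stand-in.

```python
import numpy as np

def inverse_attention_noise(frames, mask):
    """Estimate a noise time series from the regions the attention mask ignores.

    frames: array of shape (T, H, W) with pixel intensities over time.
    mask:   array of shape (H, W) with attention weights in [0, 1] (assumed given).
    """
    inv = 1.0 - mask                  # inverse attention: large where the signal is absent
    weights = inv / inv.sum()         # normalize so the result is a weighted spatial mean
    # Weighted mean over the spatial axes for every frame, giving shape (T,)
    return np.tensordot(frames, weights, axes=([1, 2], [0, 1]))

def denoise(prelim, noise):
    """Stand-in refinement (an assumption, not the paper's method): subtract the
    component of the preliminary estimate that is linearly explained by the noise."""
    p = prelim - prelim.mean()
    n = noise - noise.mean()
    gain = np.dot(p, n) / np.dot(n, n)   # least-squares scale of the noise in the estimate
    return p - gain * n
```

In this sketch, a preliminary estimate contaminated by an illumination change that also appears outside the attended region would have that shared component removed, which is the intuition behind sharing noise statistics across regions.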



