Visual Representations of Physiological Signals for Fake Video Detection

07/18/2022
by Kalin Stefanov, et al.

Realistic fake videos are a potential tool for spreading harmful misinformation, given our increasing online presence and information intake. This paper presents a multimodal learning-based method for distinguishing real videos from fake ones. The method combines information from three modalities: audio, video, and physiology. We investigate two strategies for combining the video and physiology modalities: augmenting the video with information from the physiology, or learning the fusion of the two modalities with a proposed Graph Convolutional Network architecture. Both strategies rely on a novel method for generating visual representations of physiological signals. Detection is then based on the dissimilarity between the audio and the modified video modalities. The proposed method is evaluated on two benchmark datasets, and the results show a significant increase in detection performance compared to previous methods.
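
The abstract's final decision step, detection from the dissimilarity between the audio and modified video modalities, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names, the use of fixed-length embedding vectors, the Euclidean distance as the dissimilarity measure, and the fixed threshold are all hypothetical. The networks that would produce the embeddings are out of scope here.

```python
import math


def euclidean_distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def classify_video(audio_embedding, video_embedding, threshold):
    """Label a clip by comparing its audio and (modified) video embeddings.

    Hypothetical rule: a large audio-video dissimilarity suggests a fake.
    The distance measure and the threshold are illustrative assumptions,
    not the paper's actual formulation.
    """
    distance = euclidean_distance(audio_embedding, video_embedding)
    label = "fake" if distance > threshold else "real"
    return label, distance


# Toy embeddings; real ones would come from learned encoders.
label, distance = classify_video([0.0, 0.0], [3.0, 4.0], threshold=1.0)
print(label, distance)
```

In practice the threshold would have to be chosen on held-out data, and the paper may well use a different dissimilarity measure or a learned decision rule.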

research
03/14/2020

Emotions Don't Lie: A Deepfake Detection Method using Audio-Visual Affective Cues

We present a learning-based multimodal method for detecting real and dee...
research
10/01/2020

DeepFakesON-Phys: DeepFakes Detection based on Heart Rate Estimation

This work introduces a novel DeepFake detection framework based on physi...
research
04/10/2023

Artifact magnification on deepfake videos increases human detection and subjective confidence

The development of technologies for easily and automatically falsifying ...
research
03/15/2019

Inserting Videos into Videos

In this paper, we introduce a new problem of manipulating a given video ...
research
08/26/2020

How Do the Hearts of Deep Fakes Beat? Deep Fake Source Detection via Interpreting Residuals with Biological Signals

Fake portrait video generation techniques have been posing a new threat ...
research
09/08/2023

EGOFALLS: A visual-audio dataset and benchmark for fall detection using egocentric cameras

Falls are significant and often fatal for vulnerable populations such as...
research
08/20/2021

Video Ads Content Structuring by Combining Scene Confidence Prediction and Tagging

Video ads segmentation and tagging is a challenging task due to two main...
