Bi-modal First Impressions Recognition using Temporally Ordered Deep Audio and Stochastic Visual Features

10/31/2016
by   Arulkumar Subramaniam, et al.
0

We propose a novel approach for First Impressions Recognition in terms of the Big Five personality-traits from short videos. The Big Five personality traits is a model to describe human personality using five broad categories: Extraversion, Agreeableness, Conscientiousness, Neuroticism and Openness. We train two bi-modal end-to-end deep neural network architectures using temporally ordered audio and novel stochastic visual features from few frames, without over-fitting. We empirically show that the trained models perform exceptionally well, even after training from a small sub-portions of inputs. Our method is evaluated in ChaLearn LAP 2016 Apparent Personality Analysis (APA) competition using ChaLearn LAP APA2016 dataset and achieved excellent performance.

READ FULL TEXT

page 3

page 6

research
08/18/2023

Audio-Visual Glance Network for Efficient Video Recognition

Deep learning has made significant strides in video understanding tasks,...
research
05/17/2020

A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer

Dense video captioning aims to localize and describe important events in...
research
11/09/2016

Audio Visual Speech Recognition using Deep Recurrent Neural Networks

In this work, we propose a training algorithm for an audio-visual automa...
research
11/01/2019

Multimodal Video-based Apparent Personality Recognition Using Long Short-Term Memory and Convolutional Neural Networks

Personality computing and affective computing, where the recognition of ...
research
11/22/2020

CORAL: Colored structural representation for bi-modal place recognition

Place recognition is indispensable for drift-free localization system. D...
research
02/11/2019

GET-AID: Visual Recognition of Human Rights Abuses via Global Emotional Traits

In the era of social media and big data, the use of visual evidence to d...
research
05/18/2020

End-to-End Lip Synchronisation

The goal of this work is to synchronise audio and video of a talking fac...

Please sign up or login with your details

Forgot password? Click here to reset