Multi-modal Ensemble Models for Predicting Video Memorability

02/01/2021
by   Tony Zhao, et al.
0

Modeling media memorability has been a consistent challenge in the field of machine learning. The Predicting Media Memorability task in MediaEval2020 is the latest benchmark among similar challenges addressing this topic. Building upon techniques developed in previous iterations of the challenge, we developed ensemble methods with the use of extracted video, image, text, and audio features. Critically, in this work we introduce and demonstrate the efficacy and high generalizability of extracted audio embeddings as a feature for the task of predicting media memorability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/31/2020

Leveraging Audio Gestalt to Predict Media Memorability

Memorability determines what evanesces into emptiness, and what worms it...
research
06/15/2023

Multi-modal Hate Speech Detection using Machine Learning

With the continuous growth of internet users and media content, it is ve...
research
05/14/2023

Unraveling Cold Start Enigmas in Predictive Analytics for OTT Media: Synergistic Meta-Insights and Multimodal Ensemble Mastery

The cold start problem is a common challenge in various domains, includi...
research
06/17/2017

Truly Multi-modal YouTube-8M Video Classification with Video, Audio, and Text

The YouTube-8M video classification challenge requires teams to classify...
research
03/28/2018

Topic Modeling Based Multi-modal Depression Detection

Major depressive disorder is a common mental disorder that affects almos...
research
08/02/2016

CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016

This paper presents the method that underlies our submission to the untr...
research
12/07/2022

Experiences from the MediaEval Predicting Media Memorability Task

The Predicting Media Memorability task in the MediaEval evaluation campa...

Please sign up or login with your details

Forgot password? Click here to reset