Exploring the multimodal information from video content using deep learning features of appearance, audio and action for video recommendation

11/21/2020
by   A. Almeida, et al.
0

Following the popularisation of media streaming, a number of video streaming services are continuously buying new video content to mine the potential profit from them. As such, the newly added content has to be handled well to be recommended to suitable users. In this paper, we address the new item cold-start problem by exploring the potential of various deep learning features to provide video recommendations. The deep learning features investigated include features that capture the visual-appearance, audio and motion information from video content. We also explore different fusion methods to evaluate how well these feature modalities can be combined to fully exploit the complementary information captured by them. Experiments on a real-world video dataset for movie recommendations show that deep learning features outperform hand-crafted features. In particular, recommendations generated with deep learning audio features and action-centric deep learning features are superior to MFCC and state-of-the-art iDT features. In addition, the combination of various deep learning features with hand-crafted features and textual metadata yields significant improvement in recommendations compared to combining only the former.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/20/2016

Deep Motion Features for Visual Tracking

Robust visual tracking is a challenging computer vision problem, with ma...
research
08/08/2019

Moviescope: Large-scale Analysis of Movies using Multiple Modalities

Film media is a rich form of artistic expression. Unlike photography, an...
research
09/23/2020

Cosine Similarity of Multimodal Content Vectors for TV Programmes

Multimodal information originates from a variety of sources: audiovisual...
research
05/07/2020

White Paper: Recommendations for immersive accessibility services

This paper provides recommendations on how to integrate accessibility so...
research
01/17/2017

Fusing Deep Learned and Hand-Crafted Features of Appearance, Shape, and Dynamics for Automatic Pain Estimation

Automatic continuous time, continuous value assessment of a patient's pa...
research
10/26/2020

Multimodal Topic Learning for Video Recommendation

Facilitated by deep neural networks, video recommendation systems have m...
research
07/15/2020

Content-based Recommendations for Radio Stations with Deep Learned Audio Fingerprints

The world of linear radio broadcasting is characterized by a wide variet...

Please sign up or login with your details

Forgot password? Click here to reset