Predicting the Popularity of Micro-videos with Multimodal Variational Encoder-Decoder Framework

03/28/2020
by Yaochen Zhu, et al.

As an emerging type of user-generated content, micro-videos drastically enrich people's entertainment experiences and social interactions. However, the popularity pattern of an individual micro-video remains elusive to researchers. One major challenge is that the potential popularity of a micro-video tends to fluctuate under the impact of various external factors, which makes it highly uncertain. In addition, since micro-videos are mainly uploaded by individuals who lack professional techniques, multiple types of noise may be present and obscure useful information. In this paper, we propose a multimodal variational encoder-decoder (MMVED) framework for micro-video popularity prediction tasks. MMVED learns a stochastic Gaussian embedding of a micro-video that is informative about its popularity level while preserving its inherent uncertainties. Moreover, through the optimization of a deep variational information bottleneck lower-bound (IBLBO), the learned hidden representation is shown to be maximally expressive about the popularity target while maximally compressive of the noise in micro-video features. Furthermore, the Bayesian product-of-experts principle is applied to the multimodal encoder, so that the decision to keep or discard information is made jointly from all available modalities. Extensive experiments conducted on a public dataset and on a dataset we collected from Xigua demonstrate the effectiveness of the proposed MMVED framework.
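To make the product-of-experts idea more concrete, the following is a minimal sketch of how per-modality Gaussian estimates could be fused into a single stochastic embedding. It is not the authors' implementation: the diagonal-covariance form, the optional standard-normal prior expert, and the names `product_of_experts` and `sample_embedding` are assumptions introduced here for illustration. Under those assumptions, the fused precision is the sum of the experts' precisions, and the fused mean is the precision-weighted average of their means, so a modality that is uncertain about a dimension (large variance) contributes little to it.

```python
import math
import random


def product_of_experts(mus, logvars, include_prior=True):
    """Fuse diagonal Gaussian experts N(mu_m, var_m) into one Gaussian.

    mus, logvars: lists of M vectors (one per modality), each of length D.
    include_prior: if True, add a standard-normal expert (an assumption
    made for this sketch, not something stated in the abstract).
    Returns (fused_mu, fused_var) as lists of length D.
    """
    dim = len(mus[0])
    experts = list(zip(mus, logvars))
    if include_prior:
        # Standard normal: mean 0 and log-variance 0 in every dimension.
        experts.append(([0.0] * dim, [0.0] * dim))

    fused_mu, fused_var = [], []
    for d in range(dim):
        # Precision is the inverse variance: exp(-logvar).
        precisions = [math.exp(-lv[d]) for _, lv in experts]
        total = sum(precisions)
        weighted = sum(p * mu[d] for p, (mu, _) in zip(precisions, experts))
        fused_var.append(1.0 / total)
        fused_mu.append(weighted / total)
    return fused_mu, fused_var


def sample_embedding(mu, var, rng):
    """Draw z = mu + sigma * eps with eps ~ N(0, I) (reparameterization)."""
    return [m + math.sqrt(v) * rng.gauss(0.0, 1.0) for m, v in zip(mu, var)]


# Hypothetical two-modality example (e.g. visual and acoustic features).
visual_mu, visual_logvar = [1.0, 0.0], [0.0, 0.0]
acoustic_mu, acoustic_logvar = [1.0, 2.0], [0.0, math.log(0.25)]

mu, var = product_of_experts(
    [visual_mu, acoustic_mu], [visual_logvar, acoustic_logvar]
)
z = sample_embedding(mu, var, random.Random(0))
```

In the second dimension of this example the hypothetical acoustic expert has the smallest variance, so the fused mean is pulled towards its estimate; this mirrors the abstract's statement that the decision to keep or discard information is made with all modalities considered together.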


research
07/15/2021

Cross-modal Variational Auto-encoder for Content-based Micro-video Background Music Recommendation

In this paper, we propose a cross-modal variational auto-encoder (CMVAE)...
research
12/30/2021

A Benchmark Dataset for Micro-video Thumbnail Selection

The thumbnail, as the first sight of a micro-video, plays a pivotal role...
research
02/07/2022

Towards Micro-video Thumbnail Selection via a Multi-label Visual-semantic Embedding Model

The thumbnail, as the first sight of a micro-video, plays a pivotal role...
research
05/18/2021

Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media

Billions of photos are uploaded to the web daily through various types o...
research
05/06/2022

Implicit semantic-based personalized micro-videos recommendation

With the rapid development of mobile Internet and big data, a huge amoun...
research
01/28/2021

Playable Video Generation

This paper introduces the unsupervised learning problem of playable vide...
