Towards Micro-video Thumbnail Selection via a Multi-label Visual-semantic Embedding Model

02/07/2022
by   Liu Bo, et al.
0

The thumbnail, as the first sight of a micro-video, plays a pivotal role in attracting users to click and watch. While in the real scenario, the more the thumbnails satisfy the users, the more likely the micro-videos will be clicked. In this paper, we aim to select the thumbnail of a given micro-video that meets most users` interests. Towards this end, we present a multi-label visual-semantic embedding model to estimate the similarity between the pair of each frame and the popular topics that users are interested in. In this model, the visual and textual information is embedded into a shared semantic space, whereby the similarity can be measured directly, even the unseen words. Moreover, to compare the frame to all words from the popular topics, we devise an attention embedding space associated with the semantic-attention projection. With the help of these two embedding spaces, the popularity score of a frame, which is defined by the sum of similarity scores over the corresponding visual information and popular topic pairs, is achieved. Ultimately, we fuse the visual representation score and the popularity score of each frame to select the attractive thumbnail for the given micro-video. Extensive experiments conducted on a real-world dataset have well-verified that our model significantly outperforms several state-of-the-art baselines.

READ FULL TEXT

page 1

page 2

page 8

research
12/30/2021

A Benchmark Dataset for Micro-video Thumbnail Selection

The thumbnail, as the first sight of a micro-video, plays a pivotal role...
research
08/27/2019

Personalized Hashtag Recommendation for Micro-videos

Personalized hashtag recommendation methods aim to suggest users hashtag...
research
04/16/2020

Multiple Visual-Semantic Embedding for Video Retrieval from Query Sentence

Visual-semantic embedding aims to learn a joint embedding space where re...
research
03/28/2020

Predicting the Popularity of Micro-videos with Multimodal Variational Encoder-Decoder Framework

As an emerging type of user-generated content, micro-video drastically e...
research
08/09/2022

Improving Micro-video Recommendation by Controlling Position Bias

As the micro-video apps become popular, the numbers of micro-videos and ...
research
06/10/2020

A novel sentence embedding based topic detection method for micro-blog

Topic detection is a challenging task, especially without knowing the ex...
research
03/31/2016

The Open World of Micro-Videos

Micro-videos are six-second videos popular on social media networks with...

Please sign up or login with your details

Forgot password? Click here to reset