MINI-Net: Multiple Instance Ranking Network for Video Highlight Detection

07/20/2020
by Fa-Ting Hong, et al.

We address weakly supervised video highlight detection: learning to detect the more attractive segments in training videos given only each video's event label, without the expensive supervision of manually annotated highlight segments. Although this avoids manually localizing highlight segments, weakly supervised modeling is challenging, because an everyday video can contain highlight segments of multiple event types, e.g., skiing and surfing. In this work, we propose casting weakly supervised video highlight detection for a given specific event as the learning of a multiple instance ranking network (MINI-Net). We consider each video as a bag of segments, so the proposed MINI-Net learns to assign a higher highlight score to a positive bag, which contains highlight segments of the specific event, than to negative bags, which are irrelevant to that event. In particular, we form a max-max ranking loss to obtain a reliable relative comparison between the most likely positive segment instance and the hardest negative segment instance. With this max-max ranking loss, our MINI-Net effectively leverages all segment information to acquire a more distinct video feature representation for localizing the highlight segments of a specific event in a video. Extensive experimental results on three challenging public benchmarks validate the efficacy of our multiple instance ranking approach.
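The max-max comparison described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the hinge form and the `margin` parameter below are assumptions, as are the function name and the example scores; in the actual network the per-segment scores would come from a learned scoring model rather than hand-written lists.

```python
from typing import Sequence


def max_max_ranking_loss(
    positive_bag_scores: Sequence[float],
    negative_bag_scores: Sequence[float],
    margin: float = 1.0,  # assumed hyperparameter, not stated in the abstract
) -> float:
    """Hinge-style ranking loss between two bags of segment scores.

    Compares the highest-scoring segment of the positive bag (the most
    likely highlight) with the highest-scoring segment of the negative
    bag (the hardest negative).
    """
    if len(positive_bag_scores) == 0 or len(negative_bag_scores) == 0:
        raise ValueError("each bag must contain at least one segment score")

    top_positive = max(positive_bag_scores)      # most likely positive instance
    hardest_negative = max(negative_bag_scores)  # hardest negative instance

    # Zero loss once the top positive beats the hardest negative by `margin`.
    return max(0.0, margin - top_positive + hardest_negative)


# Hypothetical segment scores for one positive and one negative video.
positive_bag = [0.1, 0.9, 0.3]
negative_bag = [0.2, 0.4, 0.1]
loss = max_max_ranking_loss(positive_bag, negative_bag, margin=1.0)
print(f"max-max ranking loss: {loss:.3f}")  # approximately 0.5 for these scores
```

Only the two maxima contribute to the loss value here, which is why the comparison is driven by the most confident positive segment and the most confusable negative segment.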


