Zero-Shot Anticipation for Instructional Activities

12/06/2018
by   Fadime Sener, et al.
6

How can we teach a robot to predict what will happen next for an activity it has never seen before? We address the problem of zero-shot anticipation by presenting a hierarchical model that generalizes instructional knowledge from large-scale text-corpora and transfers the knowledge to the visual domain. Given a portion of an instructional video, our model predicts coherent and plausible actions multiple steps into the future, all in rich natural language. To demonstrate the anticipation capabilities of our model, we introduce the Tasty Videos dataset, a collection of 2511 recipes for zero-shot learning, recognition and anticipation.

READ FULL TEXT

page 1

page 3

page 8

page 12

page 18

page 19

research
06/06/2021

Learning Video Models from Text: Zero-Shot Anticipation for Procedural Actions

Can we teach a robot to recognize and make predictions for activities th...
research
09/14/2022

Natural Language Inference Prompts for Zero-shot Emotion Classification in Text across Corpora

Within textual emotion classification, the set of relevant labels depend...
research
02/01/2023

Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization

Contrastive Language-Image Pretraining (CLIP) has demonstrated impressiv...
research
12/08/2019

Zero-shot Recognition of Complex Action Sequences

Zero-shot video classification for fine-grained activity recognition has...
research
07/17/2023

Video-Mined Task Graphs for Keystep Recognition in Instructional Videos

Procedural activity understanding requires perceiving human actions in t...
research
01/22/2020

Zero-Shot Activity Recognition with Videos

In this paper, we examined the zero-shot activity recognition task with ...
research
05/23/2023

Prompt position really matters in few-shot and zero-shot NLU tasks

Prompt-based models have made remarkable advancements in the fields of z...

Please sign up or login with your details

Forgot password? Click here to reset