Unsupervised Task Graph Generation from Instructional Video Transcripts

02/17/2023
by   Lajanugen Logeswaran, et al.
0

This work explores the problem of generating task graphs of real-world activities. Different from prior formulations, we consider a setting where text transcripts of instructional videos performing a real-world activity (e.g., making coffee) are provided and the goal is to identify the key steps relevant to the task as well as the dependency relationship between these key steps. We propose a novel task graph generation approach that combines the reasoning capabilities of instruction-tuned language models along with clustering and ranking components to generate accurate task graphs in a completely unsupervised manner. We show that the proposed approach generates more accurate task graphs compared to a supervised learning approach on tasks from the ProceL and CrossTask datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/17/2023

Multimodal Subtask Graph Generation from Instructional Videos

Real-world tasks consist of multiple inter-dependent subtasks (e.g., a d...
research
06/30/2015

Unsupervised Learning from Narrated Instruction Videos

We address the problem of automatically learning the main steps to compl...
research
07/17/2023

Video-Mined Task Graphs for Keystep Recognition in Instructional Videos

Procedural activity understanding requires perceiving human actions in t...
research
07/29/2023

RoCar: A Relationship Network-based Evaluation Method to Large Language Models

Large language models (LLMs) have received increasing attention. However...
research
09/18/2023

LLM4Jobs: Unsupervised occupation extraction and standardization leveraging Large Language Models

Automated occupation extraction and standardization from free-text job p...
research
01/25/2023

Improving Graph Generation by Restricting Graph Bandwidth

Deep graph generative modeling has proven capable of learning the distri...
research
07/15/2022

FLOWGEN: Fast and slow graph generation

We present FLOWGEN, a graph-generation model inspired by the dual-proces...

Please sign up or login with your details

Forgot password? Click here to reset