HA-ViD: A Human Assembly Video Dataset for Comprehensive Assembly Knowledge Understanding

07/09/2023
by   Hao Zheng, et al.
0

Understanding comprehensive assembly knowledge from videos is critical for futuristic ultra-intelligent industry. To enable technological breakthrough, we present HA-ViD - the first human assembly video dataset that features representative industrial assembly scenarios, natural procedural knowledge acquisition process, and consistent human-robot shared annotations. Specifically, HA-ViD captures diverse collaboration patterns of real-world assembly, natural human behaviors and learning progression during assembly, and granulate action annotations to subject, action verb, manipulated object, target object, and tool. We provide 3222 multi-view, multi-modality videos (each video contains one assembly task), 1.5M frames, 96K temporal labels and 2M spatial labels. We benchmark four foundational video understanding tasks: action recognition, action segmentation, object detection and multi-object tracking. Importantly, we analyze their performance for comprehending knowledge in assembly progress, process efficiency, task collaboration, skill parameters and human intention. Details of HA-ViD is available at: https://iai-hrc.github.io/ha-vid.

READ FULL TEXT

page 2

page 5

page 12

page 13

page 16

page 17

page 18

page 24

research
04/17/2023

ATTACH Dataset: Annotated Two-Handed Assembly Actions for Human Action Understanding

With the emergence of collaborative robots (cobots), human-robot collabo...
research
03/24/2023

Aligning Step-by-Step Instructional Diagrams to Video Demonstrations

Multimodal alignment facilitates the retrieval of instances from one mod...
research
11/16/2021

IKEA Object State Dataset: A 6DoF object pose estimation dataset and benchmark for multi-state assembly objects

Utilizing 6DoF(Degrees of Freedom) pose information of an object and its...
research
03/07/2023

Challenges of the Creation of a Dataset for Vision Based Human Hand Action Recognition in Industrial Assembly

This work presents the Industrial Hand Action Dataset V1, an industrial ...
research
06/09/2023

How Object Information Improves Skeleton-based Human Action Recognition in Assembly Tasks

As the use of collaborative robots (cobots) in industrial manufacturing ...
research
10/20/2022

VideoPipe 2022 Challenge: Real-World Video Understanding for Urban Pipe Inspection

Video understanding is an important problem in computer vision. Currentl...
research
10/09/2021

Scene Editing as Teleoperation: A Case Study in 6DoF Kit Assembly

Studies in robot teleoperation have been centered around action specific...

Please sign up or login with your details

Forgot password? Click here to reset