MovieGraphs: Towards Understanding Human-Centric Situations from Videos

12/19/2017
by Paul Vicol, et al.

There is growing interest in artificial intelligence in building socially intelligent robots. This requires machines to have the ability to "read" people's emotions, motivations, and other factors that affect behavior. Towards this goal, we introduce a novel dataset called MovieGraphs, which provides detailed, graph-based annotations of social situations depicted in movie clips. Each graph consists of several types of nodes that capture who is present in the clip, their emotional and physical attributes, their relationships (e.g., parent/child), and the interactions between them. Most interactions are associated with topics that provide additional details, and with reasons that give motivations for actions. In addition, most interactions and many attributes are grounded in the video with time stamps. We provide a thorough analysis of our dataset, showing interesting common-sense correlations between different social aspects of scenes, as well as across scenes over time. We propose a method for querying videos and text with graphs, and show that: 1) our graphs contain rich and sufficient information to summarize and localize each scene; and 2) subgraphs allow us to describe situations at an abstract level and retrieve multiple semantically relevant situations. We also propose methods for interaction understanding via ordering, and for reason understanding. MovieGraphs is the first benchmark to focus on inferred properties of human-centric situations, and opens up an exciting avenue towards socially intelligent AI agents.
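
To make the graph structure described in the abstract more concrete, the following is a minimal sketch in Python. It is an illustration under stated assumptions, not the authors' data format or retrieval method: the class names `Node` and `ClipGraph`, the function `retrieve`, the clip identifiers, and all labels are hypothetical. The abstract's learned graph-based querying is replaced here by a plain set-containment check on (node type, label) pairs.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional

# Node types named in the abstract; the exact strings are an assumption.
NODE_TYPES = {"character", "attribute", "relationship",
              "interaction", "topic", "reason"}


@dataclass(frozen=True)
class Node:
    node_id: int
    node_type: str
    label: str
    # Optional (start, end) time stamps in seconds, for grounded nodes.
    time_span: Optional[tuple[float, float]] = None


@dataclass
class ClipGraph:
    """Hypothetical container for one clip's annotation graph."""
    clip_id: str
    nodes: dict[int, Node] = field(default_factory=dict)
    edges: set[tuple[int, int]] = field(default_factory=set)

    def add_node(self, node_type: str, label: str,
                 time_span: Optional[tuple[float, float]] = None) -> int:
        if node_type not in NODE_TYPES:
            raise ValueError(f"unknown node type: {node_type}")
        node_id = len(self.nodes)
        self.nodes[node_id] = Node(node_id, node_type, label, time_span)
        return node_id

    def add_edge(self, src: int, dst: int) -> None:
        if src not in self.nodes or dst not in self.nodes:
            raise KeyError("both endpoints must already be in the graph")
        self.edges.add((src, dst))

    def typed_labels(self) -> set[tuple[str, str]]:
        # Flatten the graph to (type, label) pairs, ignoring edges.
        return {(n.node_type, n.label) for n in self.nodes.values()}


def retrieve(query: set[tuple[str, str]],
             graphs: list[ClipGraph]) -> list[str]:
    """Return ids of clips whose graph contains every queried pair.

    A deliberately simplified stand-in for subgraph-based retrieval.
    """
    return [g.clip_id for g in graphs if query <= g.typed_labels()]


# Illustrative, made-up annotations for two clips.
g1 = ClipGraph("clip_001")
parent = g1.add_node("character", "mother")
child = g1.add_node("character", "son")
rel = g1.add_node("relationship", "parent")
hug = g1.add_node("interaction", "hugs", time_span=(12.0, 15.5))
why = g1.add_node("reason", "he was upset")
for src, dst in [(parent, rel), (rel, child), (parent, hug),
                 (hug, child), (hug, why)]:
    g1.add_edge(src, dst)

g2 = ClipGraph("clip_002")
boss = g2.add_node("character", "manager")
yell = g2.add_node("interaction", "yells at")
g2.add_edge(boss, yell)

query = {("relationship", "parent"), ("interaction", "hugs")}
matches = retrieve(query, [g1, g2])
```

Because the query ignores edges and character identities, it describes a situation at an abstract level ("a parent hugs someone") and can match any clip containing those elements, which loosely mirrors the role the abstract assigns to subgraphs.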

research
03/10/2020

PANDA: A Gigapixel-level Human-centric Video Dataset

We present PANDA, the first gigaPixel-level humAN-centric viDeo dAtaset,...
research
03/23/2019

An End-to-End Network for Generating Social Relationship Graphs

Socially-intelligent agents are of growing interest in artificial intell...
research
06/13/2017

The "something something" video database for learning and evaluating visual common sense

Neural networks trained on datasets such as ImageNet have led to major a...
research
05/11/2023

Towards a Computational Analysis of Suspense: Detecting Dangerous Situations

Suspense is an important tool in storytelling to keep readers engaged an...
research
08/08/2019

"Conservatives Overfit, Liberals Underfit": The Social-Psychological Control of Affect and Uncertainty

The presence of artificial agents in human social networks is growing. F...
research
06/22/2013

Affect Control Processes: Intelligent Affective Interaction using a Partially Observable Markov Decision Process

This paper describes a novel method for building affectively intelligent...