Using Motion and Internal Supervision in Object Recognition

12/13/2018
by   Daniel Harari, et al.
30

In this thesis we address two related aspects of visual object recognition: the use of motion information, and the use of internal supervision, to help unsupervised learning. These two aspects are inter-related in the current study, since image motion is used for internal supervision, via the detection of spatiotemporal events of active-motion and the use of tracking. Most current work in object recognition deals with static images during both learning and recognition. In contrast, we are interested in a dynamic scene where visual processes, such as detecting motion events and tracking, contribute spatiotemporal information, which is useful for object attention, motion segmentation, 3-D understanding and object interactions. We explore the use of these sources of information in both learning and recognition processes. In the first part of the work, we demonstrate how motion can be used for adaptive detection of object-parts in dynamic environments, while automatically learning new object appearances and poses. In the second and main part of the study we develop methods for using specific types of visual motion to solve two difficult problems in unsupervised visual learning: learning to recognize hands by their appearance and by context, and learning to extract direction of gaze. We use our conclusions in this part to propose a model for several aspects of learning by human infants from their visual environment.

READ FULL TEXT

page 12

page 29

page 32

page 35

page 37

page 38

page 39

page 40

research
09/01/2021

From simple innate biases to complex visual concepts

Early in development, infants learn to solve visual problems that are hi...
research
12/06/2019

Continual egocentric object recognition

We are interested in the problem of continual object recognition in a se...
research
05/07/2015

Learning to See by Moving

The dominant paradigm for feature learning in computer vision relies on ...
research
09/01/2021

A model for discovering 'containment' relations

Rapid developments in the fields of learning and object recognition have...
research
03/31/2023

Learning Internal Representations of 3D Transformations from 2D Projected Inputs

When interacting in a three dimensional world, humans must estimate 3D s...
research
12/23/2015

Mid-level Representation for Visual Recognition

Visual Recognition is one of the fundamental challenges in AI, where the...
research
05/20/2018

Object Localization and Motion Transfer learning with Capsules

Inspired by CapsNet's routing-by-agreement mechanism, with its ability t...

Please sign up or login with your details

Forgot password? Click here to reset