Ikshana: A Theory of Human Scene Understanding Mechanism

In recent years, deep neural networks achieved state-of-the-art performance on many computer vision tasks. The two most commonly observed drawbacks of these deep neural networks are: the requirement of a massive amount of labeled data and a vast number of parameters. In this work, we propose a theory named Ikshana, to explain the functioning of the human brain, while humans understand a natural scene/image. We have designed an architecture named IkshanaNet and evaluated on the Cityscapes pixel-level semantic segmentation benchmark, to show how to implement our theory in practice. The results showed that the Ikshana theory could perform with less training data. Also, through some experiments evaluated on the validation set, we showed that the Ikshana theory can significantly reduce the number of parameters of the network. In conclusion, a deep neural network designed by following the Ikshana theory will learn better vector representations of the image, useful for any computer vision task.

READ FULL TEXT
research
01/25/2023

Connecting metrics for shape-texture knowledge in computer vision

Modern artificial neural networks, including convolutional neural networ...
research
09/21/2021

PDFNet: Pointwise Dense Flow Network for Urban-Scene Segmentation

In recent years, using a deep convolutional neural network (CNN) as a fe...
research
02/12/2015

Over-Sampling in a Deep Neural Network

Deep neural networks (DNN) are the state of the art on many engineering ...
research
11/29/2018

Bootstrapping Deep Neural Networks from Image Processing and Computer Vision Pipelines

Complex image processing and computer vision systems often consist of a ...
research
06/14/2017

LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation

Pixel-wise semantic segmentation for visual scene understanding not only...
research
06/08/2023

Degraded Polygons Raise Fundamental Questions of Neural Network Perception

It is well-known that modern computer vision systems often exhibit behav...
research
03/30/2022

How Deep is Your Art: An Experimental Study on the Limits of Artistic Understanding in a Single-Task, Single-Modality Neural Network

Mathematical modeling and aesthetic rule extraction of works of art are ...

Please sign up or login with your details

Forgot password? Click here to reset