Superpixel-based Domain-Knowledge Infusion in Computer Vision

05/20/2021
by   Gunjan Chhablani, et al.
0

Superpixels are higher-order perceptual groups of pixels in an image, often carrying much more information than raw pixels. There is an inherent relational structure to the relationship among different superpixels of an image. This relational information can convey some form of domain information about the image, e.g. relationship between superpixels representing two eyes in a cat image. Our interest in this paper is to construct computer vision models, specifically those based on Deep Neural Networks (DNNs) to incorporate these superpixels information. We propose a methodology to construct a hybrid model that leverages (a) Convolutional Neural Network (CNN) to deal with spatial information in an image, and (b) Graph Neural Network (GNN) to deal with relational superpixel information in the image. The proposed deep model is learned using a generic hybrid loss function that we call a `hybrid' loss. We evaluate the predictive performance of our proposed hybrid vision model on four popular image classification datasets: MNIST, FMNIST, CIFAR-10 and CIFAR-100. Moreover, we evaluate our method on three real-world classification tasks: COVID-19 X-Ray Detection, LFW Face Recognition, and SOCOFing Fingerprint Identification. The results demonstrate that the relational superpixel information provided via a GNN could improve the performance of standard CNN-based vision systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/04/2019

Visual Tree Convolutional Neural Network in Image Classification

In image classification, Convolutional Neural Network(CNN) models have a...
research
03/01/2022

Graph Neural Network (GNN) in Image and Video Understanding Using Deep Learning for Computer Vision Applications

Graph neural networks (GNNs) is an information - processing system that ...
research
09/28/2022

CompNet: A Designated Model to Handle Combinations of Images and Designed features

Convolutional neural networks (CNNs) are one of the most popular models ...
research
08/01/2015

Towards Distortion-Predictable Embedding of Neural Networks

Current research in Computer Vision has shown that Convolutional Neural ...
research
04/15/2020

Learning Furniture Compatibility with Graph Neural Networks

We propose a graph neural network (GNN) approach to the problem of predi...
research
09/13/2021

Shape-Biased Domain Generalization via Shock Graph Embeddings

There is an emerging sense that the vulnerability of Image Convolutional...
research
05/01/2020

An Adaptive Enhancement Based Hybrid CNN Model for Digital Dental X-ray Positions Classification

Analysis of dental radiographs is an important part of the diagnostic pr...

Please sign up or login with your details

Forgot password? Click here to reset