Approximate Query Matching for Image Retrieval

03/14/2018
by   Abhijit Suprem, et al.
0

Traditional image recognition involves identifying the key object in a portrait-type image with a single object focus (ILSVRC, AlexNet, and VGG). More recent approaches consider dense image recognition - segmenting an image with appropriate bounding boxes and performing image recognition within these bounding boxes (Semantic segmentation). The Visual Genome dataset [5] is an attempt to bridge these various approaches to a cohesive dataset for each subtask - bounding box generation, image recognition, captioning, and a new operation: scene graph generation. Our focus is on using such scene graphs to perform graph search on image databases to holistically retrieve images based on a search criteria. We develop a method to store scene graphs and metadata in graph databases (using Neo4J) and to perform fast approximate retrieval of images based on a graph search query. We process more complex queries than single object search, e.g. "girl eating cake" retrieves images that contain the specified relation as well as variations.

READ FULL TEXT

page 1

page 2

page 6

page 8

page 9

page 11

research
07/22/2022

Panoptic Scene Graph Generation

Existing research addresses scene graph generation (SGG) – a critical te...
research
07/25/2022

Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning

Text detection and recognition are essential components of a modern OCR ...
research
01/08/2020

Weakly Supervised Visual Semantic Parsing

Scene Graph Generation (SGG) aims to extract entities, predicates and th...
research
11/30/2022

SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation

Scene understanding is an essential and challenging task in computer vis...
research
08/27/2018

Single Shot Scene Text Retrieval

Textual information found in scene images provides high level semantic i...
research
11/28/2016

Generating Holistic 3D Scene Abstractions for Text-based Image Retrieval

Spatial relationships between objects provide important information for ...
research
08/01/2018

Real-time image-based instrument classification for laparoscopic surgery

During laparoscopic surgery, context-aware assistance systems aim to all...

Please sign up or login with your details

Forgot password? Click here to reset