Natural Language Person Search Using Deep Reinforcement Learning

09/02/2018
by Ankit Shah, et al.

A recent success in deep reinforcement learning is an agent that learned to play Go and beat the world champion without any prior knowledge of the game. In that task, the agent decides which action to take based on the positions of the pieces. Person search has recently been explored using natural language descriptions of images for video surveillance applications (S. Li et al.). Fu et al. provide an end-to-end approach to object-based retrieval using deep reinforcement learning, without constraints on which objects are detected. However, we believe that for real-world applications such as person search, defining specific constraints that identify a person, as opposed to starting from general object detection, will bring benefits in performance and in the computational resources required. In our task, deep reinforcement learning localizes the person in an image by reshaping the bounding box. With appropriate constraints, the agent looks only for the relevant person in the image, as opposed to an unconstrained approach in which every object in the image is ranked. For person search, the agent tries to form a tight bounding box around the person in the image who matches the description. The bounding box is initialized to the full image, and at each time step the agent decides how to change the current bounding box so that it bounds the person more tightly, based on the description of the person and the pixel values of the current bounding box. After the agent takes an action, it receives a reward based on the Intersection over Union (IoU) of the current bounding box and the ground-truth box. Once the agent believes that the bounding box covers the person, it indicates that the person has been found.
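
To make the IoU-based reward concrete, here is a minimal Python sketch of one environment step. The abstract only says that the reward is "based on" IoU and that the agent reshapes the box, so the specifics below are assumptions for illustration rather than the authors' method: the four shrink actions, the 10% step size, the `(x_min, y_min, x_max, y_max)` box layout, and the use of the change in IoU as the reward.

```python
from typing import Tuple

# Assumed box layout: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over Union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def apply_action(box: Box, action: str, step: float = 0.1) -> Box:
    """Shrink the box from one side by a fraction of its current size.

    The action names and the step size are hypothetical; the abstract
    does not list the action set.
    """
    x0, y0, x1, y1 = box
    w, h = x1 - x0, y1 - y0
    if action == "shrink_left":
        x0 += step * w
    elif action == "shrink_right":
        x1 -= step * w
    elif action == "shrink_top":
        y0 += step * h
    elif action == "shrink_bottom":
        y1 -= step * h
    else:
        raise ValueError(f"unknown action: {action}")
    return (x0, y0, x1, y1)


def step_reward(prev_box: Box, new_box: Box, gt_box: Box) -> float:
    """Assumed reward: change in IoU with the ground-truth box."""
    return iou(new_box, gt_box) - iou(prev_box, gt_box)


# The box starts as the full image; the person occupies the right half.
full_image: Box = (0.0, 0.0, 100.0, 100.0)
ground_truth: Box = (50.0, 0.0, 100.0, 100.0)
tighter = apply_action(full_image, "shrink_left")
reward = step_reward(full_image, tighter, ground_truth)
```

In this example the person is in the right half of the image, so shrinking the box from the left raises the IoU and yields a positive reward, while shrinking from the right would yield a negative one.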

research
03/22/2017

An End-to-End Approach to Natural Language Object Retrieval via Context-Aware Deep Reinforcement Learning

We propose an end-to-end approach to the natural language object retriev...
research
10/15/2018

Multi-Stage Reinforcement Learning For Object Detection

We present a reinforcement learning approach for detecting objects withi...
research
01/24/2018

Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents

Psychlab is a simulated psychology laboratory inside the first-person 3D...
research
10/31/2019

Visual Appearance Based Person Retrieval in Unconstrained Environment Videos

Visual appearance-based person retrieval is a challenging problem in sur...
research
10/24/2018

Resolving Referring Expressions in Images With Labeled Elements

Images may have elements containing text and a bounding box associated w...
research
03/27/2023

Intersection over Union with smoothing for bounding box regression

We focus on the construction of a loss function for the bounding box reg...
research
05/24/2017

Attention-based Natural Language Person Retrieval

Following the recent progress in image classification and captioning usi...