Recent Trends in 2D Object Detection and Applications in Video Event Recognition

02/07/2022
by   Prithwish Jana, et al.
13

Object detection serves as a significant step in improving performance of complex downstream computer vision tasks. It has been extensively studied for many years now and current state-of-the-art 2D object detection techniques proffer superlative results even in complex images. In this chapter, we discuss the geometry-based pioneering works in object detection, followed by the recent breakthroughs that employ deep learning. Some of these use a monolithic architecture that takes a RGB image as input and passes it to a feed-forward ConvNet or vision Transformer. These methods, thereby predict class-probability and bounding-box coordinates, all in a single unified pipeline. Two-stage architectures on the other hand, first generate region proposals and then feed it to a CNN to extract features and predict object category and bounding-box. We also elaborate upon the applications of object detection in video event recognition, to achieve better fine-grained video classification performance. Further, we highlight recent datasets for 2D object detection both in images and videos, and present a comparative performance summary of various state-of-the-art object detection techniques.

READ FULL TEXT

page 2

page 3

page 7

page 10

research
05/30/2019

Hierarchical Structure and Joint Training for Large Scale Semi-supervised Object Detection

Generic object detection is one of the most fundamental and important pr...
research
03/14/2017

Geometry-Based Region Proposals for Real-Time Robot Detection of Tabletop Objects

We present a novel object detection pipeline for localization and recogn...
research
03/17/2015

3D Object Class Detection in the Wild

Object class detection has been a synonym for 2D bounding box localizati...
research
06/10/2020

Object Detection in the DCT Domain: is Luminance the Solution?

Object detection in images has reached unprecedented performances. The s...
research
01/05/2022

Multi-Grid Redundant Bounding Box Annotation for Accurate Object Detection

Modern leading object detectors are either two-stage or one-stage networ...
research
04/22/2019

Detecting retail products in situ using CNN without human effort labeling

CNN is a powerful tool for many computer vision tasks, achieving much be...
research
11/07/2018

DOD-CNN: Doubly-injecting Object Information for Event Recognition

Recognizing an event in an image can be enhanced by detecting relevant o...

Please sign up or login with your details

Forgot password? Click here to reset