DMODE: Differential Monocular Object Distance Estimation Module without Class Specific Information

10/23/2022
by Pedram Agand, et al.

Estimating object distances with a single camera costs less than stereo vision or LiDAR. Although monocular distance estimation has been studied in the literature, previous methods mostly rely on knowing an object's class in some way. This can degrade performance on datasets with multi-class objects and on objects with an undefined class. In this paper, we aim to overcome the potential downsides of class-specific approaches and provide an alternative technique, called DMODE, that does not require any information about an object's class. Using a differential approach, we combine the change in an object's size over time with the camera's motion to estimate the object's distance. Because DMODE is class-agnostic, it maintains performance across different object detectors and adapts easily to new object classes and environments. We tested our model across different training and testing scenarios on the KITTI MOTS dataset's ground-truth bounding box annotations and on the bounding box outputs of TrackRCNN and EagerMOT. The instantaneous changes in bounding box size and camera position are used to obtain an object's position in 3D without relying on its detection source or class properties. Our results show that DMODE outperforms traditional alternative methods, e.g. IPM <cit.>, SVR <cit.>, and <cit.>, in test environments with multi-class object distance detections.
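
To make the differential idea concrete, here is a minimal geometric sketch. It is not the DMODE model itself. It assumes an ideal pinhole camera, a static object, and a camera that moves straight toward the object by a known amount between two frames. Under those assumptions the apparent size is inversely proportional to distance, so size_prev * d_prev = size_curr * d_curr and d_prev = d_curr + camera_advance, and the unknown object height and focal length cancel out. The function name and parameters are hypothetical and chosen for illustration only.

```python
def distance_from_size_change(size_prev, size_curr, camera_advance):
    """Estimate the current distance to an object from two observations.

    Assumptions (illustrative, not taken from the paper):
      - pinhole camera, so apparent size is proportional to 1 / distance
      - the object is static and its true size is unchanged
      - the camera moved `camera_advance` metres toward the object
        between the previous and the current frame

    size_prev, size_curr: bounding box height (pixels) in each frame
    Returns the distance (metres) at the current frame.
    """
    delta = size_curr - size_prev
    if abs(delta) < 1e-9:
        # No measurable size change: the distance is unobservable here.
        raise ValueError("bounding box size did not change between frames")
    # From size_prev * (d_curr + camera_advance) = size_curr * d_curr
    return size_prev * camera_advance / delta


if __name__ == "__main__":
    # Synthetic check: focal length 700 px, object height 1.5 m,
    # camera moves from 20 m to 18 m away from the object.
    focal_px, height_m = 700.0, 1.5
    s_prev = focal_px * height_m / 20.0
    s_curr = focal_px * height_m / 18.0
    print(distance_from_size_change(s_prev, s_curr, camera_advance=2.0))
```

Note that neither the object's class nor its physical size appears in the formula, which is the property the abstract emphasizes. A real system would also have to handle moving objects, noisy boxes, and lateral camera motion, none of which this sketch covers.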

