Borrow from Anywhere: Pseudo Multi-modal Object Detection in Thermal Imagery

05/21/2019
by Chaitanya Devaguptapu, et al.

Can we improve detection in the thermal domain by borrowing features from rich domains like visual RGB? In this paper, we propose a pseudo-multimodal object detector trained on natural image domain data to help improve the performance of object detection in thermal images. We assume access to a large-scale dataset in the visual RGB domain and a relatively smaller dataset (in terms of instances) in the thermal domain, as is common today. We propose using well-known image-to-image translation frameworks to generate pseudo-RGB equivalents of a given thermal image, and then using a multi-modal architecture for object detection in the thermal image. We show that our framework outperforms existing benchmarks without the explicit need for paired training examples from the two domains. We also show that our framework can learn from less thermal-domain data.
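The data flow described in the abstract (thermal image, then a pseudo-RGB equivalent, then a multi-modal detector input) can be illustrated with a minimal sketch. Everything named here is hypothetical: `placeholder_thermal_to_rgb` merely stands in for a trained image-to-image translation model, and the fusion step is shown as plain channel concatenation, which is an assumption for illustration and not necessarily how the paper's architecture combines the two modalities.

```python
from typing import Callable, List

# An image as nested lists: height x width x channels.
Image = List[List[List[float]]]


def placeholder_thermal_to_rgb(thermal: Image) -> Image:
    """Hypothetical stand-in for a trained image-to-image translation model.

    It only replicates the single thermal channel three times so that the
    rest of the pipeline has a 3-channel "pseudo-RGB" image to work with.
    """
    return [[[px[0], px[0], px[0]] for px in row] for row in thermal]


def fuse_channels(a: Image, b: Image) -> Image:
    """Concatenate two images of equal spatial size along the channel axis.

    Assumption: simple concatenation is used here purely for illustration.
    """
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("images must have the same height and width")
    return [[pa + pb for pa, pb in zip(ra, rb)] for ra, rb in zip(a, b)]


def pseudo_multimodal_input(
    thermal: Image,
    translate: Callable[[Image], Image] = placeholder_thermal_to_rgb,
) -> Image:
    """Build a two-modality input from a thermal image alone.

    No paired RGB image is needed at this stage: the second modality is
    generated from the thermal image itself.
    """
    pseudo_rgb = translate(thermal)
    return fuse_channels(thermal, pseudo_rgb)


# A 2x2 single-channel thermal image with made-up intensities.
thermal_image = [[[0.5], [0.2]], [[0.1], [0.9]]]
fused = pseudo_multimodal_input(thermal_image)
```

In a real system, `translate` would be replaced by a learned translation model and the fused representation would be passed to a detector; both are outside the scope of this sketch.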


