Visual-Thermal Camera Dataset Release and Multi-Modal Alignment without Calibration Information

12/29/2020
by   Frank Mascarich, et al.
31

This report accompanies a dataset release on visual and thermal camera data and details a procedure followed to align such multi-modal camera frames in order to provide pixel-level correspondence between the two without using intrinsic or extrinsic calibration information. To achieve this goal we benefit from progress in the domain of multi-modal image alignment and specifically employ the Mattes Mutual Information Metric to guide the registration process. In the released dataset we release both the raw visual and thermal camera data, as well as the aligned frames, alongside calibration parameters with the goal to better facilitate the investigation on common local/global features across such multi-modal image streams.

READ FULL TEXT

page 4

page 5

page 6

page 7

research
05/21/2019

Borrow from Anywhere: Pseudo Multi-modal Object Detection in Thermal Imagery

Can we improve detection in the thermal domain by borrowing features fro...
research
09/28/2021

Targetless Extrinsic Calibration of Stereo Cameras, Thermal Cameras, and Laser Sensors in the Wild

The fusion of multi-modal sensors has become increasingly popular in aut...
research
09/27/2021

OpenViDial 2.0: A Larger-Scale, Open-Domain Dialogue Generation Dataset with Visual Contexts

In order to better simulate the real human conversation process, models ...
research
09/06/2023

MAD: Modality Agnostic Distance Measure for Image Registration

Multi-modal image registration is a crucial pre-processing step in many ...
research
08/24/2022

Modeling Paragraph-Level Vision-Language Semantic Alignment for Multi-Modal Summarization

Most current multi-modal summarization methods follow a cascaded manner,...
research
09/08/2023

WiSARD: A Labeled Visual and Thermal Image Dataset for Wilderness Search and Rescue

Sensor-equipped unoccupied aerial vehicles (UAVs) have the potential to ...
research
04/17/2023

Pretrained Language Models as Visual Planners for Human Assistance

To make progress towards multi-modal AI assistants which can guide users...

Please sign up or login with your details

Forgot password? Click here to reset