ROMA: Cross-Domain Region Similarity Matching for Unpaired Nighttime Infrared to Daytime Visible Video Translation

04/26/2022
by   Zhenjie Yu, et al.
3

Infrared cameras are often utilized to enhance the night vision since the visible light cameras exhibit inferior efficacy without sufficient illumination. However, infrared data possesses inadequate color contrast and representation ability attributed to its intrinsic heat-related imaging principle. This makes it arduous to capture and analyze information for human beings, meanwhile hindering its application. Although, the domain gaps between unpaired nighttime infrared and daytime visible videos are even huger than paired ones that captured at the same time, establishing an effective translation mapping will greatly contribute to various fields. In this case, the structural knowledge within nighttime infrared videos and semantic information contained in the translated daytime visible pairs could be utilized simultaneously. To this end, we propose a tailored framework ROMA that couples with our introduced cRoss-domain regiOn siMilarity mAtching technique for bridging the huge gaps. To be specific, ROMA could efficiently translate the unpaired nighttime infrared videos into fine-grained daytime visible ones, meanwhile maintain the spatiotemporal consistency via matching the cross-domain region similarity. Furthermore, we design a multiscale region-wise discriminator to distinguish the details from synthesized visible results and real references. Extensive experiments and evaluations for specific applications indicate ROMA outperforms the state-of-the-art methods. Moreover, we provide a new and challenging dataset encouraging further research for unpaired nighttime infrared and daytime visible video translation, named InfraredCity. In particular, it consists of 9 long video clips including City, Highway and Monitor scenarios. All clips could be split into 603,142 frames in total, which are 20 times larger than the recently released daytime infrared-to-visible dataset IRVI.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 9

page 10

research
08/02/2021

I2V-GAN: Unpaired Infrared-to-Visible Video Translation

Human vision is often adversely affected by complex environmental factor...
research
11/07/2022

Cross-Domain Local Characteristic Enhanced Deepfake Video Detection

As ultra-realistic face forgery techniques emerge, deepfake detection ha...
research
03/25/2023

Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-identification

For the visible-infrared person re-identification (VIReID) task, one of ...
research
04/14/2018

Video2Shop: Exactly Matching Clothes in Videos to Online Shopping Images

In recent years, both online retail and video hosting service are expone...
research
05/02/2020

Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images

In recent years, both online retail and video hosting service have been ...
research
05/13/2016

Cross-Domain Visual Matching via Generalized Similarity Measure and Feature Learning

Cross-domain visual data matching is one of the fundamental problems in ...
research
06/02/2003

On multiple connectedness of regions visible due to multiple diffuse reflections

It is known that the region V(s) of a simple polygon P, directly visible...

Please sign up or login with your details

Forgot password? Click here to reset