LadleNet: Translating Thermal Infrared Images to Visible Light Images Using A Scalable Two-stage U-Net

08/12/2023
by Tonghui Zou, et al.

Translating thermal infrared (TIR) images to visible light (VI) images is a challenging task with potential applications in domains such as TIR-VI image registration and fusion. Supplementary information derived from TIR image conversion can significantly enhance model performance and generalization across these applications. However, prevailing issues in this field include suboptimal image fidelity and limited model scalability. In this paper, we introduce LadleNet, an algorithm based on the U-Net architecture. LadleNet employs a two-stage U-Net concatenation structure, augmented with skip connections and refined feature aggregation techniques, resulting in a substantial enhancement in model performance. LadleNet comprises 'Handle' and 'Bowl' modules: the Handle module constructs an abstract semantic space, while the Bowl module decodes this semantic space to yield the mapped VI images. The Handle module is extensible: its network architecture can be substituted with a semantic segmentation network, establishing a more abstract semantic space to bolster model performance. Accordingly, we propose LadleNet+, which replaces LadleNet's Handle module with the pre-trained DeepLabv3+ network, giving the model enhanced semantic space construction capabilities. The proposed method is evaluated on the KAIST dataset, with both quantitative and qualitative analyses. Compared to existing methods, our approach achieves state-of-the-art performance in terms of image clarity and perceptual quality. The source code will be made available at https://github.com/Ach-1914/LadleNet/tree/main/.
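As a rough illustration of the two-stage structure described in the abstract, the sketch below traces feature-map shapes through two chained U-Net-style stages, with the first (Handle) producing an intermediate "semantic space" tensor that the second (Bowl) decodes to a 3-channel image. It is not the authors' implementation. The names `UNetSpec`, `trace_unet`, and `trace_ladle`, the channel widths, the network depth, the 16-channel semantic space, and the single-channel TIR input are all assumptions made for illustration; the abstract does not specify any of them.

```python
from dataclasses import dataclass
from typing import List, Tuple

Shape = Tuple[int, int, int]  # (channels, height, width)


@dataclass
class UNetSpec:
    """Hypothetical U-Net description; the widths are illustrative, not from the paper."""
    in_channels: int
    out_channels: int
    widths: Tuple[int, ...] = (32, 64, 128)


def trace_unet(spec: UNetSpec, height: int, width: int) -> List[Tuple[str, Shape]]:
    """Trace feature-map shapes through an encoder-decoder with skip connections."""
    factor = 2 ** (len(spec.widths) - 1)
    if height % factor or width % factor:
        raise ValueError(f"height and width must be divisible by {factor}")

    trace: List[Tuple[str, Shape]] = [("input", (spec.in_channels, height, width))]
    skips: List[Shape] = []
    h, w = height, width

    # Encoder: every level except the deepest stores a skip, then downsamples by 2.
    for i, c in enumerate(spec.widths):
        trace.append((f"enc{i}", (c, h, w)))
        if i < len(spec.widths) - 1:
            skips.append((c, h, w))
            h, w = h // 2, w // 2

    # Decoder: upsample by 2, concatenate the matching skip, then reduce channels.
    c = spec.widths[-1]
    for i in reversed(range(len(spec.widths) - 1)):
        skip_c, skip_h, skip_w = skips[i]
        h, w = h * 2, w * 2
        assert (h, w) == (skip_h, skip_w)
        trace.append((f"dec{i}_concat", (c + skip_c, h, w)))
        c = skip_c
        trace.append((f"dec{i}", (c, h, w)))

    trace.append(("output", (spec.out_channels, h, w)))
    return trace


def trace_ladle(height: int, width: int, semantic_channels: int = 16):
    """Chain two stages: Handle (TIR -> semantic space) and Bowl (semantic space -> VI)."""
    handle = UNetSpec(in_channels=1, out_channels=semantic_channels)  # assumed 1-channel TIR
    bowl = UNetSpec(in_channels=semantic_channels, out_channels=3)    # assumed RGB output
    return trace_unet(handle, height, width), trace_unet(bowl, height, width)


if __name__ == "__main__":
    handle_trace, bowl_trace = trace_ladle(64, 64)
    for stage, trace in (("Handle", handle_trace), ("Bowl", bowl_trace)):
        for name, shape in trace:
            print(f"{stage:6s} {name:12s} {shape}")
```

In this sketch, swapping the Handle for a different network (as LadleNet+ does with DeepLabv3+) would only require that the replacement emit a tensor whose shape matches the Bowl's expected input.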


