SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded Network

01/17/2023
by   Dongseok Shim, et al.
0

Monocular depth estimation plays a critical role in various computer vision and robotics applications such as localization, mapping, and 3D object detection. Recently, learning-based algorithms achieve huge success in depth estimation by training models with a large amount of data in a supervised manner. However, it is challenging to acquire dense ground truth depth labels for supervised training, and the unsupervised depth estimation using monocular sequences emerges as a promising alternative. Unfortunately, most studies on unsupervised depth estimation explore loss functions or occlusion masks, and there is little change in model architecture in that ConvNet-based encoder-decoder structure becomes a de-facto standard for depth estimation. In this paper, we employ a convolution-free Swin Transformer as an image feature extractor so that the network can capture both local geometric features and global semantic features for depth estimation. Also, we propose a Densely Cascaded Multi-scale Network (DCMNet) that connects every feature map directly with another from different scales via a top-down cascade pathway. This densely cascaded connectivity reinforces the interconnection between decoding layers and produces high-quality multi-scale depth outputs. The experiments on two different datasets, KITTI and Make3D, demonstrate that our proposed method outperforms existing state-of-the-art unsupervised algorithms.

READ FULL TEXT

page 1

page 3

page 6

research
01/19/2020

FIS-Nets: Full-image Supervised Networks for Monocular Depth Estimation

This paper addresses the importance of full-image supervision for monocu...
research
01/19/2022

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth

Depth estimation from a single image is an important task that can be ap...
research
04/11/2022

HiMODE: A Hybrid Monocular Omnidirectional Depth Estimation Model

Monocular omnidirectional depth estimation is receiving considerable res...
research
07/28/2021

Unsupervised Monocular Depth Estimation in Highly Complex Environments

Previous unsupervised monocular depth estimation methods mainly focus on...
research
03/30/2020

DeFeat-Net: General Monocular Depth via Simultaneous Unsupervised Representation Learning

In the current monocular depth research, the dominant approach is to emp...
research
07/24/2019

From Big to Small: Multi-Scale Local Planar Guidance for Monocular Depth Estimation

Estimating accurate depth from a single image is challenging, because it...
research
07/25/2017

Relative Depth Order Estimation Using Multi-scale Densely Connected Convolutional Networks

We study the problem of estimating the relative depth order of point pai...

Please sign up or login with your details

Forgot password? Click here to reset