LoLep: Single-View View Synthesis with Locally-Learned Planes and Self-Attention Occlusion Inference

07/23/2023
by   Cong Wang, et al.
0

We propose a novel method, LoLep, which regresses Locally-Learned planes from a single RGB image to represent scenes accurately, thus generating better novel views. Without the depth information, regressing appropriate plane locations is a challenging problem. To solve this issue, we pre-partition the disparity space into bins and design a disparity sampler to regress local offsets for multiple planes in each bin. However, only using such a sampler makes the network not convergent; we further propose two optimizing strategies that combine with different disparity distributions of datasets and propose an occlusion-aware reprojection loss as a simple yet effective geometric supervision technique. We also introduce a self-attention mechanism to improve occlusion inference and present a Block-Sampling Self-Attention (BS-SA) module to address the problem of applying self-attention to large feature maps. We demonstrate the effectiveness of our approach and generate state-of-the-art results on different datasets. Compared to MINE, our approach has an LPIPS reduction of 4.8 performance on real-world images and demonstrate the benefits.

READ FULL TEXT

page 2

page 7

page 8

research
09/13/2022

Switchable Self-attention Module

Attention mechanism has gained great success in vision recognition. Many...
research
05/25/2023

Cross-view Action Recognition Understanding From Exocentric to Egocentric Perspective

Understanding action recognition in egocentric videos has emerged as a v...
research
01/20/2023

Unsupervised Light Field Depth Estimation via Multi-view Feature Matching with Occlusion Prediction

Depth estimation from light field (LF) images is a fundamental step for ...
research
05/28/2023

OccCasNet: Occlusion-aware Cascade Cost Volume for Light Field Depth Estimation

Light field (LF) depth estimation is a crucial task with numerous practi...
research
07/02/2021

Cross-view Geo-localization with Evolving Transformer

In this work, we address the problem of cross-view geo-localization, whi...
research
04/14/2021

Weakly But Deeply Supervised Occlusion-Reasoned Parametric Layouts

We propose an end-to-end network that takes a single perspective RGB ima...
research
07/01/2022

ChrSNet: Chromosome Straightening using Self-attention Guided Networks

Karyotyping is an important procedure to assess the possible existence o...

Please sign up or login with your details

Forgot password? Click here to reset