Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer

09/14/2021
by   Fushun Zhu, et al.
9

We propose a semi-supervised network for wide-angle portraits correction. Wide-angle images often suffer from skew and distortion affected by perspective distortion, especially noticeable at the face regions. Previous deep learning based approaches require the ground-truth correction flow maps for the training guidance. However, such labels are expensive, which can only be obtained manually. In this work, we propose a semi-supervised scheme, which can consume unlabeled data in addition to the labeled data for improvements. Specifically, our semi-supervised scheme takes the advantages of the consistency mechanism, with several novel components such as direction and range consistency (DRC) and regression consistency (RC). Furthermore, our network, named as Multi-Scale Swin-Unet (MS-Unet), is built upon the multi-scale swin transformer block (MSTB), which can learn both local-scale and long-range semantic information effectively. In addition, we introduce a high-quality unlabeled dataset with rich scenarios for the training. Extensive experiments demonstrate that the proposed method is superior over the state-of-the-art methods and other representative baselines.

READ FULL TEXT

page 1

page 3

page 6

page 7

page 10

page 11

page 12

page 13

research
11/22/2022

Progressive Learning with Cross-Window Consistency for Semi-Supervised Semantic Segmentation

Semi-supervised semantic segmentation focuses on the exploration of a sm...
research
04/26/2021

Practical Wide-Angle Portraits Correction with Deep Structured Models

Wide-angle portraits often enjoy expanded views. However, they contain p...
research
07/24/2022

Semi-supervised Deep Multi-view Stereo

Significant progress has been witnessed in learning-based Multi-view Ste...
research
11/20/2019

SSAH: Semi-supervised Adversarial Deep Hashing with Self-paced Hard Sample Generation

Deep hashing methods have been proved to be effective and efficient for ...
research
01/04/2023

Semi-MAE: Masked Autoencoders for Semi-supervised Vision Transformers

Vision Transformer (ViT) suffers from data scarcity in semi-supervised l...
research
11/04/2021

Lexically Aware Semi-Supervised Learning for OCR Post-Correction

Much of the existing linguistic data in many languages of the world is l...

Please sign up or login with your details

Forgot password? Click here to reset