Document dewarping from a distorted camera-captured image is of great va...
We analyze the DETR-based framework on semi-supervised object detection
...
One of the mainstream schemes for 2D human pose estimation (HPE) is lear...
With basic Semi-Supervised Object Detection (SSOD) techniques, one-stage...
Existing methods of multi-person video 3D human Pose and Shape Estimatio...
In this paper, we present StrucTexTv2, an effective document image
pre-t...
In the field of skeleton-based action recognition, current top-performin...
Current domain adaptation methods for face anti-spoofing leverage labele...
Masked image modeling (MIM) learns visual representation by masking and
...
DETR is a novel end-to-end transformer architecture object detector, whi...
We present a strong object detector with encoder-decoder pretraining and...
Recently, transformer-based networks have shown impressive results in
se...
This paper proposes a novel Unified Feature Optimization (UFO) paradigm ...
Freezing the pre-trained backbone has become a standard paradigm to avoi...
In this paper, we present a model pretraining technique, named MaskOCR, ...
Visual appearance is considered to be the most important cue to understa...
Advanced face swapping methods have achieved appealing results. However,...
Structured text understanding on Visually Rich Documents (VRDs) is a cru...
Learning discriminative representation using large-scale face datasets i...
The reading of arbitrarily-shaped text has received increasing research
...
Face attribute editing aims to generate faces with one or multiple desir...
This paper introduces the real image Super-Resolution (SR) challenge tha...
With advancement in deep neural network (DNN), recent state-of-the-art (...
Fast appearance variations and the distractions of similar objects are t...
This paper reviews the NTIRE 2020 challenge on real image denoising with...
Many existing face anti-spoofing (FAS) methods focus on modeling the dec...
Scene text image contains two levels of contents: visual texture and sem...
Current face detectors utilize anchors to frame a multi-task learning pr...
Recent works have made great progress in semantic segmentation by exploi...
Extracting entity from images is a crucial part of many OCR applications...
Most existing text reading benchmarks make it difficult to evaluate the
...
Robust text reading from street view images provides valuable informatio...
This paper reports the ICDAR2019 Robust Reading Challenge on Arbitrary-S...
Video text detection is considered as one of the most difficult tasks in...
Detecting scene text of arbitrary shapes has been a challenging task ove...
In this paper, we are interested in editing text in natural images, whic...
Previous scene text detection methods have progressed substantially over...
With the rapid development of deep convolutional neural network, face
de...
Most text detection methods hypothesize texts are horizontal or
multi-or...
Reading text from images remains challenging due to multi-orientation,
p...
Imagery texts are usually organized as a hierarchy of several visual
ele...