Junyu Han

research

∙ 07/24/2023

MataDoc: Margin and Text Aware Document Dewarping for Arbitrary Boundary

Document dewarping from a distorted camera-captured image is of great va...

0 Beiya Dai, et al. ∙

research

∙ 07/16/2023

Semi-DETR: Semi-Supervised Object Detection with Detection Transformers

We analyze the DETR-based framework on semi-supervised object detection ...

1 Jiacheng Zhang, et al. ∙

research

∙ 06/29/2023

Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation

One of the mainstream schemes for 2D human pose estimation (HPE) is lear...

0 Zhongwei Qiu, et al. ∙

research

∙ 03/27/2023

Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection

With basic Semi-Supervised Object Detection (SSOD) techniques, one-stage...

0 Chang Liu, et al. ∙

research

∙ 03/16/2023

PSVT: End-to-End Multi-person 3D Pose and Shape Estimation with Progressive Video Transformers

Existing methods of multi-person video 3D human Pose and Shape Estimatio...

0 Zhongwei Qiu, et al. ∙

research

∙ 03/01/2023

StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training

In this paper, we present StrucTexTv2, an effective document image pre-t...

0 Yuechen Yu, et al. ∙

research

∙ 01/26/2023

Graph Contrastive Learning for Skeleton-based Action Recognition

In the field of skeleton-based action recognition, current top-performin...

0 Xiaohu Huang, et al. ∙

research

∙ 12/07/2022

Cyclically Disentangled Feature Translation for Face Anti-spoofing

Current domain adaptation methods for face anti-spoofing leverage labele...

0 Haixiao Yue, et al. ∙

research

∙ 11/17/2022

CAE v2: Context Autoencoder with CLIP Target

Masked image modeling (MIM) learns visual representation by masking and ...

0 Xinyu Zhang, et al. ∙

research

∙ 11/15/2022

Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling

DETR is a novel end-to-end transformer architecture object detector, whi...

0 Yu Wang, et al. ∙

research

∙ 11/07/2022

Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining

We present a strong object detector with encoder-decoder pretraining and...

0 Qiang Chen, et al. ∙

research

∙ 10/13/2022

RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer

Recently, transformer-based networks have shown impressive results in se...

0 Jian Wang, et al. ∙

research

∙ 07/21/2022

UFO: Unified Feature Optimization

This paper proposes a novel Unified Feature Optimization (UFO) paradigm ...

0 Teng Xi, et al. ∙

research

∙ 06/13/2022

Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning

Freezing the pre-trained backbone has become a standard paradigm to avoi...

10 Yanpeng Sun, et al. ∙

research

∙ 06/01/2022

MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining

In this paper, we present a model pretraining technique, named MaskOCR, ...

0 Pengyuan Lyu, et al. ∙

research

∙ 03/31/2022

ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval

Visual appearance is considered to be the most important cue to understa...

0 Mengjun Cheng, et al. ∙

research

∙ 01/11/2022

MobileFaceSwap: A Lightweight Framework for Video Face Swapping

Advanced face swapping methods have achieved appealing results. However,...

10 Zhiliang Xu, et al. ∙

research

∙ 08/06/2021

StrucTexT: Structured Text Understanding with Multi-Modal Transformers

Structured text understanding on Visually Rich Documents (VRDs) is a cru...

0 Yulin Li, et al. ∙

research

∙ 05/24/2021

Dynamic Class Queue for Large Scale Face Recognition In the Wild

Learning discriminative representation using large-scale face datasets i...

0 Bi Li, et al. ∙

research

∙ 04/12/2021

PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network

The reading of arbitrarily-shaped text has received increasing research ...

0 Pengfei Wang, et al. ∙

research

∙ 02/23/2021

FaceController: Controllable Attribute Editing for Face in the Wild

Face attribute editing aims to generate faces with one or multiple desir...

7 Zhiliang Xu, et al. ∙

research

∙ 09/25/2020

AIM 2020 Challenge on Real Image Super-Resolution: Methods and Results

This paper introduces the real image Super-Resolution (SR) challenge tha...

7 Pengxu Wei, et al. ∙

research

∙ 09/02/2020

Real Image Super Resolution Via Heterogeneous Model using GP-NAS

With advancement in deep neural network (DNN), recent state-of-the-art (...

10 Zhihong Pan, et al. ∙

research

∙ 08/26/2020

Learning Global Structure Consistency for Robust Object Tracking

Fast appearance variations and the distractions of similar objects are t...

0 Bi Li, et al. ∙

research

∙ 05/08/2020

NTIRE 2020 Challenge on Real Image Denoising: Dataset, Methods and Results

This paper reviews the NTIRE 2020 challenge on real image denoising with...

25 Abdelrahman Abdelhamed, et al. ∙

research

∙ 05/08/2020

Learning Generalized Spoof Cues for Face Anti-spoofing

Many existing face anti-spoofing (FAS) methods focus on modeling the dec...

0 Haocheng Feng, et al. ∙

research

∙ 03/27/2020

Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

Scene text image contains two levels of contents: visual texture and sem...

0 Deli Yu, et al. ∙

research

∙ 12/19/2019

HAMBox: Delving into Online High-quality Anchors Mining for Detecting Outer Faces

Current face detectors utilize anchors to frame a multi-task learning pr...

2 Yang Liu, et al. ∙

research

∙ 09/20/2019

ACFNet: Attentional Class Feature Network for Semantic Segmentation

Recent works have made great progress in semantic segmentation by exploi...

42 Fan Zhang, et al. ∙

research

∙ 09/20/2019

EATEN: Entity-aware Attention for Single Shot Visual Text Extraction

Extracting entity from images is a crucial part of many OCR applications...

13 He Guo, et al. ∙

research

∙ 09/17/2019

Chinese Street View Text: Large-scale Chinese Text Reading with Partially Supervised Learning

Most existing text reading benchmarks make it difficult to evaluate the ...

8 Yipeng Sun, et al. ∙

research

∙ 09/17/2019

ICDAR 2019 Competition on Large-scale Street View Text with Partial Labeling – RRC-LSVT

Robust text reading from street view images provides valuable informatio...

6 Yipeng Sun, et al. ∙

research

∙ 09/16/2019

ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT)

This paper reports the ICDAR2019 Robust Reading Challenge on Arbitrary-S...

4 Chee Kheng Chng, et al. ∙

research

∙ 08/20/2019

An End-to-end Video Text Detector with Online Tracking

Video text detection is considered as one of the most difficult tasks in...

2 Hongyuan Yu, et al. ∙

research

∙ 08/15/2019

A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning

Detecting scene text of arbitrary shapes has been a challenging task ove...

4 Pengfei Wang, et al. ∙

research

∙ 08/08/2019

Editing Text in the Wild

In this paper, we are interested in editing text in natural images, whic...

4 Liang Wu, et al. ∙

research

∙ 04/13/2019

Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes

Previous scene text detection methods have progressed substantially over...

0 Chengquan Zhang, et al. ∙

research

∙ 03/31/2019

PyramidBox++: High Performance Detector for Finding Tiny Face

With the rapid development of deep convolutional neural network, face de...

0 Zhihang Li, et al. ∙

research

∙ 01/02/2019

Detecting Text in the Wild with Deep Character Embedding Network

Most text detection methods hypothesize texts are horizontal or multi-or...

0 Jiaming Liu, et al. ∙

research

∙ 12/24/2018

TextNet: Irregular Text Reading from Images with an End-to-End Trainable Network

Reading text from images remains challenging due to multi-orientation, p...

0 Yipeng Sun, et al. ∙

research

∙ 08/22/2017

WordSup: Exploiting Word Annotations for Character based Text Detection

Imagery texts are usually organized as a hierarchy of several visual ele...

0 Han Hu, et al. ∙


