
-
FaceController: Controllable Attribute Editing for Face in the Wild
Face attribute editing aims to generate faces with one or multiple desir...
read it
-
Understanding Image Retrieval Re-Ranking: A Graph Neural Network Perspective
The re-ranking approach leverages high-confidence retrieved samples to r...
read it
-
Coherent Loss: A Generic Framework for Stable Video Segmentation
Video segmentation approaches are of great importance for numerous visio...
read it
-
LID 2020: The Learning from Imperfect Data Challenge Results
Learning from imperfect data becomes an issue in many industrial applica...
read it
-
HS-ResNet: Hierarchical-Split Block on Convolutional Neural Network
This paper addresses representational block named Hierarchical-Split Blo...
read it
-
Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching
Discriminatively localizing sounding objects in cocktail-party, i.e., mi...
read it
-
AIM 2020 Challenge on Real Image Super-Resolution: Methods and Results
This paper introduces the real image Super-Resolution (SR) challenge tha...
read it
-
Real Image Super Resolution Via Heterogeneous Model using GP-NAS
With advancement in deep neural network (DNN), recent state-of-the-art (...
read it
-
Learning Global Structure Consistency for Robust Object Tracking
Fast appearance variations and the distractions of similar objects are t...
read it
-
PP-YOLO: An Effective and Efficient Implementation of Object Detector
Object detection is one of the most important areas in computer vision, ...
read it
-
Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement
Recently, most of the state-of-the-art human pose estimation methods are...
read it
-
Segment as Points for Efficient Online Multi-Object Tracking and Segmentation
Current multi-object tracking and segmentation (MOTS) methods follow the...
read it
-
PointTrack++ for Effective Online Multi-Object Tracking and Segmentation
Multiple-object tracking and segmentation (MOTS) is a novel computer vis...
read it
-
Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection
Object detection from 3D point clouds remains a challenging task, though...
read it
-
NTIRE 2020 Challenge on Real Image Denoising: Dataset, Methods and Results
This paper reviews the NTIRE 2020 challenge on real image denoising with...
read it
-
Learning Generalized Spoof Cues for Face Anti-spoofing
Many existing face anti-spoofing (FAS) methods focus on modeling the dec...
read it
-
Towards Accurate Scene Text Recognition with Semantic Reasoning Networks
Scene text image contains two levels of contents: visual texture and sem...
read it
-
ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection
3D object detection is an essential task in autonomous driving and robot...
read it
-
HAMBox: Delving into Online High-quality Anchors Mining for Detecting Outer Faces
Current face detectors utilize anchors to frame a multi-task learning pr...
read it
-
Dynamic Instance Normalization for Arbitrary Style Transfer
Prior normalization methods rely on affine transformations to produce ar...
read it
-
ACFNet: Attentional Class Feature Network for Semantic Segmentation
Recent works have made great progress in semantic segmentation by exploi...
read it
-
EATEN: Entity-aware Attention for Single Shot Visual Text Extraction
Extracting entity from images is a crucial part of many OCR applications...
read it
-
Chinese Street View Text: Large-scale Chinese Text Reading with Partially Supervised Learning
Most existing text reading benchmarks make it difficult to evaluate the ...
read it
-
ICDAR 2019 Competition on Large-scale Street View Text with Partial Labeling – RRC-LSVT
Robust text reading from street view images provides valuable informatio...
read it
-
ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT)
This paper reports the ICDAR2019 Robust Reading Challenge on Arbitrary-S...
read it
-
Perspective-Guided Convolution Networks for Crowd Counting
In this paper, we propose a novel perspective-guided convolution (PGC) f...
read it
-
Image Inpainting with Learnable Bidirectional Attention Maps
Most convolutional network (CNN)-based inpainting methods adopt standard...
read it
-
An End-to-end Video Text Detector with Online Tracking
Video text detection is considered as one of the most difficult tasks in...
read it
-
A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning
Detecting scene text of arbitrary shapes has been a challenging task ove...
read it
-
Editing Text in the Wild
In this paper, we are interested in editing text in natural images, whic...
read it
-
BMN: Boundary-Matching Network for Temporal Action Proposal Generation
Temporal action proposal generation is an challenging and promising task...
read it
-
STGAN: A Unified Selective Transfer Network for Arbitrary Image Attribute Editing
Arbitrary attribute editing generally can be tackled by incorporating en...
read it
-
Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes
Previous scene text detection methods have progressed substantially over...
read it
-
Detecting Text in the Wild with Deep Character Embedding Network
Most text detection methods hypothesize texts are horizontal or multi-or...
read it
-
TextNet: Irregular Text Reading from Images with an End-to-End Trainable Network
Reading text from images remains challenging due to multi-orientation, p...
read it
-
Compact Generalized Non-local Network
The non-local module is designed for capturing long-range spatio-tempora...
read it
-
Fine-grained Video Categorization with Redundancy Reduction Attention
For fine-grained categorization tasks, videos could serve as a better so...
read it
-
Multi-Attention Multi-Class Constraint for Fine-grained Image Recognition
Attention-based learning for fine-grained image recognition remains a ch...
read it
-
3D Pose Estimation for Fine-Grained Object Categories
Existing object pose estimation datasets are related to generic object t...
read it
-
WordSup: Exploiting Word Annotations for Character based Text Detection
Imagery texts are usually organized as a hierarchy of several visual ele...
read it
-
Localizing by Describing: Attribute-Guided Attention Localization for Fine-Grained Recognition
A key challenge in fine-grained recognition is how to find and represent...
read it