
-
Interactive Reinforcement Learning for Feature Selection with Decision Tree in the Loop
We study the problem of balancing effectiveness and efficiency in automa...
read it
-
ACFNet: Attentional Class Feature Network for Semantic Segmentation
Recent works have made great progress in semantic segmentation by exploi...
read it
-
Advanced Variations of Two-Dimensional Principal Component Analysis for Face Recognition
The two-dimensional principal component analysis (2DPCA) has become one ...
read it
-
CSPN++: Learning Context and Resource Aware Convolutional Spatial Propagation Networks for Depth Completion
Depth Completion deals with the problem of converting a sparse depth map...
read it
-
NTIRE 2020 Challenge on Real Image Denoising: Dataset, Methods and Results
This paper reviews the NTIRE 2020 challenge on real image denoising with...
read it
-
Graph Analysis and Graph Pooling in the Spatial Domain
The spatial convolution layer which is widely used in the Graph Neural N...
read it
-
Robust Invisible Hyperlinks in Physical Photographs Based on 3D Rendering Attacks
In the era of multimedia and Internet, people are eager to obtain inform...
read it
-
CASIA-SURF: A Large-scale Multi-modal Benchmark for Face Anti-spoofing
Face anti-spoofing is essential to prevent face recognition systems from...
read it
-
AGAN: Towards Automated Design of Generative Adversarial Networks
Recent progress in Generative Adversarial Networks (GANs) has shown prom...
read it
-
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition
Video Recognition has drawn great research interest and great progress h...
read it
-
EATEN: Entity-aware Attention for Single Shot Visual Text Extraction
Extracting entity from images is a crucial part of many OCR applications...
read it
-
PP-OCR: A Practical Ultra Lightweight OCR System
The Optical Character Recognition (OCR) systems have been widely used in...
read it
-
Cross-modality Person re-identification with Shared-Specific Feature Transfer
Cross-modality person re-identification (cm-ReID) is a challenging but k...
read it
-
Towards Accurate Knowledge Transfer via Target-awareness Representation Disentanglement
Fine-tuning deep neural networks pre-trained on large scale datasets is ...
read it
-
ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph
We propose a knowledge-enhanced approach, ERNIE-ViL, to learn joint repr...
read it
-
Fine-grained Video Categorization with Redundancy Reduction Attention
For fine-grained categorization tasks, videos could serve as a better so...
read it
-
Chinese Street View Text: Large-scale Chinese Text Reading with Partially Supervised Learning
Most existing text reading benchmarks make it difficult to evaluate the ...
read it
-
Cross-Task Transfer for Multimodal Aerial Scene Recognition
Aerial scene recognition is a fundamental task in remote sensing and has...
read it
-
AIM 2020 Challenge on Real Image Super-Resolution: Methods and Results
This paper introduces the real image Super-Resolution (SR) challenge tha...
read it
-
ApolloCar3D: A Large 3D Car Instance Understanding Benchmark for Autonomous Driving
Autonomous driving has attracted remarkable attention from both industry...
read it
-
Detailed Human Shape Estimation from a Single Image by Hierarchical Mesh Deformation
This paper presents a novel framework to recover detailed human body sha...
read it
-
Logician: A Unified End-to-End Neural Approach for Open-Domain Information Extraction
In this paper, we consider the problem of open information extraction (O...
read it
-
Fast Universal Style Transfer for Artistic and Photorealistic Rendering
Universal style transfer is an image editing task that renders an input ...
read it
-
Curriculum Audiovisual Learning
Associating sound and its producer in complex audiovisual scene is a cha...
read it
-
UGAN: Untraceable GAN for Multi-Domain Face Translation
The multi-domain image-to-image translation is received increasing atten...
read it
-
WiderPerson: A Diverse Dataset for Dense Pedestrian Detection in the Wild
Pedestrian detection has achieved significant progress with the availabi...
read it
-
Channel Attention based Iterative Residual Learning for Depth Map Super-Resolution
Despite the remarkable progresses made in deep-learning based depth map ...
read it
-
MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation
End-to-end Speech-to-text Translation (E2E- ST), which directly translat...
read it
-
Deep Bi-Dense Networks for Image Super-Resolution
This paper proposes Deep Bi-Dense Networks (DBDN) for single image super...
read it
-
Learning from Large-scale Noisy Web Data with Ubiquitous Reweighting for Image Classification
Many advances of deep learning techniques originate from the efforts of ...
read it
-
TrafficPredict: Trajectory Prediction for Heterogeneous Traffic-Agents
To safely and efficiently navigate in complex urban traffic, autonomous ...
read it
-
Consensus Feature Network for Scene Parsing
Scene parsing is challenging as it aims to assign one of the semantic ca...
read it
-
Editing Text in the Wild
In this paper, we are interested in editing text in natural images, whic...
read it
-
A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning
Detecting scene text of arbitrary shapes has been a challenging task ove...
read it
-
3D Pose Estimation for Fine-Grained Object Categories
Existing object pose estimation datasets are related to generic object t...
read it
-
Cancer Metastasis Detection With Neural Conditional Random Field
Breast cancer diagnosis often requires accurate detection of metastasis ...
read it
-
A Network Structure to Explicitly Reduce Confusion Errors in Semantic Segmentation
Confusing classes that are ubiquitous in real world often degrade perfor...
read it
-
Depth Estimation via Affinity Learned with Convolutional Spatial Propagation Network
Depth estimation from a single image is a fundamental problem in compute...
read it
-
An End-to-end Video Text Detector with Online Tracking
Video text detection is considered as one of the most difficult tasks in...
read it
-
ODE-CNN: Omnidirectional Depth Extension Networks
Omnidirectional 360 camera proliferates rapidly for autonomous robots si...
read it
-
CrowdMove: Autonomous Mapless Navigation in Crowded Scenarios
Navigation is an essential capability for mobile robots. In this paper, ...
read it
-
BMN: Boundary-Matching Network for Temporal Action Proposal Generation
Temporal action proposal generation is an challenging and promising task...
read it
-
ChaLearn Looking at People: IsoGD and ConGD Large-scale RGB-D Gesture Recognition
The ChaLearn large-scale gesture recognition challenge has been run twic...
read it
-
Deep Speech: Scaling up end-to-end speech recognition
We present a state-of-the-art speech recognition system developed using ...
read it
-
Stochastic Gradient Made Stable: A Manifold Propagation Approach for Large-Scale Optimization
Stochastic gradient descent (SGD) holds as a classical method to build l...
read it
-
DeepLung: 3D Deep Convolutional Nets for Automated Pulmonary Nodule Detection and Classification
In this work, we present a fully automated lung CT cancer diagnosis syst...
read it
-
Question Answering over Knowledge Base with Neural Attention Combining Global Knowledge Information
With the rapid growth of knowledge bases (KBs) on the web, how to take f...
read it
-
Recruitment Market Trend Analysis with Sequential Latent Variable Models
Recruitment market analysis provides valuable understanding of industry-...
read it
-
Interactive Reinforcement Learning for Object Grounding via Self-Talking
Humans are able to identify a referred visual object in a complex scene ...
read it
-
Block-Sparse Recurrent Neural Networks
Recurrent Neural Networks (RNNs) are used in state-of-the-art models in ...
read it