
-
Trans2Seg: Transparent Object Segmentation with Transformer
This work presents a new fine-grained transparent object segmentation da...
-
TransTrack: Multiple-Object Tracking with Transformer
Multiple-object tracking (MOT) is mostly dominated by complex and multi-s...
-
OneNet: Towards End-to-End One-Stage Object Detection
End-to-end one-stage object detection trailed thus far. This paper disco...
-
SelfText Beyond Polygon: Unconstrained Text Detection with Box Supervision and Dynamic Self-Training
Although a polygon is a more accurate representation than an upright bou...
-
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
We present Sparse R-CNN, a purely sparse method for object detection in ...
-
Do 2D GANs Know 3D Shape? Unsupervised 3D shape reconstruction from 2D Image GANs
Natural images are projections of 3D objects on a 2D image plane. While ...
-
UXNet: Searching Multi-level Feature Aggregation for 3D Medical Image Segmentation
Aggregating multi-level feature representation plays a critical role in ...
-
RelativeNAS: Relative Neural Architecture Search via Slow-Fast Learning
Despite the remarkable successes of Convolutional Neural Networks (CNNs)...
-
Compensation Tracker: Data Association Method for Lost Object
At present, the main research direction of multi-object tracking framewo...
-
Webly Supervised Image Classification with Self-Contained Confidence
This paper focuses on webly supervised learning (WSL), where datasets ar...
-
Dynamic and Static Context-aware LSTM for Multi-agent Motion Prediction
Multi-agent motion prediction is challenging because it aims to foresee ...
-
AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting
Scene text spotting aims to detect and recognize the entire word or sent...
-
Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation
Multi-person pose estimation is challenging because it localizes body ke...
-
Whole-Body Human Pose Estimation in the Wild
This paper investigates the task of 2D human whole-body pose estimation,...
-
3D Human Mesh Regression with Dense Correspondence
Estimating 3D mesh of the human body from a single 2D image is an import...
-
Learning a Reinforced Agent for Flexible Exposure Bracketing Selection
Automatically selecting exposure bracketing (images exposed differently)...
-
Convolution-Weight-Distribution Assumption: Rethinking the Criteria of Channel Pruning
Channel pruning is one of the most important techniques for compressing ...
-
AdaX: Adaptive Gradient Descent with Exponential Long Term Memory
Although adaptive optimization algorithms such as Adam show fast converg...
-
Segmenting Transparent Objects in the Wild
Transparent objects such as windows and bottles made of glass widely exi...
-
Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation
Learning a good image prior is a long-term goal for image restoration an...
-
Exemplar Normalization for Learning Deep Representation
Normalization techniques are important in different advanced neural netw...
-
Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content
Image visual try-on aims at transferring a target clothing image onto a ...
-
Channel Equilibrium Networks for Learning Deep Representation
Convolutional Neural Networks (CNNs) are typically constructed by stacki...
-
How Does BN Increase Collapsed Neural Network Filters?
Improving sparsity of deep neural networks (DNNs) is essential for netwo...
-
Learning Depth-Guided Convolutions for Monocular 3D Object Detection
3D object detection from a single image without LiDAR is a challenging t...
-
Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow
A major challenge for video semantic segmentation is the lack of labeled...
-
Vision-Infused Deep Audio Inpainting
Multi-modality perception is essential to develop interactive intelligen...
-
PolarMask: Single Shot Instance Segmentation with Polar Representation
In this paper, we introduce an anchor-box free and single shot instance ...
-
TextSR: Content-Aware Text Super-Resolution Guided by Recognition
Scene text recognition has witnessed rapid development with the advance ...
-
Towards Improving Generalization of Deep Networks via Consistent Normalization
Batch Normalization (BN) was shown to accelerate training and improve ge...
-
Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid
Matching clothing images from customers and online shopping stores has r...
-
Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks
Group convolution, which divides the channels of ConvNets into groups, h...
-
Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once
Modern deep neural networks are often vulnerable to adversarial samples....
-
Deep Self-Learning From Noisy Labels
ConvNets achieve good results when training from clean data, but learnin...
-
MaskGAN: Towards Diverse and Interactive Facial Image Manipulation
Facial image manipulation has achieved great progress in recent years....
-
Switchable Normalization for Learning-to-Normalize Deep Representation
We address a learning-to-normalize problem by proposing Switchable Norma...
-
Atom Responding Machine for Dialog Generation
Recently, improving the relevance and diversity of dialogue system has a...
-
Switchable Whitening for Deep Representation Learning
Normalization methods are essential components in convolutional neural n...
-
SSN: Learning Sparse Switchable Normalization via SparsestMax
Normalization methods improve both optimization and generalization of Co...
-
WIDER Face and Pedestrian Challenge 2018: Methods and Results
This paper presents a review of the 2018 WIDER Challenge on Face and Ped...
-
DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images
Understanding fashion images has been advanced by benchmarks with rich a...
-
FaceFeat-GAN: a Two-Stage Approach for Identity-Preserving Face Synthesis
The advance of Generative Adversarial Networks (GANs) enables realistic ...
-
Do Normalization Layers in a Deep ConvNet Really Need to Be Distinct?
Yes, they do. This work investigates a perspective for deep learning: wh...
-
Understanding Regularization in Batch Normalization
Batch Normalization (BN) makes the output of a hidden neuron have zero mean and...
-
Hierarchical Neural Network for Extracting Knowledgeable Snippets and Documents
In this study, we focus on extracting knowledgeable snippets and annotat...
-
Temporal Sequence Distillation: Towards Few-Frame Action Recognition in Videos
Video Analytics Software as a Service (VA SaaS) has been rapidly growing...
-
Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net
Convolutional neural networks (CNNs) have achieved great successes in ma...
-
Talking Face Generation by Adversarially Disentangled Audio-Visual Representation
Talking face generation aims to synthesize a sequence of face images tha...
-
SCAN: Self-and-Collaborative Attention Network for Video Person Re-identification
Video person re-identification has attracted much attention in recent years. ...