
-
Probabilistic Graph Attention Network with Conditional Kernels for Pixel-Wise Prediction
Multi-scale representations deeply learned via convolutional neural netw...
read it
-
Inception Convolution with Efficient Dilation Search
Dilation convolution is a critical mutant of standard convolution neural...
read it
-
DETR for Pedestrian Detection
Pedestrian detection in crowd scenes poses a challenging problem due to ...
read it
-
Full Matching on Low Resolution for Disparity Estimation
A Multistage Full Matching disparity estimation scheme (MFM) is proposed...
read it
-
Direct Depth Learning Network for Stereo Matching
Being a crucial task of autonomous driving, Stereo matching has made gre...
read it
-
Temporal-Channel Transformer for 3D Lidar-Based Video Object Detection in Autonomous Driving
The strong demand of autonomous driving in the industry has lead to stro...
read it
-
Evolving Search Space for Neural Architecture Search
The automation of neural architecture design has been a coveted alternat...
read it
-
Adaptive Gradient Method with Resilience and Momentum
Several variants of stochastic gradient descent (SGD) have been proposed...
read it
-
Once Quantized for All: Progressively Searching for Quantized Efficient Models
Automatic search of Quantized Neural Networks has attracted a lot of att...
read it
-
Improving Auto-Augment via Augmentation-Wise Weight Sharing
The recent progress on automatically searching augmentation policies has...
read it
-
SAMOT: Switcher-Aware Multi-Object Tracking and Still Another MOT Measure
Multi-Object Tracking (MOT) is a popular topic in computer vision. Howev...
read it
-
Improving Deep Video Compression by Resolution-adaptive Flow Coding
In the learning based video compression approaches, it is an essential i...
read it
-
Exploring the Hierarchy in Relation Labels for Scene Graph Generation
By assigning each relationship a single label, current approaches formul...
read it
-
BriNet: Towards Bridging the Intra-class and Inter-class Gaps in One-Shot Segmentation
Few-shot segmentation focuses on the generalization of models to segment...
read it
-
Rethinking Pseudo-LiDAR Representation
The recently proposed pseudo-LiDAR based 3D detectors greatly improve th...
read it
-
Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation
Multi-person pose estimation is challenging because it localizes body ke...
read it
-
Whole-Body Human Pose Estimation in the Wild
This paper investigates the task of 2D human whole-body pose estimation,...
read it
-
3D Human Mesh Regression with Dense Correspondence
Estimating 3D mesh of the human body from a single 2D image is an import...
read it
-
Scope Head for Accurate Localizationin Object Detection
Existing anchor-based and anchor-free object detectors in multi-stage or...
read it
-
Scope Head for Accurate Localization in Object Detection
Existing anchor-based and anchor-free object detectors in multi-stage or...
read it
-
Cheaper Pre-training Lunch: An Efficient Paradigm for Object Detection
In this paper, we propose a general and efficient pre-training paradigm,...
read it
-
Location-Aware Feature Selection for Scene Text Detection
Direct regression-based natural scene text detection methods have alread...
read it
-
Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition
Spatial-temporal graphs have been widely used by skeleton-based action r...
read it
-
Content Adaptive and Error Propagation Aware Deep Video Compression
Recently, learning based video compression methods attract increasing at...
read it
-
Channel Pruning Guided by Classification Loss and Feature Importance
In this work, we propose a new layer-by-layer channel pruning method cal...
read it
-
Equalization Loss for Long-Tailed Object Recognition
Object recognition techniques using convolutional neural networks (CNN) ...
read it
-
EcoNAS: Finding Proxies for Economical Neural Architecture Search
Neural Architecture Search (NAS) achieves significant progress in many c...
read it
-
Learning 3D Human Shape and Pose from Dense Body Parts
Reconstructing 3D human shape and pose from a monocular image is challen...
read it
-
Computation Reallocation for Object Detection
The allocation of computation resources in the backbone is a crucial iss...
read it
-
A Shape Transformation-based Dataset Augmentation Framework for Pedestrian Detection
Deep learning-based computer vision is usually data-hungry. Many researc...
read it
-
TRB: A Novel Triplet Representation for Understanding 2D Human Body
Human pose and shape are two important components of 2D human body. Howe...
read it
-
Improving One-shot NAS by Suppressing the Posterior Fading
There is a growing interest in automated neural architecture search (NAS...
read it
-
IntersectGAN: Learning Domain Intersection for Generating Images with Multiple Attributes
Generative adversarial networks (GANs) have demonstrated great success i...
read it
-
GradNet: Gradient-Guided Network for Visual Object Tracking
The fully-convolutional siamese network based on template matching has s...
read it
-
Structured Modeling of Joint Deep Feature and Prediction Refinement for Salient Object Detection
Recent saliency models extensively explore to incorporate multi-scale co...
read it
-
Crowd Counting with Deep Structured Scale Integration Network
Automatic estimation of the number of people in unconstrained crowded sc...
read it
-
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments
Description-based person re-identification (Re-id) is an important task ...
read it
-
MMDetection: Open MMLab Detection Toolbox and Benchmark
We present MMDetection, an object detection toolbox that contains a rich...
read it
-
Improving Action Localization by Progressive Cross-stream Cooperation
Spatio-temporal action localization consists of three levels of tasks: s...
read it
-
AM-LFS: AutoML for Loss Function Search
Designing an effective loss function plays an important role in visual a...
read it
-
Online Hyper-parameter Learning for Auto-Augmentation Strategy
Data augmentation is critical to the success of modern deep learning tec...
read it
-
Contextualized Spatial-Temporal Network for Taxi Origin-Destination Demand Prediction
Taxi demand prediction has recently attracted increasing research intere...
read it
-
Box-driven Class-wise Region Masking and Filling Rate Guided Loss for Weakly Supervised Semantic Segmentation
Semantic segmentation has achieved huge progress via adopting deep Fully...
read it
-
Libra R-CNN: Towards Balanced Learning for Object Detection
Compared with model architectures, the training process, which is also c...
read it
-
Feature Intertwiner for Object Detection
A well-trained model should classify objects with a unanimous score for ...
read it
-
Accurate Monocular 3D Object Detection via Color-Embedded 3D Reconstruction for Autonomous Driving
In this paper, we propose a monocular 3D object detection framework in t...
read it
-
GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving
We present an efficient 3D object detection framework based on a single ...
read it
-
Multi-person Articulated Tracking with Spatial and Temporal Embeddings
We propose a unified framework for multi-person pose estimation and trac...
read it
-
SR-LSTM: State Refinement for LSTM towards Pedestrian Trajectory Prediction
In crowd scenarios, reliable trajectory prediction of pedestrians requir...
read it
-
WIDER Face and Pedestrian Challenge 2018: Methods and Results
This paper presents a review of the 2018 WIDER Challenge on Face and Ped...
read it