
-
Semantic Scene Completion via Integrating Instances and Scene in-the-Loop
Semantic Scene Completion aims at reconstructing a complete 3D scene wit...
read it
-
LIFE: Lighting Invariant Flow Estimation
We tackle the problem of estimating flow between two images with large l...
read it
-
FocusNetv2: Imbalanced Large and Small Organ Segmentation with Adversarial Shape Constraint for Head and Neck CT Images
Radiotherapy is a treatment where radiation is used to eliminate cancer ...
read it
-
Fixing the Teacher-Student Knowledge Discrepancy in Distillation
Training a small student network with the guidance of a larger teacher n...
read it
-
AutoLoss-Zero: Searching Loss Functions from Scratch for Generic Tasks
Significant progress has been achieved in automating the design of vario...
read it
-
DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network
Conditional generative adversarial networks (cGANs) target at synthesizi...
read it
-
ST3D: Self-training for Unsupervised Domain Adaptation on 3D ObjectDetection
We present a new domain adaptive self-training pipeline, named ST3D, for...
read it
-
Learning N:M Fine-grained Structured Sparse Neural Networks From Scratch
Sparsity in Deep Neural Networks (DNNs) has been widely studied to compr...
read it
-
PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
3D object detection is receiving increasing attention from both industry...
read it
-
Fast Convergence of DETR with Spatially Modulated Co-Attention
The recently proposed Detection Transformer (DETR) model successfully ap...
read it
-
Consensus-Guided Correspondence Denoising
Correspondence selection between two groups of feature points aims to co...
read it
-
A Holistically-Guided Decoder for Deep Representation Learning with Applications to Semantic Segmentation and Object Detection
Both high-level and high-resolution feature representations are of great...
read it
-
LiDAR-based Panoptic Segmentation via Dynamic Shifting Network
With the rapid advances of autonomous driving, it becomes critical to eq...
read it
-
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation
State-of-the-art methods for large-scale driving-scene LiDAR segmentatio...
read it
-
End-to-End Object Detection with Adaptive Clustering Transformer
End-to-end Object Detection with Transformer (DETR)proposes to perform o...
read it
-
SelfVoxeLO: Self-supervised LiDAR Odometry with Voxel-based Deep Neural Networks
Recent learning-based LiDAR odometry methods have demonstrated their com...
read it
-
PV-RCNN: The Top-Performing LiDAR-only Solutions for 3D Detection / 3D Tracking / Domain Adaptation of Waymo Open Dataset Challenges
In this technical report, we present the top-performing LiDAR-only solut...
read it
-
EfficientFCN: Holistically-guided Decoding for Semantic Segmentation
Both performance and efficiency are important to semantic segmentation. ...
read it
-
Multi-organ Segmentation via Co-training Weight-averaged Models from Few-organ Datasets
Multi-organ segmentation has extensive applications in many clinical app...
read it
-
Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions
We propose a novel algorithm, named Open-Edit, which is the first attemp...
read it
-
Cylinder3D: An Effective 3D Framework for Driving-scene LiDAR Semantic Segmentation
State-of-the-art methods for large-scale driving-scene LiDAR semantic se...
read it
-
Balanced Meta-Softmax for Long-Tailed Visual Recognition
Deep classifiers have achieved great success in visual recognition. Howe...
read it
-
Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020
This technical report presents an overview of our solution used in the s...
read it
-
Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation
Depth information has proven to be a useful cue in the semantic segmenta...
read it
-
1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020
This technical report introduces our winning solution to the spatio-temp...
read it
-
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization
Localizing persons and recognizing their actions from videos is a challe...
read it
-
Self-supervising Fine-grained Region Similarities for Large-scale Image Localization
The task of large-scale retrieval-based image localization is to estimat...
read it
-
Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID
Domain adaptive object re-ID aims to transfer the learned knowledge from...
read it
-
StereoGAN: Bridging Synthetic-to-Real Domain Gap by Joint Optimization of Domain Translation and Stereo Matching
Large-scale synthetic datasets are beneficial to stereo matching but usu...
read it
-
Learning to Predict Context-adaptive Convolution for Semantic Segmentation
Long-range contextual information is essential for achieving high-perfor...
read it
-
3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure Prior
The goal of the Semantic Scene Completion (SSC) task is to simultaneousl...
read it
-
Structured Domain Adaptation for Unsupervised Person Re-identification
Unsupervised domain adaptation (UDA) aims at adapting the model trained ...
read it
-
Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation
Monocular 3D object detection task aims to predict the 3D bounding boxes...
read it
-
Structure-Feature based Graph Self-adaptive Pooling
Various methods to deal with graph data have been proposed in recent yea...
read it
-
Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification
Person re-identification (re-ID) aims at identifying the same persons' i...
read it
-
PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
We present a novel and high-performance 3D object detection framework, n...
read it
-
Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis
Semantic image synthesis aims at generating photorealistic images from s...
read it
-
Depth Completion from Sparse LiDAR Data with Depth-Normal Constraints
Depth completion aims to recover dense depth maps from sparse depth meas...
read it
-
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
Text-image cross-modal retrieval is a challenging task in the field of l...
read it
-
Interpolated Convolutional Networks for 3D Point Cloud Understanding
Point cloud is an important type of 3D representation. However, directly...
read it
-
Multi-modality Latent Interaction Network for Visual Question Answering
Exploiting relationships between visual regions and question words have ...
read it
-
FocusNet: Imbalanced Large and Small Organ Segmentation with an End-to-End Deep Neural Network for Head and Neck CT Images
In this paper, we propose an end-to-end deep neural network for solving ...
read it
-
Signet Ring Cell Detection With a Semi-supervised Learning Framework
Signet ring cell carcinoma is a type of rare adenocarcinoma with poor pr...
read it
-
Part-A^2 Net: 3D Part-Aware and Aggregation Neural Network for Object Detection from Point Cloud
In this paper, we propose the part-aware and aggregation neural network ...
read it
-
P2SGrad: Refined Gradients for Optimizing Deep Face Models
Cosine-based softmax losses significantly improve the performance of dee...
read it
-
AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations
The cosine-based softmax losses and their variants achieve great success...
read it
-
Generalizing Monocular 3D Human Pose Estimation in the Wild
The availability of the large-scale labeled 3D poses in the Human3.6M da...
read it
-
Conditional Adversarial Generative Flow for Controllable Image Synthesis
Flow-based generative models show great potential in image synthesis due...
read it
-
Group-wise Correlation Stereo Network
Stereo matching estimates the disparity between a rectified image pair, ...
read it
-
Unsupervised Cross-spectral Stereo Matching by Learning to Synthesize
Unsupervised cross-spectral stereo matching aims at recovering disparity...
read it