
-
DF-VO: What Should Be Learnt for Visual Odometry?
Multi-view geometry-based methods dominate the last few decades in monoc...
read it
-
Semantics for Robotic Mapping, Perception and Interaction: A Survey
For robots to navigate and interact more richly with the world around th...
read it
-
MO-LTR: Multiple Object Localization, Tracking, and Reconstruction from Monocular RGB Videos
Semantic aware reconstruction is more advantageous than geometric-only r...
read it
-
EvidentialMix: Learning with Combined Open-set and Closed-set Noisy Labels
The efficacy of deep learning depends on large-scale data sets that have...
read it
-
HM4: Hidden Markov Model with Memory Management for Visual Place Recognition
Visual place recognition needs to be robust against appearance variabili...
read it
-
MOTChallenge: A Benchmark for Single-camera Multiple Target Tracking
Standardized benchmarks have been crucial in pushing the performance of ...
read it
-
How Trustworthy are the Existing Performance Evaluations for Basic Vision Tasks?
Performance evaluation is indispensable to the advancement of machine vi...
read it
-
Joint learning of Social Groups, Individuals Action and Sub-group Activities in Videos
The state-of-the art solutions for human activity understanding from a v...
read it
-
Unsupervised Depth Learning in Challenging Indoor Video: Weak Rectification to Rescue
Single-view depth estimation using CNNs trained from unlabelled videos h...
read it
-
FroDO: From Detections to 3D Objects
Object-oriented maps are important for scene understanding since they jo...
read it
-
MOT20: A benchmark for multi object tracking in crowded scenes
Standardized benchmarks are crucial for the majority of computer vision ...
read it
-
Real-time Image Smoothing via Iterative Least Squares
Edge-preserving image smoothing is a fundamental procedure for many comp...
read it
-
3D Gated Recurrent Fusion for Semantic Scene Completion
This paper tackles the problem of data fusion in the semantic scene comp...
read it
-
Hyperspectral Classification Based on 3D Asymmetric Inception Network with Data Fusion Transfer Learning
Hyperspectral image(HSI) classification has been improved with convoluti...
read it
-
Switchable Precision Neural Networks
Instantaneous and on demand accuracy-efficiency trade-off has been recen...
read it
-
Automatic Pruning for Quantized Neural Networks
Neural network quantization and pruning are two techniques commonly used...
read it
-
Learn to Predict Sets Using Feed-Forward Neural Networks
This paper addresses the task of set prediction using deep feed-forward ...
read it
-
Depth Based Semantic Scene Completion with Position Importance Aware Loss
Semantic Scene Completion (SSC) refers to the task of inferring the 3D s...
read it
-
Learning to generate new indoor scenes
Deep generative models have been used in recent years to learn coherent ...
read it
-
NeuRoRA: Neural Robust Rotation Averaging
Multiple rotation averaging is an essential task for structure from moti...
read it
-
Improved Visual Localization via Graph Smoothing
Vision based localization is the problem of inferring the pose of the ca...
read it
-
Meta Learning with Differentiable Closed-form Solver for Fast Video Object Segmentation
This paper tackles the problem of video object segmentation. We are spec...
read it
-
Structured Binary Neural Networks for Image Recognition
We propose methods to train convolutional neural networks (CNNs) with bo...
read it
-
Visual Odometry Revisited: What Should Be Learnt?
In this work we present a monocular visual odometry (VO) algorithm which...
read it
-
Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video
Recent work has shown that CNN-based depth and ego-motion estimators can...
read it
-
An Evaluation of Feature Matchers forFundamental Matrix Estimation
Matching two images while estimating their relative geometry is a key st...
read it
-
In defense of OSVOS
As a milestone for video object segmentation, one-shot video object segm...
read it
-
Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations
This paper tackles the problem of training a deep convolutional neural n...
read it
-
Scalable Place Recognition Under Appearance Change for Autonomous Driving
A major challenge in place recognition for autonomous driving is to be r...
read it
-
A Generalized Framework for Edge-preserving and Structure-preserving Image Smoothing
Image smoothing is a fundamental procedure in applications of both compu...
read it
-
Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks
Predicting the future trajectories of multiple interacting agents in a s...
read it
-
CVPR19 Tracking and Detection Challenge: How crowded can it get?
Standardized benchmarks are crucial for the majority of computer vision ...
read it
-
Seeing Behind Things: Extending Semantic Segmentation to Occluded Regions
Semantic segmentation and instance level segmentation made substantial p...
read it
-
Practical Robot Learning from Demonstrations using Deep End-to-End Training
Robots need to learn behaviors in intuitive and practical ways for wides...
read it
-
Bayesian Generative Active Deep Learning
Deep learning models have demonstrated outstanding performance in severa...
read it
-
Attention-guided Network for Ghost-free High Dynamic Range Imaging
Ghosting artifacts caused by moving objects or misalignments is a key ch...
read it
-
A Theoretically Sound Upper Bound on the Triplet Loss for Improving the Efficiency of Deep Distance Metric Learning
We propose a method that substantially improves the efficiency of deep d...
read it
-
Architecture Search of Dynamic Cells for Semantic Video Segmentation
In semantic video segmentation the goal is to acquire consistent dense s...
read it
-
Template-Based Automatic Search of Compact Semantic Segmentation Architectures
Automatic search of neural architectures for various vision and natural ...
read it
-
Training Quantized Network with Auxiliary Gradient Module
In this paper, we seek to tackle two challenges in training low-precisio...
read it
-
V2CNet: A Deep Learning Framework to Translate Videos to Commands for Robotic Manipulation
We propose V2CNet, a new deep learning framework to automatically transl...
read it
-
RGBD Based Dimensional Decomposition Residual Network for 3D Semantic Scene Completion
RGB images differentiate from depth images as they carry more details ab...
read it
-
Self-supervised Learning for Single View Depth and Surface Normal Estimation
In this work we present a self-supervised learning framework to simultan...
read it
-
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression
Intersection over Union (IoU) is the most popular evaluation metric used...
read it
-
Visual SLAM: Why Bundle Adjust?
Bundle adjustment plays a vital role in feature-based monocular SLAM. In...
read it
-
Multi-modal Ensemble Classification for Generalized Zero Shot Learning
Generalized zero shot learning (GZSL) is defined by a training process c...
read it
-
Learning Pairwise Relationship for Multi-object Detection in Crowded Scenes
As the post-processing step for object detection, non-maximum suppressio...
read it
-
Optimizable Object Reconstruction from a Single View
3D shape reconstruction from a single image is a highly ill-posed proble...
read it
-
Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation
In this paper, we propose to train convolutional neural networks (CNNs) ...
read it
-
Rethinking Binary Neural Network for Accurate Image Classification and Semantic Segmentation
In this paper, we propose to train a network with both binary weights an...
read it