
-
Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection
Recent advances on 3D object detection heavily rely on how the 3D data a...
read it
-
Contrastive Transformation for Self-supervised Correspondence Learning
In this paper, we focus on the self-supervised learning of visual corres...
read it
-
Unsupervised Pre-training for Person Re-identification
In this paper, we present a large scale unlabeled person re-identificati...
read it
-
Spatio-Temporal Inception Graph Convolutional Networks for Skeleton-Based Action Recognition
Skeleton-based human action recognition has attracted much attention wit...
read it
-
Heterogeneous Contrastive Learning: Encoding Spatial Information for Compact Visual Representations
Contrastive learning has achieved great success in self-supervised visua...
read it
-
Can Semantic Labels Assist Self-Supervised Visual Representation Learning?
Recently, contrastive learning has largely advanced the progress of unsu...
read it
-
ProphetNet-Ads: A Looking Ahead Strategy for Generative Retrieval Models in Sponsored Search Engine
In a sponsored search engine, generative retrieval models are recently p...
read it
-
Masked Contrastive Representation Learning for Reinforcement Learning
Improving sample efficiency is a key research problem in reinforcement l...
read it
-
Boosting Continuous Sign Language Recognition via Cross Modality Augmentation
Continuous sign language recognition (SLR) deals with unaligned video-te...
read it
-
Improving Person Re-identification with Iterative Impression Aggregation
Our impression about one person often updates after we see more aspects ...
read it
-
Global-local Enhancement Network for NMFs-aware Sign Language Recognition
Sign language recognition (SLR) is a challenging problem, involving comp...
read it
-
Vision Meets Wireless Positioning: Effective Person Re-identification with Recurrent Context Propagation
Existing person re-identification methods rely on the visual sensor to c...
read it
-
Unsupervised Deep Representation Learning for Real-Time Tracking
The advancement of visual tracking has continuously been brought by deep...
read it
-
Single Shot Video Object Detector
Single shot detectors that are potentially faster and simpler than two-s...
read it
-
Efficient Integer-Arithmetic-Only Convolutional Neural Networks
Integer-arithmetic-only networks have been demonstrated effective to red...
read it
-
Cascaded Regression Tracking: Towards Online Hard Distractor Discrimination
Visual tracking can be easily disturbed by similar surrounding objects. ...
read it
-
Long Short-Term Relation Networks for Video Action Detection
It has been well recognized that modeling human-object or object-object ...
read it
-
Incorporating BERT into Neural Machine Translation
The recently proposed BERT has shown great power on a variety of natural...
read it
-
Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition
Despite the recent success of deep learning in continuous sign language ...
read it
-
Soft Hindsight Experience Replay
Efficient learning in the environment with sparse rewards is one of the ...
read it
-
A Generalization Theory based on Independent and Task-Identically Distributed Assumption
Existing generalization theories analyze the generalization performance ...
read it
-
Deep Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization
Model-based reinforcement learning algorithms tend to achieve higher sam...
read it
-
Quantization Networks
Although deep neural networks are highly effective, their high computati...
read it
-
AETv2: AutoEncoding Transformations for Self-Supervised Representation Learning by Minimizing Geodesic Distances in Lie Groups
Self-supervised learning by predicting transformations has demonstrated ...
read it
-
Video-based compression for plenoptic point clouds
The plenoptic point cloud that has multiple colors from various directio...
read it
-
Progressive Unsupervised Person Re-identification by Tracklet Association with Spatio-Temporal Regularization
Existing methods for person re-identification (Re-ID) are mostly based o...
read it
-
An End-to-End Foreground-Aware Network for Person Re-Identification
Person re-identification is a crucial task of identifying pedestrians of...
read it
-
Relation Distillation Networks for Video Object Detection
It has been well recognized that modeling object-to-object relations wou...
read it
-
Real-Time Correlation Tracking via Joint Model Compression and Transfer
Correlation filters (CF) have received considerable attention in visual ...
read it
-
Online Filter Clustering and Pruning for Efficient Convnets
Pruning filters is an effective method for accelerating deep neural netw...
read it
-
Progressive Learning of Low-Precision Networks
Recent years have witnessed the great advance of deep learning in a vari...
read it
-
Deep Learning-Based Video Coding: A Review and A Case Study
The past decade has witnessed great success of deep learning technology ...
read it
-
Unsupervised Deep Tracking
We propose an unsupervised visual tracking method in this paper. Differe...
read it
-
Occupancy-map-based rate distortion optimization for video-based point cloud compression
The state-of-the-art video-based point cloud compression scheme projects...
read it
-
Spatial and Temporal Mutual Promotion for Video-based Person Re-identification
Video-based person re-identification is a crucial task of matching video...
read it
-
Affinity Derivation and Graph Merge for Instance Segmentation
We present an instance segmentation scheme based on pixel affinity infor...
read it
-
In Defense of the Classification Loss for Person Re-Identification
The recent research for person re-identification has been focused on two...
read it
-
Low-Latency Human Action Recognition with Weighted Multi-Region Convolutional Neural Network
Spatio-temporal contexts are crucial in understanding human actions in v...
read it
-
Visual Attribute-augmented Three-dimensional Convolutional Neural Network for Enhanced Human Action Recognition
Visual attributes in individual video frames, such as the presence of ch...
read it
-
To Create What You Tell: Generating Videos from Captions
We are creating multimedia contents everyday and everywhere. While autom...
read it
-
Towards Open-Set Identity Preserving Face Synthesis
We propose a framework based on Generative Adversarial Networks to disen...
read it
-
Video-based Sign Language Recognition without Temporal Segmentation
Millions of hearing impaired people around the world routinely use some ...
read it
-
Feature Selective Networks for Object Detection
Objects for detection usually have distinct characteristics in different...
read it
-
Neural network-based arithmetic coding of intra prediction modes in HEVC
In both H.264 and HEVC, context-adaptive binary arithmetic coding (CABAC...
read it
-
Recent Advance in Content-based Image Retrieval: A Literature Survey
The explosive increase and ubiquitous accessibility of visual data on th...
read it
-
Co-projection-plane based 3-D padding for polyhedron projection for 360-degree video
The polyhedron projection for 360-degree video is becoming more and more...
read it
-
CVAE-GAN: Fine-Grained Image Generation through Asymmetric Training
We present variational generative adversarial networks, a general learni...
read it
-
A Convolutional Neural Network Approach for Half-Pel Interpolation in Video Coding
Motion compensation is a fundamental technology in video coding to remov...
read it
-
Convolutional Neural Network-Based Block Up-sampling for Intra Frame Coding
Inspired by the recent advances of image super-resolution using convolut...
read it
-
An Efficient Four-Parameter Affine Motion Model for Video Coding
In this paper, we study a simplified affine motion model based coding fr...
read it