
-
CM-NAS: Rethinking Cross-Modality Neural Architectures for Visible-Infrared Person Re-Identification
Visible-Infrared person re-identification (VI-ReID) aims at matching cro...
read it
-
FaceX-Zoo: A PyTorch Toolbox for Face Recognition
Deep learning based face recognition has achieved significant progress i...
read it
-
Synthetic Training for Monocular Human Mesh Recovery
Recovering 3D human mesh from monocular images is a popular topic in com...
read it
-
Joint Contrastive Learning with Infinite Possibilities
This paper explores useful modifications of the recent development in co...
read it
-
The Elements of End-to-end Deep Face Recognition: A Survey of Recent Advances
Face recognition is one of the most fundamental and long-standing topics...
read it
-
Learning to Localize Actions from Moments
With the knowledge of action moments (i.e., trimmed video clips that eac...
read it
-
CenterHMR: a Bottom-up Single-shot Method for Multi-person 3D Mesh Recovery from a Single Image
In this paper, we propose a method to recover multi-person 3D mesh from ...
read it
-
Black Re-ID: A Head-shoulder Descriptor for the Challenging Problem of Person Re-Identification
Person re-identification (Re-ID) aims at retrieving an input person imag...
read it
-
SeCo: Exploring Sequence Supervision for Unsupervised Representation Learning
A steady momentum of innovations and breakthroughs has convincingly push...
read it
-
NPCFace: A Negative-Positive Cooperation Supervision for Training Large-scale Face Recognition
Deep face recognition has made remarkable advances in the last few years...
read it
-
Classes Matter: A Fine-grained Adversarial Approach to Cross-domain Semantic Segmentation
Despite great progress in supervised semantic segmentation,a large perfo...
read it
-
Semi-Siamese Training for Shallow Face Learning
Most existing public face datasets, such as MS-Celeb-1M and VGGFace2, pr...
read it
-
Loss Function Search for Face Recognition
In face recognition, designing margin-based (e.g., angular, additive, ad...
read it
-
Single Shot Video Object Detector
Single shot detectors that are potentially faster and simpler than two-s...
read it
-
Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training
In this work, we present Auto-captions on GIF, which is a new large-scal...
read it
-
Transferring and Regularizing Prediction for Semantic Segmentation
Semantic segmentation often requires a large set of images with pixel-le...
read it
-
Learning a Unified Sample Weighting Network for Object Detection
Region sampling or weighting is significantly important to the success o...
read it
-
Exploring Category-Agnostic Clusters for Open-Set Domain Adaptation
Unsupervised domain adaptation has received significant attention in rec...
read it
-
FastReID: A Pytorch Toolbox for Real-world Person Re-identification
We present FastReID, as a widely used object re-identification (re-id) s...
read it
-
Robust Visual Object Tracking with Two-Stream Residual Convolutional Networks
The current deep learning based visual tracking approaches have been ver...
read it
-
VehicleNet: Learning Robust Visual Representation for Vehicle Re-identification
One fundamental challenge of vehicle re-identification (re-id) is to lea...
read it
-
Look-into-Object: Self-supervised Structure Modeling for Object Recognition
Most object recognition approaches predominantly focus on learning discr...
read it
-
X-Linear Attention Networks for Image Captioning
Recent progress on fine-grained visual recognition and visual question a...
read it
-
Long Short-Term Relation Networks for Video Action Detection
It has been well recognized that modeling human-object or object-object ...
read it
-
Adaptive Semantic-Visual Tree for Hierarchical Embeddings
Merchandise categories inherently form a semantic hierarchy with differe...
read it
-
Vision and Language: from Visual Perception to Content Creation
Vision and language are two fundamental capabilities of human intelligen...
read it
-
Down to the Last Detail: Virtual Try-on with Detail Carving
Virtual try-on under arbitrary poses has attracted lots of research atte...
read it
-
Theme-Matters: Fashion Compatibility Learning via Theme Attention
Fashion compatibility learning is important to many fashion markets such...
read it
-
Zooming into Face Forensics: A Pixel-level Analysis
The stunning progress in face manipulation methods has made it possible ...
read it
-
Mis-classified Vector Guided Softmax Loss for Face Recognition
Face recognition has witnessed significant progress due to the advances ...
read it
-
Scheduled Differentiable Architecture Search for Visual Recognition
Convolutional Neural Networks (CNN) have been regarded as a capable clas...
read it
-
Hierarchy Parsing for Image Captioning
It is always well believed that parsing an image into constituent visual...
read it
-
Deep Metric Learning with Density Adaptivity
The problem of distance metric learning is mostly considered from the pe...
read it
-
Gaussian Temporal Awareness Networks for Action Localization
Temporally localizing actions in a video is a fundamental challenge in v...
read it
-
Relationship-Aware Spatial Perception Fusion for Realistic Scene Layout Generation
The significant progress on Generative Adversarial Networks (GANs) have ...
read it
-
Customizable Architecture Search for Semantic Segmentation
In this paper, we propose a Customizable Architecture Search (CAS) appro...
read it
-
Mocycle-GAN: Unpaired Video-to-Video Translation
Unsupervised image-to-image translation is the task of translating an im...
read it
-
Relation Distillation Networks for Video Object Detection
It has been well recognized that modeling object-to-object relations wou...
read it
-
daBNN: A Super Fast Inference Framework for Binary Neural Networks on ARM devices
It is always well believed that Binary Neural Networks (BNNs) could dras...
read it
-
Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation
Image paragraph generation is the task of producing a coherent story (us...
read it
-
Regularizing Proxies with Multi-Adversarial Training for Unsupervised Domain-Adaptive Semantic Segmentation
Training a semantic segmentation model requires a large amount of pixel-...
read it
-
Hard-Aware Fashion Attribute Classification
Fashion attribute classification is of great importance to many high-lev...
read it
-
Learning Spatio-Temporal Representation with Local and Global Diffusion
Convolutional Neural Networks (CNN) have been regarded as a powerful cla...
read it
-
Group Re-Identification with Multi-grained Matching and Integration
The task of re-identifying groups of people underdifferent camera views ...
read it
-
A High-Efficiency Framework for Constructing Large-Scale Face Parsing Benchmark
Face parsing, which is to assign a semantic label to each pixel in face ...
read it
-
Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning
It is well believed that video captioning is a fundamental but challengi...
read it
-
Pointing Novel Objects in Image Captioning
Image captioning has received significant attention with remarkable impr...
read it
-
Transferrable Prototypical Networks for Unsupervised Domain Adaptation
In this paper, we introduce a new idea for unsupervised domain adaptatio...
read it
-
Everyone is a Cartoonist: Selfie Cartoonization with Attentive Adversarial Networks
Selfie and cartoon are two popular artistic forms that are widely presen...
read it
-
Unsupervised Person Image Generation with Semantic Parsing Transformation
In this paper, we address unsupervised pose-guided person image generati...
read it