
-
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Most recent semantic segmentation methods adopt a fully-convolutional ne...
read it
-
Whose hand is this? Person Identification from Egocentric Hand Gestures
Recognizing people by faces and other biometrics has been extensively st...
read it
-
Data-efficient Alignment of Multimodal Sequences by Aligning Gradient Updates and Internal Feature Distributions
The task of video and text sequence alignment is a prerequisite step tow...
read it
-
Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition
Humans can easily recognize actions with only a few examples given, whil...
read it
-
M3Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening from CT Imaging
To counter the outbreak of COVID-19, the accurate diagnosis of suspected...
read it
-
Chained-Tracker: Chaining Paired Attentive Regression Results for End-to-End Joint Multiple-Object Detection and Tracking
Existing Multiple-Object Tracking (MOT) methods either follow the tracki...
read it
-
How to trust unlabeled data? Instance Credibility Inference for Few-Shot Learning
Deep learning based models have excelled in many computer vision task an...
read it
-
DessiLBI: Exploring Structural Sparsity of Deep Networks via Differential Inclusion Paths
Over-parameterization is ubiquitous nowadays in training neural networks...
read it
-
Self-supervised Video Object Segmentation
The objective of this paper is self-supervised representation learning, ...
read it
-
Long-Term Cloth-Changing Person Re-identification
Person re-identification (Re-ID) aims to match a target person across ca...
read it
-
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sketch Gestalt
Previous researches of sketches often considered sketches in pixel forma...
read it
-
Instance Credibility Inference for Few-Shot Learning
Few-shot learning (FSL) aims to recognize new objects with extremely lim...
read it
-
Neural Pose Transfer by Spatially Adaptive Instance Normalization
Pose transfer has been studied for decades, in which the pose of a sourc...
read it
-
When Person Re-identification Meets Changing Clothes
Person re-identification (Reid) is now an active research topic for AI-b...
read it
-
Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition
Affective computing and cognitive theory are widely used in modern human...
read it
-
DeepSFM: Structure From Motion Via Deep Bundle Adjustment
Structure from motion (SfM) is an essential computer vision problem whic...
read it
-
Meta-Reinforced Synthetic Data for One-Shot Fine-Grained Visual Recognition
One-shot fine-grained visual recognition often suffers from the problem ...
read it
-
Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation
We study the problem of shape generation in 3D mesh representation from ...
read it
-
A Fine-Grained Facial Expression Database for End-to-End Multi-Pose Facial Expression Recognition
The recent research of facial expression recognition has made a lot of p...
read it
-
Image Deformation Meta-Networks for One-Shot Learning
Humans can robustly learn novel visual concepts even when images undergo...
read it
-
Parsimonious Deep Learning: A Differential Inclusion Approach with Global Convergence
Over-parameterization is ubiquitous nowadays in training neural networks...
read it
-
S^2-LBI: Stochastic Split Linearized Bregman Iterations for Parsimonious Deep Learning
This paper proposes a novel Stochastic Split Linearized Bregman Iteratio...
read it
-
Question Guided Modular Routing Networks for Visual Question Answering
Visual Question Answering (VQA) faces two major challenges: how to bette...
read it
-
A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization
Emotional content is a crucial ingredient in user-generated videos. Howe...
read it
-
Learning Large Euclidean Margin for Sketch-based Image Retrieval
This paper addresses the problem of Sketch-Based Image Retrieval (SBIR),...
read it
-
Instance-level Sketch-based Retrieval by Deep Triplet Classification Siamese Network
Sketch has been employed as an effective communicative tool to express t...
read it
-
Learning to Separate Domains in Generalized Zero-Shot and Open Set Learning: a probabilistic perspective
This paper studies the problem of domain division problem which aims to ...
read it
-
Progressive Deep Neural Networks Acceleration via Soft Filter Pruning
This paper proposed a Progressive Soft Filter Pruning method (PSFP) to p...
read it
-
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
This paper proposed a Soft Filter Pruning (SFP) method to accelerate the...
read it
-
Detecting Tiny Moving Vehicles in Satellite Videos
In recent years, the satellite videos have been captured by a moving sat...
read it
-
SCSP: Spectral Clustering Filter Pruning with Soft Self-adaption Manners
Deep Convolutional Neural Networks (CNN) has achieved significant succes...
read it
-
MSplit LBI: Realizing Feature Selection and Dense Estimation Simultaneously in Few-shot and Zero-shot Learning
It is one typical and general topic of learning a good embedding model t...
read it
-
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning
Zero-Shot Learning (ZSL) is achieved via aligning the semantic relations...
read it
-
Semantic Feature Augmentation in Few-shot Learning
A fundamental problem with few-shot learning is the scarcity of data in ...
read it
-
A Large-scale Attribute Dataset for Zero-shot Learning
Zero-Shot Learning (ZSL) has attracted huge research attention over the ...
read it
-
Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images
We propose an end-to-end deep learning architecture that produces a 3D s...
read it
-
Learning to score the figure skating sports videos
This paper targets at learning to score the figure skating sports videos...
read it
-
Learning to score and summarize figure skating sport videos
This paper focuses on fully understanding the figure skating sport video...
read it
-
Pose-Normalized Image Generation for Person Re-identification
Person Re-identification (re-id) faces two major challenges: the lack of...
read it
-
AI Challenger : A Large-scale Dataset for Going Deeper in Image Understanding
Significant progress has been achieved in Computer Vision by leveraging ...
read it
-
Left-Right Skip-DenseNets for Coarse-to-Fine Object Categorization
Inspired by the recent neuroscience studies on the left-right asymmetry ...
read it
-
Recent Advances in Zero-shot Recognition
With the recent renaissance of deep convolution neural networks, encoura...
read it
-
Multi-scale Deep Learning Architectures for Person Re-identification
Person Re-identification (re-id) aims to match people across non-overlap...
read it
-
A Jointly Learned Deep Architecture for Facial Attribute Analysis and Face Detection in the Wild
Facial attribute analysis in the real world scenario is very challenging...
read it
-
Vocabulary-informed Extreme Value Learning
The novel unseen classes can be formulated as the extreme values of know...
read it
-
Semi-Latent GAN: Learning to generate and modify facial images from attributes
Generating and manipulating human facial images using high-level attribu...
read it
-
Learning to Generate Posters of Scientific Papers by Probabilistic Graphical Models
Researchers often summarize their work in the form of scientific posters...
read it
-
Deep Learning for Video Classification and Captioning
Accelerated by the tremendous increase in Internet bandwidth and storage...
read it
-
Semi-supervised Vocabulary-informed Learning
Despite significant progress in object categorization, in recent years, ...
read it
-
Learning to Generate Posters of Scientific Papers
Researchers often summarize their work in the form of posters. Posters p...
read it