
-
A Generic Object Re-identification System for Short Videos
Short video applications like TikTok and Kwai have been a great hit rece...
read it
-
Nonlinear Monte Carlo Method for Imbalanced Data Learning
For basic machine learning problems, expected error is used to evaluate ...
read it
-
M3Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening from CT Imaging
To counter the outbreak of COVID-19, the accurate diagnosis of suspected...
read it
-
Temporal Context Aggregation for Video Retrieval with Contrastive Learning
The current research focus on Content-Based Video Retrieval requires hig...
read it
-
Long-Term Cloth-Changing Person Re-identification
Person re-identification (Re-ID) aims to match a target person across ca...
read it
-
MOTS: Multiple Object Tracking for General Categories Based On Few-Shot Method
Most modern Multi-Object Tracking (MOT) systems typically apply REID-bas...
read it
-
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sketch Gestalt
Previous researches of sketches often considered sketches in pixel forma...
read it
-
BERT-ATTACK: Adversarial Attack Against BERT Using BERT
Adversarial attacks for discrete data (such as text) has been proved sig...
read it
-
Neural Pose Transfer by Spatially Adaptive Instance Normalization
Pose transfer has been studied for decades, in which the pose of a sourc...
read it
-
3DCFS: Fast and Robust Joint 3D Semantic-Instance Segmentation via Coupled Feature Selection
We propose a novel fast and robust 3D point clouds segmentation framewor...
read it
-
Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition
Affective computing and cognitive theory are widely used in modern human...
read it
-
DeepSFM: Structure From Motion Via Deep Bundle Adjustment
Structure from motion (SfM) is an essential computer vision problem whic...
read it
-
Multi-Scale Self-Attention for Text Classification
In this paper, we introduce the prior knowledge, multi-scale structure, ...
read it
-
Joint Parsing and Generation for Abstractive Summarization
Sentences produced by abstractive summarization systems can be ungrammat...
read it
-
Fast Color Constancy with Patch-wise Bright Pixels
In this paper, a learning-free color constancy algorithm called the Patc...
read it
-
Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models
The impressive performance of neural networks on natural language proces...
read it
-
Towards Instance-level Image-to-Image Translation
Unpaired Image-to-image Translation is a new rising and challenging visi...
read it
-
Question Guided Modular Routing Networks for Visual Question Answering
Visual Question Answering (VQA) faces two major challenges: how to bette...
read it
-
CODA: Counting Objects via Scale-aware Adversarial Density Adaption
Recent advances in crowd counting have achieved promising results with i...
read it
-
Star-Transformer
Although the fully-connected attention-based model Transformer has achie...
read it
-
Spatial Mixture Models with Learnable Deep Priors for Perceptual Grouping
Humans perceive the seemingly chaotic world in a structured and composit...
read it
-
A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization
Emotional content is a crucial ingredient in user-generated videos. Howe...
read it
-
MEAL: Multi-Model Ensemble via Adversarial Learning
Often the best performing deep neural models are ensembles of multiple b...
read it
-
Instance-level Sketch-based Retrieval by Deep Triplet Classification Siamese Network
Sketch has been employed as an effective communicative tool to express t...
read it
-
High Order Neural Networks for Video Classification
Capturing spatiotemporal correlations is an essential topic in video cla...
read it
-
Learning to Separate Domains in Generalized Zero-Shot and Open Set Learning: a probabilistic perspective
This paper studies the problem of domain division problem which aims to ...
read it
-
Object Detection from Scratch with Deep Supervision
We propose Deeply Supervised Object Detectors (DSOD), an object detectio...
read it
-
Top-Down Tree Structured Text Generation
Text generation is a fundamental building block in natural language proc...
read it
-
SCSP: Spectral Clustering Filter Pruning with Soft Self-adaption Manners
Deep Convolutional Neural Networks (CNN) has achieved significant succes...
read it
-
Semantic Feature Augmentation in Few-shot Learning
A fundamental problem with few-shot learning is the scarcity of data in ...
read it
-
Learning to score the figure skating sports videos
This paper targets at learning to score the figure skating sports videos...
read it
-
Learning to score and summarize figure skating sport videos
This paper focuses on fully understanding the figure skating sport video...
read it
-
Pose-Normalized Image Generation for Person Re-identification
Person Re-identification (re-id) faces two major challenges: the lack of...
read it
-
Learning Object Detectors from Scratch with Gated Recurrent Feature Pyramids
In this paper, we propose gated recurrent feature pyramid for the proble...
read it
-
DeepSkeleton: Skeleton Map for 3D Human Pose Regression
Despite recent success on 2D human pose estimation, 3D human pose estima...
read it
-
Left-Right Skip-DenseNets for Coarse-to-Fine Object Categorization
Inspired by the recent neuroscience studies on the left-right asymmetry ...
read it
-
Recent Advances in Zero-shot Recognition
With the recent renaissance of deep convolution neural networks, encoura...
read it
-
Multi-scale Deep Learning Architectures for Person Re-identification
Person Re-identification (re-id) aims to match people across non-overlap...
read it
-
DSOD: Learning Deeply Supervised Object Detectors from Scratch
We present Deeply Supervised Object Detector (DSOD), a framework that ca...
read it
-
A Jointly Learned Deep Architecture for Facial Attribute Analysis and Face Detection in the Wild
Facial attribute analysis in the real world scenario is very challenging...
read it
-
Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification
Videos are inherently multimodal. This paper studies the problem of how ...
read it
-
Vocabulary-informed Extreme Value Learning
The novel unseen classes can be formulated as the extreme values of know...
read it
-
Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach
In this paper, we study the task of 3D human pose estimation in the wild...
read it
-
Semi-Latent GAN: Learning to generate and modify facial images from attributes
Generating and manipulating human facial images using high-level attribu...
read it
-
Weakly Supervised Dense Video Captioning
This paper focuses on a novel and challenging vision task, dense video c...
read it
-
Iterative Object and Part Transfer for Fine-Grained Recognition
The aim of fine-grained recognition is to identify sub-ordinate categori...
read it
-
Arbitrary-Oriented Scene Text Detection via Rotation Proposals
This paper introduces a novel rotation-based framework for arbitrary-ori...
read it
-
Evolving Boxes for Fast Vehicle Detection
We perform fast vehicle detection from traffic surveillance cameras. A n...
read it
-
Model-based Deep Hand Pose Estimation
Previous learning based hand pose estimation methods does not fully expl...
read it
-
Learning to Point and Count
This paper proposes the problem of point-and-count as a test case to bre...
read it