
-
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Transformers, which are popular for language modeling, have been explore...
-
ORDNet: Capturing Omni-Range Dependencies for Scene Parsing
Learning to capture dependencies between spatial positions is essential ...
-
ProxylessKD: Direct Knowledge Distillation with Inherited Classifier for Face Recognition
Knowledge Distillation (KD) refers to transferring knowledge from a larg...
-
Towards Accurate Human Pose Estimation in Videos of Crowded Scenes
Video-based human pose estimation in crowded scenes is a challenging pro...
-
Toward Accurate Person-level Action Recognition in Videos of Crowded Scenes
Detecting and recognizing human action in videos with crowded scenes is ...
-
A Simple Baseline for Pose Tracking in Videos of Crowded Scenes
This paper presents our solution to ACM MM challenge: Large-scale Human-...
-
Dual-constrained Deep Semi-Supervised Coupled Factorization Network with Enriched Prior
Nonnegative matrix factorization is usually powerful for learning the "s...
-
Learning Target Domain Specific Classifier for Partial Domain Adaptation
Unsupervised domain adaptation (UDA) aims at reducing the distribution d...
-
Dual Adversarial Auto-Encoders for Clustering
As a powerful approach for exploratory data analysis, unsupervised clust...
-
ConvBERT: Improving BERT with Span-based Dynamic Convolution
Pre-trained language models like BERT and its variants have recently ach...
-
A Survey on Concept Factorization: From Shallow to Deep Representation Learning
The quality of learned features by representation learning determines th...
-
Rethinking Bottleneck Structure for Efficient Mobile Network Design
The inverted residual block is dominating architecture design for mobile...
-
Recapture as You Want
With the increasing prevalence and more powerful camera systems of mobil...
-
PANDA: Prototypical Unsupervised Domain Adaptation
Previous adversarial domain alignment methods for unsupervised domain ad...
-
Highly Efficient Salient Object Detection with 100K Parameters
Salient object detection models often demand a considerable amount of co...
-
Fast Dense Residual Network: Enhancing Global Dense Feature Flow for Text Recognition
Deep Convolutional Neural Networks (CNNs), such as Dense Convolutional N...
-
RC-DARTS: Resource Constrained Differentiable Architecture Search
Recent advances show that Neural Architectural Search (NAS) method is ab...
-
Very Long Natural Scenery Image Prediction by Outpainting
Comparing to image inpainting, image outpainting receives less attention...
-
Learning Hybrid Representation by Robust Dictionary Learning in Factorized Compressed Space
In this paper, we investigate the robust dictionary learning (DL) to dis...
-
Asymmetric GAN for Unpaired Image-to-image Translation
Unpaired image-to-image translation problem aims to model the mapping fr...
-
Fast DenseNet: Towards Efficient and Accurate Text Recognition with Fast Dense Networks
Convolutional Recurrent Neural Network (CRNN) is a popular network for r...
-
DerainCycleGAN: An Attention-guided Unsupervised Benchmark for Single Image Deraining and Rainmaking
Single image deraining (SID) is an important and challenging topic in em...
-
Multilayer Collaborative Low-Rank Coding Network for Robust Deep Subspace Discovery
For subspace recovery, most existing low-rank representation (LRR) model...
-
Efficient Differentiable Neural Architecture Search with Meta Kernels
The searching procedure of neural architecture search (NAS) is notorious...
-
AdversarialNAS: Adversarial Neural Architecture Search for GANs
Neural Architecture Search (NAS) that aims to automate the procedure of ...
-
Discriminative Local Sparse Representation by Robust Adaptive Dictionary Pair Learning
In this paper, we propose a structured Robust Adaptive Dictionary Pair ...
-
PSGAN: Pose-Robust Spatial-Aware GAN for Customizable Makeup Transfer
We propose a novel Pose-robust Spatial-aware GAN (PSGAN) for transferrin...
-
Flexible Auto-weighted Local-coordinate Concept Factorization: A Robust Framework for Unsupervised Clustering
Concept Factorization (CF) and its variants may produce inaccurate repre...
-
Single-Stage Multi-Person Pose Machines
Multi-person pose estimation is a challenging problem. Existing methods ...
-
Joint Subspace Recovery and Enhanced Locality Driven Robust Flexible Discriminative Dictionary Learning
We propose a joint subspace recovery and enhanced locality based robust ...
-
Kernel-Induced Label Propagation by Mapping for Semi-Supervised Classification
Kernel methods have been successfully applied to the areas of pattern re...
-
Jointly Learning Structured Analysis Discriminative Dictionary and Analysis Multiclass Classifier
In this paper, we propose an analysis mechanism based structured Analysi...
-
Joint Label Prediction based Semi-Supervised Adaptive Concept Factorization for Robust Data Representation
Constrained Concept Factorization (CCF) yields the enhanced representati...
-
Robust Unsupervised Flexible Auto-weighted Local-Coordinate Concept Factorization for Image Clustering
We investigate the high-dimensional data clustering problem by proposing...
-
Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution
In natural images, information is conveyed at different frequencies wher...
-
Multi-Prototype Networks for Unconstrained Set-based Face Recognition
In this paper, we study the challenging unconstrained set-based face rec...
-
Graph-Based Global Reasoning Networks
Globally modeling and reasoning over relations between regions can be be...
-
Style Separation and Synthesis via Generative Adversarial Networks
Style synthesis attracts great interests recently, while few works focus...
-
A^2-Nets: Double Attention Networks
Learning to capture long-range relations is fundamental to image/video r...
-
Look Across Elapse: Disentangled Representation Learning and Photorealistic Cross-Age Face Synthesis for Age-Invariant Face Recognition
Despite the remarkable progress in face recognition related technologies...
-
Multi-Fiber Networks for Video Recognition
In this paper, we aim to reduce the computational cost of spatio-tempora...
-
Exact Low Tubal Rank Tensor Recovery from Gaussian Measurements
The recent proposed Tensor Nuclear Norm (TNN) [Lu et al., 2016; 2018a] i...
-
Subspace Clustering by Block Diagonal Representation
This paper studies the subspace clustering problem. Given some data poin...
-
Tensor Robust Principal Component Analysis with A New Tensor Nuclear Norm
In this paper, we consider the Tensor Robust Principal Component Analysi...
-
Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing
Despite the noticeable progress in perceptual tasks like detection, inst...
-
Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation
Previous deep learning based state-of-the-art scene text detection metho...
-
Face Aging with Contextual Generative Adversarial Nets
Face aging, which renders aging faces for an input face, has attracted e...
-
BT-Nets: Simplifying Deep Neural Networks via Block Term Decomposition
Recently, deep neural networks (DNNs) have been regarded as the state-of...
-
Weaving Multi-scale Context for Single Shot Detector
Aggregating context information from multiple scales has been proved to ...
-
Nonconvex Sparse Spectral Clustering by Alternating Direction Method of Multipliers and Its Convergence Analysis
Spectral Clustering (SC) is a widely used data clustering method which f...