
On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction and An Optimal Training Framework
Lossy compression algorithms are typically designed to achieve the lowes...
The Sobolev Regularization Effect of Stochastic Gradient Descent
The multiplicative structure of parameters and input data in the first l...
Learning Transferable Kinematic Dictionary for 3D Human Pose and Shape Reconstruction
Estimating 3D human pose and shape from a single image is highly underc...
Nonlinear Weighted Directed Acyclic Graph and A Priori Estimates for Neural Networks
In an attempt to better understand structural benefits and generalizatio...
IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking
Adversarial attack arises due to the vulnerability of deep neural networ...
Continual Learning for Blind Image Quality Assessment
The explosive growth of image data facilitates the fast development of i...
Achieving Adversarial Robustness Requires An Active Teacher
A new understanding of adversarial examples and adversarial robustness i...
FIT: a Fast and Accurate Framework for Solving Medical Inquiring and Diagnosing Tasks
Automatic self-diagnosis provides low-cost and accessible healthcare via...
Language-guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning
The emerging vision-and-language navigation (VLN) problem aims at learni...
DEAL: Difficulty-aware Active Learning for Semantic Segmentation
Active learning aims to address the paucity of labeled data by finding t...
Towards Theoretically Understanding Why SGD Generalizes Better Than ADAM in Deep Learning
It is not clear yet why ADAM-alike adaptive gradient algorithms suffer f...
DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs
Graph neural networks (GNN) have shown great success in learning from gr...
Infrared target tracking based on proximal robust principal component analysis method
Infrared target tracking plays an important role in both civil and milit...
Interpretable Neural Computation for Real-World Compositional Visual Question Answering
There are two main lines of research on visual question answering (VQA):...
Towards a Mathematical Understanding of Neural Network-Based Machine Learning: what we know and what we don't
The purpose of this article is to review the achievements made in the la...
Complexity Measures for Neural Networks with General Activation Functions Using Path-based Norms
A simple approach is proposed to obtain complexity controls for neural n...
A Qualitative Study of the Dynamic Behavior of Adaptive Gradient Algorithms
The dynamic behavior of RMSprop and Adam algorithms is studied through a...
The Slow Deterioration of the Generalization Error of the Random Feature Model
The random feature model exhibits a kind of resonance behavior when the ...
Rethinking Image Deraining via Rain Streaks and Vapors
Single image deraining regards an input image as a fusion of a backgroun...
Unsupervised Deep Representation Learning for Real-Time Tracking
The advancement of visual tracking has continuously been brought by deep...
Robust Tracking against Adversarial Attacks
While deep convolutional neural networks (CNNs) are vulnerable to advers...
Semantic Equivalent Adversarial Data Augmentation for Visual Question Answering
Visual Question Answering (VQA) has achieved great success thanks to the...
The Quenching-Activation Behavior of the Gradient Descent Dynamics for Two-layer Neural Network Models
A numerical and phenomenological study of the gradient descent (GD) algo...
Accelerating MRI Reconstruction on TPUs
The advanced magnetic resonance (MR) image reconstructions such as the c...
VAEM: a Deep Generative Model for Heterogeneous Mixed Type Data
Deep generative models often perform poorly in real-world applications d...
DGL-KE: Training Knowledge Graph Embeddings at Scale
Knowledge graphs have emerged as a key abstraction for organizing inform...
Two-stage model and optimal SI-SNR for monaural multi-speaker speech separation in noisy environment
In daily listening environments, speech is always distorted by backgroun...
A Mean-field Analysis of Deep ResNet and Beyond: Towards Provable Optimization Via Overparameterization From Depth
Training deep neural networks with stochastic gradient descent (SGD) can...
See More, Know More: Unsupervised Video Object Segmentation with Co-Attention Siamese Networks
We introduce a novel network, called CO-attention Siamese Network (COSNe...
Machine Learning from a Continuous Viewpoint
We present a continuous formulation of machine learning, as a problem in...
On the Generalization Properties of Minimum-norm Solutions for Over-parameterized Neural Network Models
We study the generalization properties of minimum-norm solutions for thr...
Deep Image Deraining Via Intrinsic Rainy Image Priors and Multi-scale Auxiliary Decoding
Different rain models and novel network structures have been proposed to...
Global Convergence of Gradient Descent for Deep Linear Residual Networks
We analyze the global convergence of gradient descent for deep linear re...
Real-Time Correlation Tracking via Joint Model Compression and Transfer
Correlation filters (CF) have received considerable attention in visual ...
Deep Single Image Deraining Via Estimating Transmission and Atmospheric Light in rainy Scenes
Rain removal in images/videos is still an important task in computer vis...
Barron Spaces and the Compositional Function Spaces for Neural Network Models
One of the key issues in the analysis of machine learning models is to i...
Analysis of the Gradient Descent Algorithm for a Deep Neural Network Model with Skip-connections
The behavior of the gradient descent (GD) algorithm is analyzed for a de...
A Comparative Analysis of the Optimization and Generalization Property of Two-layer Neural Network and Random Feature Models Under Gradient Descent Dynamics
A fairly comprehensive analysis is presented for the gradient descent dy...
Unsupervised Deep Tracking
We propose an unsupervised visual tracking method in this paper. Differe...
Target-Aware Deep Tracking
Existing deep trackers mainly use convolutional neural networks pretrai...
Depth-Aware Video Frame Interpolation
Video frame interpolation aims to synthesize nonexistent frames in-betwe...
A Priori Estimates of the Population Risk for Residual Networks
Optimal a priori estimates are derived for the population risk of a regu...
A Priori Estimates of the Generalization Error for Two-layer Neural Networks
New estimates for the generalization error are established for the two-l...
Deep Attentive Tracking via Reciprocative Learning
Visual attention, derived from cognitive neuroscience, facilitates human...
Person-Job Fit: Adapting the Right Talent for the Right Job with Joint Representation Learning
Person-Job Fit is the process of matching the right talent for the right...
EDDI: Efficient Dynamic Discovery of High-Value Information with Partial VAE
Making decisions requires information relevant to the task at hand. Many...
Model Reduction with Memory and the Machine Learning of Dynamical Systems
The well-known Mori-Zwanzig theory tells us that model reduction leads t...
Joint Neural Entity Disambiguation with Output Space Search
In this paper, we present a novel model for entity disambiguation that c...
Variational Implicit Processes
This paper introduces the variational implicit processes (VIPs), a Bayes...
VITAL: VIsual Tracking via Adversarial Learning
The tracking-by-detection framework consists of two stages, i.e., drawin...
Chao Ma
