
Improved ZerothOrder Variance Reduced Algorithms and Analysis for Nonconvex Optimization
Two types of zerothorder stochastic algorithms have recently been desig...
read it

Faster Stochastic Algorithms via HistoryGradient Aided Batch Size Adaptation
Various schemes for adapting batch size have been recently proposed to a...
read it

SGD Converges to Global Minimum in Deep Learning via Starconvex Path
Stochastic gradient descent (SGD) has been found to be surprisingly effe...
read it

DRGAN: Conditional Generative Adversarial Network for FineGrained Lesion Synthesis on Diabetic Retinopathy Images
Diabetic retinopathy (DR) is a complication of diabetes that severely af...
read it

Elastic Neural Networks for Classification
In this work we propose a framework for improving the performance of any...
read it

AEDNet: An Abnormal Event Detection Network
It is challenging to detect the anomaly in crowded scenes for quite a lo...
read it

Iterative Normalization: Beyond Standardization towards Efficient Whitening
Batch Normalization (BN) is ubiquitously employed for accelerating neura...
read it

On the Continuity of Rotation Representations in Neural Networks
In neural networks, it is often desirable to work with various represent...
read it

SpiderBoost: A Class of Faster Variancereduced Algorithms for Nonconvex Optimization
There has been extensive research on developing stochastic variance redu...
read it

Hybrid coarsefine classification for head pose estimation
Head pose estimation, which computes the intrinsic Euler angles (yaw, pi...
read it

KDSL: a KnowledgeDriven Supervised Learning Framework for Word Sense Disambiguation
We propose KDSL, a new word sense disambiguation (WSD) framework that ut...
read it

SemiDense 3D Reconstruction with a Stereo Event Camera
Event cameras are bioinspired sensors that offer several advantages, su...
read it

Cubic Regularization with Momentum for Nonconvex Optimization
Momentum is a popular technique to accelerate the convergence in practic...
read it

Convergence of Cubic Regularization for Nonconvex Optimization under KL Property
Cubicregularized Newton's method (CR) is a popular algorithm that guara...
read it

Toward Understanding the Impact of Staleness in Distributed Machine Learning
Many distributed machine learning (ML) systems adopt the nonsynchronous...
read it

MotionAttentive Transition for ZeroShot Video Object Segmentation
In this paper, we present a novel MotionAttentive Transition Network (M...
read it

Critical Points of Neural Networks: Analytical Forms and Landscape Properties
Due to the success of deep learning to solving a variety of challenging ...
read it

Characterization of Gradient Dominance and Regularity Conditions for Neural Networks
The past decade has witnessed a successful application of deep learning ...
read it

Combining tabu search and graph reduction to solve the maximum balanced biclique problem
The Maximum Balanced Biclique Problem is a wellknown graph model with r...
read it

Structured Production System (extended abstract)
In this extended abstract, we propose Structured Production Systems (SPS...
read it

From FirstOrder Logic to Assertional Logic
FirstOrder Logic (FOL) is widely regarded as one of the most important ...
read it

Conditional Accelerated Lazy Stochastic Gradient Descent
In this work we introduce a conditional accelerated lazy stochastic grad...
read it

A Set Theoretic Approach for Knowledge Representation: the Representation Part
In this paper, we propose a set theoretic approach for knowledge represe...
read it

SemiDense Visual Odometry for RGBD Cameras Using Approximate Nearest Neighbour Fields
This paper presents a robust and efficient semidense visual odometry so...
read it

Reshaped Wirtinger Flow and Incremental Algorithm for Solving Quadratic System of Equations
We study the phase retrieval problem, which solves quadratic system of e...
read it

A Logical Study of Partial Entailment
We introduce a novel logical notionpartial entailmentto propositiona...
read it

Majority Rule for Belief Evolution in Social Networks
In this paper, we study how an agent's belief is affected by her neighbo...
read it

An optimal randomized incremental gradient method
In this paper, we consider a class of finitesum convex optimization pro...
read it

DAP3DNet: Where, What and How Actions Occur in Videos?
Action parsing in videos with complex scenes is an interesting but chall...
read it

Generalization Error Bounds with Probabilistic Guarantee for SGD in Nonconvex Optimization
The success of deep learning has led to a rising interest in the general...
read it

Sample Complexity of Stochastic VarianceReduced Cubic Regularization for Nonconvex Optimization
The popular cubic regularization (CR) method converges with first and s...
read it

Random gradient extrapolation for distributed and stochastic optimization
In this paper, we consider a class of finitesum convex optimization pro...
read it

Convergence of SGD in Learning ReLU Models with Separable Data
We consider the binary classification problem in which the objective fun...
read it

SingleView Hair Reconstruction using Convolutional Neural Networks
We introduce a deep learningbased method to generate full 3D hair geome...
read it

A Note on Inexact Condition for Cubic Regularized Newton's Method
This note considers the inexact cubicregularized Newton's method (CR), ...
read it

Asynchronous decentralized accelerated stochastic gradient descent
In this work, we introduce an asynchronous decentralized accelerated sto...
read it

MRGAN: Manifold Regularized Generative Adversarial Networks
Despite the growing interest in generative adversarial networks (GANs), ...
read it

Momentum Schemes with Stochastic Variance Reduction for Nonconvex Composite Optimization
Two new stochastic variancereduced algorithms named SARAH and SPIDER ha...
read it

The square root rule for adaptive importance sampling
In adaptive importance sampling, and other contexts, we have unbiased an...
read it

HairNet: SingleView Hair Reconstruction using Convolutional Neural Networks
We introduce a deep learningbased method to generate full 3D hair geome...
read it

A unified variancereduced accelerated gradient method for convex optimization
We propose a novel randomized incremental gradient algorithm, namely, VA...
read it

Towards Federated Graph Learning for Collaborative Financial Crimes Detection
Financial crime is a large and growing problem, in some way touching alm...
read it

Distributed SGD Generalizes Well Under Asynchrony
The performance of fully synchronized distributed systems has faced a bo...
read it

Reanalysis of Variance Reduced Temporal Difference Learning
Temporal difference (TD) learning is a popular algorithm for policy eval...
read it

Chinese Named Entity Recognition Augmented with Lexicon Memory
Inspired by a concept of contentaddressable retrieval from cognitive sc...
read it

HybridAlpha: An Efficient Approach for PrivacyPreserving Federated Learning
Federated learning has emerged as a promising approach for collaborative...
read it

Proximal Gradient Algorithm with Momentum and Flexible Parameter Restart for Nonconvex Optimization
Various types of parameter restart schemes have been proposed for accele...
read it

GFTE: Graphbased Financial Table Extraction
Tabular data is a crucial form of information expression, which can orga...
read it

TiFL: A Tierbased Federated Learning System
Federated Learning (FL) enables learning a shared model across many clie...
read it

An Investigation into the Stochasticity of Batch Whitening
Batch Normalization (BN) is extensively employed in various network arch...
read it
Yi Zhou
is this you? claim profile
Assistant Researcher at Fudan University School of Mathematical Sciences, Professor of School of Mathematical Sciences, Fudan University