
Multiple Code Hashing for Efficient Image Retrieval
Due to its low storage cost and fast query speed, hashing has been widel...
ExchNet: A Unified Hashing Network for Large-Scale Fine-Grained Image Retrieval
Retrieving content relevant images from a large-scale fine-grained datas...
Stochastic Normalized Gradient Descent with Momentum for Large Batch Training
Stochastic gradient descent (SGD) and its variants have been the dominat...
TOMA: Topological Map Abstraction for Reinforcement Learning
Animals are able to discover the topological map (graph) of surrounding ...
BASGD: Buffered Asynchronous SGD for Byzantine Learning
Distributed learning has become a hot research topic, due to its wide ap...
Stagewise Enlargement of Batch Size for SGD-based Learning
Existing research shows that the batch size can seriously affect the per...
Weight Normalization based Quantization for Deep Neural Network Compression
With the development of deep neural networks, the size of network models...
ADASS: Adaptive Sample Selection for Training Acceleration
Stochastic gradient descent (SGD) and its variants, including some accele...
Clustered Reinforcement Learning
Exploration strategy design is one of the challenging problems in reinfo...
On the Convergence of Memory-Based Distributed SGD
Distributed stochastic gradient descent (DSGD) has been widely used for ...
Global Momentum Compression for Sparse Communication in Distributed SGD
With the rapid growth of data, distributed stochastic gradient descent (...
Deep Multi-Index Hashing for Person Re-Identification
Traditional person re-identification (ReID) methods typically represent ...
On the Evaluation Metric for Hashing
Due to its low storage cost and fast query speed, hashing has been widel...
Collaborative Self-Attention for Recommender Systems
Recommender systems (RS), which have been an essential part in a wide ra...
Gated Group Self-Attention for Answer Selection
Answer selection (answer ranking) is one of the key steps in many kinds ...
Hashing based Answer Selection
Answer selection is an important subtask of question answering (QA), whe...
Quantized Epoch-SGD for Communication-Efficient Distributed Learning
Due to its efficiency and ease to implement, stochastic gradient descent...
Proximal SCOPE for Distributed Sparse Learning: Better Data Partition Implies Faster Convergence Rate
Distributed sparse learning with a cluster of multiple machines has attr...
Convolutional Geometric Matrix Completion
Geometric matrix completion (GMC) has been proposed for recommendation b...
Feature-Distributed SVRG for High-Dimensional Linear Classification
Linear classification has been widely used in many high-dimensional appl...
Asymmetric Deep Supervised Hashing
Hashing has been widely used for large-scale approximate nearest neighbo...
Full-Time Supervision based Bidirectional RNN for Factoid Question Answering
Recently, bidirectional recurrent neural network (BRNN) has been widely ...
A Proximal Stochastic Quasi-Newton Algorithm
In this paper, we discuss the problem of minimizing the sum of two conve...
SCOPE: Scalable Composite Optimization for Learning on Spark
Many machine learning models, such as logistic regression (LR) and suppo...
Feature Learning based Deep Supervised Hashing with Pairwise Labels
Recent years have witnessed wide application of hashing for large-scale ...
A Parallel algorithm for X-Armed bandits
The target of X-armed bandit problem is to find the global maximum of an...
Fast Asynchronous Parallel Stochastic Gradient Decent
Stochastic gradient descent (SGD) and its variants have become more and ...
Wu-Jun Li
