We give a O(n) time sampler for independent sets of a matroid
with n ele...
The human brain can easily focus on one speaker and suppress others in
s...
k-Clustering in ℝ^d (e.g., k-median and k-means) is a
fundamental machin...
Back Translation (BT) is widely used in the field of machine translation...
With strong capabilities of reasoning and a generic understanding of the...
We characterize the uniqueness condition in the hardcore model for bipar...
Image forgery localization aims to identify forged regions by capturing
...
One of the key challenges in deploying RL to real-world applications is ...
Generalization in Reinforcement Learning (RL) aims to learn an agent dur...
Fog manufacturing can greatly enhance traditional manufacturing systems
...
We give a near-linear time sampler for the Gibbs distribution of the
fer...
We study human-in-the-loop reinforcement learning (RL) with trajectory
p...
Partially Observable Markov Decision Process (POMDP) provides a principl...
We prove an optimal Ω(n^-1) lower bound for modified log-Sobolev
(MLS) c...
Few-shot learning is an established topic in natural images for years, b...
We prove an optimal O(n log n) mixing time of the Glauber dynamics for t...
Pre-training lays the foundation for recent successes in radiograph anal...
A growing trend for value-based reinforcement learning (RL) algorithms i...
In this paper, we present WenetSpeech, a multi-domain Mandarin corpus
co...
Although model-based reinforcement learning (RL) approaches are consider...
Reinforcement learning encounters many challenges when applied directly ...
The framework of deep reinforcement learning (DRL) provides a powerful a...
Variational autoencoders (VAEs), as an important aspect of generative mo...
Ensemble reinforcement learning (RL) aims to mitigate instability in
Q-l...
The maximum independent set problem is one of the most important problem...
Reducing the shortage of organ donations to meet the demands of patients...
Motivated by the recent discovery that the interpretation maps of CNNs c...
The unified streaming and non-streaming two-pass (U2) end-to-end model f...
We prove an optimal Ω(n^-1) lower bound on the spectral gap of
Glauber d...
This paper studies representation learning for multi-task linear bandits...
In this paper, we present a new open source, production first and produc...
Point cloud segmentation is a fundamental visual understanding task in 3...
Relation extraction is the task of identifying relation instance between...
Reinforcement learning (RL) in episodic, factored Markov decision proces...
Humans integrate multiple sensory modalities (e.g. visual and audio) to ...
In this paper, we study Combinatorial Semi-Bandits (CSB) that is an exte...
Information visualization significantly enhances human perception by
gra...
At present, supervised stereo methods based on deep neural network have
...
In this paper, we present a convolution neural network based method to
r...
We study the communication complexity of distributed multi-armed bandits...
Transmission control protocol (TCP) congestion control is one of the key...
Pixel-level semantic segmentation is a challenging task with a huge amou...
A fundamental question in reinforcement learning is whether model-free
a...
Many computer vision applications, such as object recognition and
segmen...
We study the data space D of any given data set X and explain how
functi...
We propose an approach to generate geometric theorems from electronic im...
Electronic Geometry Textbook is a knowledge management system that manag...