This paper focuses on reinforcement learning for the regularized robust
...
This paper considers a single-trajectory system identification problem f...
Ultra-fine entity typing (UFET) is the task of inferring the semantic ty...
Learning vectors that capture the meaning of concepts remains a fundamen...
Optimal control is notoriously difficult for stochastic nonlinear system...
This paper studies the finite-time horizon Markov games where the agents...
With a computationally efficient approximation of the second-order
infor...
We study the multi-agent Bayesian optimization (BO) problem, where multi...
Deep latent variable models have achieved significant empirical successe...
This paper considers multi-agent reinforcement learning (MARL) where the...
In real-world decision-making, uncertainty is important yet difficult to...
Motivated by the increasing need of saving search effort by obtaining
re...
This paper is the system description of the DKU-Tencent System for the
V...
Gradient-based methods have been widely used for system design and
optim...
In this paper, we design an information-based multi-robot source seeking...
The multi-armed bandit(MAB) problem is a simple yet powerful framework t...
Cross-silo Federated learning (FL) has become a promising tool in machin...
Confusion is a mental state triggered by cognitive disequilibrium that c...
Pathologists need to combine information from differently stained
pathol...
Automatic speaker verification has achieved remarkable progress in recen...
This paper studies policy optimization algorithms for multi-agent
reinfo...
In order to enhance levels of engagement with conversational systems, ou...
Human-robot studies are expensive to conduct and difficult to control, a...
Decentralized online learning (DOL) has been increasingly researched in ...
In intra coding, Rate Distortion Optimization (RDO) is performed to achi...
This paper describes our speaker diarization system submitted to the
Mul...
Softmax policy gradient is a popular algorithm for policy optimization i...
We consider large scale distributed optimization over a set of edge devi...
Virtual Research Environments (VREs) provide user-centric support in the...
We study the adaptive control of an unknown linear system with a quadrat...
Recently, the attention mechanism such as squeeze-and-excitation module ...
Multi-task optimization (MTO) studies how to simultaneously solve multip...
Human gait is one of important biometric characteristics for human
ident...
Epidemics are a serious public health threat, and the resources for
miti...
With the emerging of 360-degree image/video, augmented reality (AR) and
...
We study the performance of the gradient play algorithm for multi-agent
...
We consider a many-to-one wireless architecture for federated learning a...
With large-scale integration of renewable generation and ubiquitous
dist...
Platelet products are both expensive and have very short shelf lives. As...
While the success of pre-trained language models has largely eliminated ...
Author disambiguation arises when different authors share the same name,...
We consider online convex optimization with time-varying stage costs and...
We study a class of cooperative multi-agent optimization problems, where...
In many multi-agent reinforcement learning applications such as flocking...
Existing approaches for replay and synthetic speech detection still lack...
The rapid development in data collecting devices and computation platfor...
This paper studies the automated control method for regulating air
condi...
This paper considers online optimal control with affine constraints on t...
Nowadays, there are many approaches to acquire three-dimensional (3D) po...
Blood transfusion is one of the most crucial and commonly administered
t...