b'Chao Yu'

research

∙ 09/04/2023

AlphaZero Gomoku

In the past few years, AlphaZero's exceptional capability in mastering i...

0 Wen Liang, et al. ∙

research

∙ 06/27/2023

Automatic Truss Design with Reinforcement Learning

Truss layout design, namely finding a lightweight truss layout satisfyin...

0 Weihua Du, et al. ∙

research

∙ 06/21/2023

SIFTER: A Task-specific Alignment Strategy for Enhancing Sentence Embeddings

The paradigm of pre-training followed by fine-tuning on downstream tasks...

0 Chao Yu, et al. ∙

research

∙ 06/18/2023

Language-Guided Generation of Physically Realistic Robot Motion and Control

We aim to control a robot to physically behave in the real world followi...

0 Shusheng Xu, et al. ∙

research

∙ 06/01/2023

Safe Offline Reinforcement Learning with Real-Time Budget Constraints

Aiming at promoting the safe real-world deployment of Reinforcement Lear...

0 Qian Lin, et al. ∙

research

∙ 04/24/2023

Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey

Reinforcement Learning(RL) has achieved tremendous development in recent...

0 Chao Yu, et al. ∙

research

∙ 02/08/2023

Learning Graph-Enhanced Commander-Executor for Multi-Agent Navigation

This paper investigates the multi-agent navigation problem, which requir...

5 Xinyi Yang, et al. ∙

research

∙ 02/03/2023

Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased

There is a recent trend of applying multi-agent reinforcement learning (...

0 Chao Yu, et al. ∙

research

∙ 01/09/2023

Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration

We consider the problem of cooperative exploration where multiple robots...

0 Chao Yu, et al. ∙

research

∙ 11/28/2022

Causal Deep Reinforcement Learning using Observational Data

Deep reinforcement learning (DRL) requires the collection of plenty of i...

0 Wenxuan Zhu, et al. ∙

research

∙ 10/06/2022

DeltaFS: Pursuing Zero Update Overhead via Metadata-Enabled Delta Compression for Log-structured File System on Mobile Devices

Data compression has been widely adopted to release mobile devices from ...

0 Chao Wu, et al. ∙

research

∙ 07/08/2022

Robust optimal investment and risk control for an insurer with general insider information

In this paper, we study the robust optimal investment and risk control p...

0 Chao Yu, et al. ∙

research

∙ 07/08/2022

Malliavin calculus and its application to robust optimal portfolio for an insider

Insider information and model uncertainty are two unavoidable problems f...

0 Chao Yu, et al. ∙

research

∙ 06/15/2022

Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning

Many advances in cooperative multi-agent reinforcement learning (MARL) a...

0 Wei Fu, et al. ∙

research

∙ 04/03/2022

ESCM^2: Entire Space Counterfactual Multi-Task Model for Post-Click Conversion Rate Estimation

Accurate estimation of post-click conversion rate is critical for buildi...

0 Hao Wang, et al. ∙

research

∙ 04/02/2022

Constrained Sequence-to-Tree Generation for Hierarchical Text Classification

Hierarchical Text Classification (HTC) is a challenging task where a doc...

0 Chao Yu, et al. ∙

research

∙ 12/12/2021

Multi-Agent Vulnerability Discovery for Autonomous Driving with Hazard Arbitration Reward

Discovering hazardous scenarios is crucial in testing and further improv...

0 Weilin Liu, et al. ∙

research

∙ 11/07/2021

Coordinated Proximal Policy Optimization

We present Coordinated Proximal Policy Optimization (CoPPO), an algorith...

0 Zifan Wu, et al. ∙

research

∙ 10/12/2021

Learning Efficient Multi-Agent Cooperative Visual Exploration

We consider the task of visual indoor exploration with multiple agents, ...

0 Chao Yu, et al. ∙

research

∙ 09/11/2021

Quasi-Monte Carlo-Based Conditional Malliavin Method for Continuous-Time Asian Option Greeks

Although many methods for computing the Greeks of discrete-time Asian op...

0 Chao Yu, et al. ∙

research

∙ 05/09/2021

Reinforcement Learning with Expert Trajectory For Quantitative Trading

In recent years, quantitative investment methods combined with artificia...

0 Sihang Chen, et al. ∙

research

∙ 03/02/2021

The Surprising Effectiveness of MAPPO in Cooperative, Multi-Agent Games

Proximal Policy Optimization (PPO) is a popular on-policy reinforcement ...

7 Chao Yu, et al. ∙

research

∙ 01/04/2021

A Joint Training Dual-MRC Framework for Aspect Based Sentiment Analysis

Aspect based sentiment analysis (ABSA) involves three fundamental subtas...

0 Yue Mao, et al. ∙

research

∙ 04/27/2020

Multi-IF : An Approach to Anomaly Detection in Self-Driving Systems

Autonomous driving vehicles (ADVs) are implemented with rich software fu...

0 Kun Cheng, et al. ∙

research

∙ 11/10/2019

Symmetrical Gaussian Error Linear Units (SGELUs)

In this paper, a novel neural network activation function, called Symmet...

0 Chao Yu, et al. ∙

research

∙ 08/22/2019

Reinforcement Learning in Healthcare: A Survey

As a subfield of machine learning, reinforcement learning (RL) aims at e...

0 Chao Yu, et al. ∙

research

∙ 11/10/2018

Learning Shaping Strategies in Human-in-the-loop Interactive Reinforcement Learning

Providing reinforcement learning agents with informationally rich human ...

0 Chao Yu, et al. ∙

research

∙ 11/09/2018

The Price of Governance: A Middle Ground Solution to Coordination in Organizational Control

Achieving coordination is crucial in organizational control. This paper ...

0 Chao Yu, et al. ∙

research

∙ 09/22/2018

DS-SLAM: A Semantic Visual SLAM towards Dynamic Environments

Simultaneous Localization and Mapping (SLAM) is considered to be a funda...

0 Chao Yu, et al. ∙

Chao Yu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro