Mingfei Sun

research

∙ 06/23/2023

Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation

In this paper we explore few-shot imitation learning for control problem...

0 Massimiliano Patacchiola, et al. ∙

research

∙ 02/15/2023

Trust-Region-Free Policy Optimization for Stochastic Policies

Trust Region Policy Optimization (TRPO) is an iterative method that simu...

0 Mingfei Sun, et al. ∙

research

∙ 02/05/2023

Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization

Recent success in Deep Reinforcement Learning (DRL) methods has shown th...

0 Zichuan Lin, et al. ∙

research

∙ 01/25/2023

Imitating Human Behaviour with Diffusion Models

Diffusion models have emerged as powerful generative models in the text-...

0 Tim Pearce, et al. ∙

research

∙ 01/20/2023

Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning

We revisit the estimation bias in policy gradients for the discounted ep...

0 Haoxuan Pan, et al. ∙

research

∙ 12/14/2022

SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning

The availability of challenging benchmarks has played a key role in the ...

0 Benjamin Ellis, et al. ∙

research

∙ 11/20/2022

UniMASK: Unified Inference in Sequential Decision Problems

Randomly masking and predicting word tokens has been a successful approa...

0 Micah Carroll, et al. ∙

research

∙ 04/28/2022

Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers

Randomly masking and predicting word tokens has been a successful approa...

2 Micah Carroll, et al. ∙

research

∙ 01/31/2022

Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO

We present a new monotonic improvement guarantee for optimizing decentra...

0 Mingfei Sun, et al. ∙

research

∙ 01/31/2022

You May Not Need Ratio Clipping in PPO

Proximal Policy Optimization (PPO) methods learn a policy by iteratively...

0 Mingfei Sun, et al. ∙

research

∙ 12/14/2021

Birds Eye View Social Distancing Analysis System

Social distancing can reduce the infection rates in respiratory pandemic...

0 Zhengye Yang, et al. ∙

research

∙ 12/11/2021

Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency

Sample efficiency is crucial for imitation learning methods to be applic...

0 Mingfei Sun, et al. ∙

research

∙ 06/06/2021

SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching

We present SoftDICE, which achieves state-of-the-art performance for imi...

0 Mingfei Sun, et al. ∙

research

∙ 11/25/2020

Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings

We present JueWu-SL, the first supervised-learning-based artificial inte...

6 Deheng Ye, et al. ∙

research

∙ 05/03/2020

Investigating the Effects of Robot Engagement Communication on Learning from Demonstration

Robot Learning from Demonstration (RLfD) is a technique for robots to de...

0 Mingfei Sun, et al. ∙

research

∙ 05/29/2019

Adversarial Imitation Learning from Incomplete Demonstrations

Imitation learning targets deriving a mapping from states to actions, a....

0 Mingfei Sun, et al. ∙

research

∙ 04/20/2019

Estimating Emotional Intensity from Body Poses for Human-Robot Interaction

Equipping social and service robots with the ability to perceive human e...

0 Mingfei Sun, et al. ∙

Mingfei Sun

Featured Co-authors

Sign in with Google

Consider DeepAI Pro