Mingfei Sun
University of Oxford
H-index: 13
Europe-United Kingdom
Top articles of Mingfei Sun
FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers’ Preference Elicitation
2024/3/18
Xin Liang
H-Index: 1
Mingfei Sun
H-Index: 5
TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
arXiv preprint arXiv:2403.01977
2024/3/4
Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning
Advances in Neural Information Processing Systems
2024/2/13
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
2023/11/20
Modeling adaptive expression of robot learning engagement and exploring its effects on human teachers
ACM Transactions on Computer-Human Interaction
2023/9/23
Shuai Ma
H-Index: 3
Mingfei Sun
H-Index: 5
Imitating human behaviour with diffusion models
arXiv preprint arXiv:2301.10677
2023/1/25
Trust Region Bounds for Decentralized PPO Under Non-stationarity
arXiv preprint arXiv:2202.00082
2022/1/31
Towards flexible inference in sequential decision problems via bidirectional transformers
arXiv preprint arXiv:2204.13326
2022/4/28
How humans perceive human-like behavior in video game navigation
2022/4/27
Stephanie Milani
H-Index: 4
Mingfei Sun
H-Index: 5
You may not need ratio clipping in ppo
arXiv preprint arXiv:2202.00079
2022/1/31
Generalization in cooperative multi-agent systems
arXiv preprint arXiv:2202.00104
2022/1/31
Trust-Region-Free Policy Optimization for Stochastic Policies
arXiv preprint arXiv:2302.07985
2023/2/15
Uni[MASK]: Unified Inference in Sequential Decision Problems
2022/11/20
Investigating the Effects of Robot Engagement Communication on Learning from Demonstration
International Journal of Social Robotics
2022
Mingfei Sun
H-Index: 5
Meng Xia
H-Index: 16
Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
Proceedings of the AAAI Conference on Artificial Intelligence
2022/6/28
Mingfei Sun
H-Index: 5
Shimon Whiteson
H-Index: 39
CrowdPatrol: A mobile crowdsensing framework for traffic violation hotspot patrolling
IEEE Transactions on Mobile Computing
2021/9/8
Meta-reinforcement learning for mastering multiple skills and generalizing across environments in text-based games
2021/7/27
Zhenjie Zhao
H-Index: 5
Mingfei Sun
H-Index: 5
Softdice for imitation learning: Rethinking off-policy distribution matching
arXiv preprint arXiv:2106.03155
2021/6/6
Investigating Ratio Clipping in Multi-agent Reinforcement Learning
2021
Mingfei Sun
H-Index: 5
Shimon Whiteson
H-Index: 39
Is independent learning all you need in the starcraft multi-agent challenge?
arXiv preprint arXiv:2011.09533
2020/11/18
Mingfei Sun
H-Index: 5
Shimon Whiteson
H-Index: 39