Mingfei Sun

Mingfei Sun

University of Oxford

H-index: 13

Europe-United Kingdom

About Mingfei Sun

Mingfei Sun, With an exceptional h-index of 13 and a recent h-index of 12 (since 2020), a distinguished researcher at University of Oxford, specializes in the field of Reinforcement Learning, Generative Models, Human-Robot Interaction.

His recent articles reflect a diverse array of research interests and contributions to the field:

FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers’ Preference Elicitation

TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions

Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning

Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation

Modeling adaptive expression of robot learning engagement and exploring its effects on human teachers

Imitating human behaviour with diffusion models

Trust Region Bounds for Decentralized PPO Under Non-stationarity

Towards flexible inference in sequential decision problems via bidirectional transformers

Mingfei Sun Information

University

Position

___

Citations(all)

818

Citations(since 2020)

797

Cited By

94

hIndex(all)

13

hIndex(since 2020)

12

i10Index(all)

16

i10Index(since 2020)

15

Email

University Profile Page

Google Scholar

Mingfei Sun Skills & Research Interests

Reinforcement Learning

Generative Models

Human-Robot Interaction

Top articles of Mingfei Sun

FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers’ Preference Elicitation

2024/3/18

Xin Liang
Xin Liang

H-Index: 1

Mingfei Sun
Mingfei Sun

H-Index: 5

TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions

arXiv preprint arXiv:2403.01977

2024/3/4

Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning

Advances in Neural Information Processing Systems

2024/2/13

Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation

2023/11/20

Modeling adaptive expression of robot learning engagement and exploring its effects on human teachers

ACM Transactions on Computer-Human Interaction

2023/9/23

Shuai Ma
Shuai Ma

H-Index: 3

Mingfei Sun
Mingfei Sun

H-Index: 5

Imitating human behaviour with diffusion models

arXiv preprint arXiv:2301.10677

2023/1/25

Trust Region Bounds for Decentralized PPO Under Non-stationarity

arXiv preprint arXiv:2202.00082

2022/1/31

Towards flexible inference in sequential decision problems via bidirectional transformers

arXiv preprint arXiv:2204.13326

2022/4/28

How humans perceive human-like behavior in video game navigation

2022/4/27

Stephanie Milani
Stephanie Milani

H-Index: 4

Mingfei Sun
Mingfei Sun

H-Index: 5

You may not need ratio clipping in ppo

arXiv preprint arXiv:2202.00079

2022/1/31

Generalization in cooperative multi-agent systems

arXiv preprint arXiv:2202.00104

2022/1/31

Trust-Region-Free Policy Optimization for Stochastic Policies

arXiv preprint arXiv:2302.07985

2023/2/15

Uni[MASK]: Unified Inference in Sequential Decision Problems

2022/11/20

Investigating the Effects of Robot Engagement Communication on Learning from Demonstration

International Journal of Social Robotics

2022

Mingfei Sun
Mingfei Sun

H-Index: 5

Meng Xia
Meng Xia

H-Index: 16

Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency

Proceedings of the AAAI Conference on Artificial Intelligence

2022/6/28

Mingfei Sun
Mingfei Sun

H-Index: 5

Shimon Whiteson
Shimon Whiteson

H-Index: 39

CrowdPatrol: A mobile crowdsensing framework for traffic violation hotspot patrolling

IEEE Transactions on Mobile Computing

2021/9/8

Meta-reinforcement learning for mastering multiple skills and generalizing across environments in text-based games

2021/7/27

Zhenjie Zhao
Zhenjie Zhao

H-Index: 5

Mingfei Sun
Mingfei Sun

H-Index: 5

Softdice for imitation learning: Rethinking off-policy distribution matching

arXiv preprint arXiv:2106.03155

2021/6/6

Investigating Ratio Clipping in Multi-agent Reinforcement Learning

2021

Mingfei Sun
Mingfei Sun

H-Index: 5

Shimon Whiteson
Shimon Whiteson

H-Index: 39

Is independent learning all you need in the starcraft multi-agent challenge?

arXiv preprint arXiv:2011.09533

2020/11/18

Mingfei Sun
Mingfei Sun

H-Index: 5

Shimon Whiteson
Shimon Whiteson

H-Index: 39

See List of Professors in Mingfei Sun University(University of Oxford)

Co-Authors

academic-engine