Mingfei Sun at University of Oxford

University	University of Oxford
Citations (all)	818
Citations (since 2020)	797
Cited by	94
h-index (all)	13
h-index (since 2020)	12
i10-index (all)	16
i10-index (since 2020)	15

FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers’ Preference Elicitation

2024/3/18

Xin Liang

H-Index: 1

Mingfei Sun

H-Index: 5

TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions

arXiv preprint arXiv:2403.01977

2024/3/4

Mingfei Sun

H-Index: 5

Mengmi Zhang

H-Index: 5

Wei Pan

H-Index: 34

SMACv2: An improved benchmark for cooperative multi-agent reinforcement learning

Advances in Neural Information Processing Systems

2024/2/13

Jonathan Cook

H-Index: 16

Mingfei Sun

H-Index: 5

Anuj Mahajan

H-Index: 6

Shimon Whiteson

H-Index: 39

Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation

2023/11/20

Massimiliano Patacchiola

H-Index: 9

Mingfei Sun

H-Index: 5

Richard E Turner

H-Index: 33

Modeling adaptive expression of robot learning engagement and exploring its effects on human teachers

ACM Transactions on Computer-Human Interaction

2023/9/23

Shuai Ma

H-Index: 3

Mingfei Sun

H-Index: 5

Imitating human behaviour with diffusion models

arXiv preprint arXiv:2301.10677

2023/1/25

Tim Pearce

H-Index: 6

Tabish Rashid

H-Index: 7

Mingfei Sun

H-Index: 5

Trust Region Bounds for Decentralized PPO Under Non-stationarity

arXiv preprint arXiv:2202.00082

2022/1/31

Mingfei Sun

H-Index: 5

Jacob Beck

H-Index: 10

Shimon Whiteson

H-Index: 39

Towards flexible inference in sequential decision problems via bidirectional transformers

arXiv preprint arXiv:2204.13326

2022/4/28

Micah Carroll

H-Index: 1

Jessy Lin

H-Index: 2

Orr Paradise

H-Index: 2

Mingfei Sun

H-Index: 5

Stephanie Milani

H-Index: 4

How humans perceive human-like behavior in video game navigation

2022/4/27

Stephanie Milani

H-Index: 4

Mingfei Sun

H-Index: 5

You may not need ratio clipping in PPO

arXiv preprint arXiv:2202.00079

2022/1/31

Mingfei Sun

H-Index: 5

Vitaly Kurin

H-Index: 8

Guoqing Liu

H-Index: 9

Tao Qin

H-Index: 1

Shimon Whiteson

H-Index: 39

Generalization in cooperative multi-agent systems

arXiv preprint arXiv:2202.00104

2022/1/31

Anuj Mahajan

H-Index: 6

Tarun Gupta

H-Index: 2

Mingfei Sun

H-Index: 5

Tim Rocktäschel

H-Index: 27

Shimon Whiteson

H-Index: 39

Trust-Region-Free Policy Optimization for Stochastic Policies

arXiv preprint arXiv:2302.07985

2023/2/15

Mingfei Sun

H-Index: 5

Anuj Mahajan

H-Index: 6

Shimon Whiteson

H-Index: 39

Uni[MASK]: Unified Inference in Sequential Decision Problems

2022/11/20

Micah Carroll

H-Index: 1

Orr Paradise

H-Index: 2

Jessy Lin

H-Index: 2

Mingfei Sun

H-Index: 5

Stephanie Milani

H-Index: 4

Investigating the Effects of Robot Engagement Communication on Learning from Demonstration

International Journal of Social Robotics

2022

Mingfei Sun

H-Index: 5

Meng Xia

H-Index: 16

Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency

Proceedings of the AAAI Conference on Artificial Intelligence

2022/6/28

Mingfei Sun

H-Index: 5

Shimon Whiteson

H-Index: 39

CrowdPatrol: A mobile crowdsensing framework for traffic violation hotspot patrolling

IEEE Transactions on Mobile Computing

2021/9/8

Binbin Zhou

H-Index: 14

Mingfei Sun

H-Index: 5

Xiaoliang Fan

H-Index: 3

Cheng Wang

H-Index: 0

Meta-reinforcement learning for mastering multiple skills and generalizing across environments in text-based games

2021/7/27

Zhenjie Zhao

H-Index: 5

Mingfei Sun

H-Index: 5

SoftDICE for imitation learning: Rethinking off-policy distribution matching

arXiv preprint arXiv:2106.03155

2021/6/6

Mingfei Sun

H-Index: 5

Anuj Mahajan

H-Index: 6

Shimon Whiteson

H-Index: 39

Investigating Ratio Clipping in Multi-agent Reinforcement Learning

2021

Mingfei Sun

H-Index: 5

Shimon Whiteson

H-Index: 39

Is independent learning all you need in the StarCraft multi-agent challenge?

arXiv preprint arXiv:2011.09533

2020/11/18

Mingfei Sun

H-Index: 5

Shimon Whiteson

H-Index: 39

