Jun Wang

University College London

H-index: 62

Europe-United Kingdom

About Jun Wang

Jun Wang is a researcher at University College London with an h-index of 62 and a recent h-index of 51 (since 2020). He specializes in Machine Learning, Multi-agent Learning, Information Retrieval, Recommender Systems, and Computational Advertising.

His recent articles include:

DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning

Natural Language Reinforcement Learning

Entropy-Regularized Token-Level Policy Optimization for Large Language Models

A survey on algorithms for Nash equilibria in finite normal-form games

Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training

Token-level Direct Preference Optimization

On the complexity of computing Markov perfect equilibrium in general-sum stochastic games

Self-Supervised MAFENN for Classifying Low-labeled Distorted Images over Mobile Fading Channels

Jun Wang Information

University	University College London
Position	Professor of Computer Science
Citations(all)	19636
Citations(since 2020)	13977
Cited By	10389
hIndex(all)	62
hIndex(since 2020)	51
i10Index(all)	153
i10Index(since 2020)	120

Jun Wang Skills & Research Interests

Machine Learning

Multi-agent Learning

Information Retrieval

Recommender Systems

Computational Advertising

Top articles of Jun Wang

Title	Journal	Author(s)	Publication Date
DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning	arXiv preprint arXiv:2402.17453	Siyuan Guo Cheng Deng Ying Wen Hechang Chen Yi Chang ...	2024/2/27
Natural Language Reinforcement Learning	arXiv preprint arXiv:2402.07157	Xidong Feng Ziyu Wan Mengyue Yang Ziyan Wang Girish A Koushiks ...	2024/2/11
Entropy-Regularized Token-Level Policy Optimization for Large Language Models	arXiv preprint arXiv:2402.06700	Muning Wen Cheng Deng Jun Wang Weinan Zhang Ying Wen	2024/2/9
A survey on algorithms for Nash equilibria in finite normal-form games		Hanyu Li Wenhan Huang Zhijian Duan David Henry Mguni Kun Shao ...	2024/2/1
Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training	NeurIPS2023 FMDM workshop	Xidong Feng* Ziyu Wan* Muning Wen Ying Wen Weinan Zhang ...	2023/9/29
Token-level Direct Preference Optimization	International Conference on Machine Learning (ICML 2024)	Yongcheng Zeng Guoqing Liu Weiyu Ma Ning Yang Haifeng Zhang ...	2024/4/18
On the complexity of computing Markov perfect equilibrium in general-sum stochastic games	National Science Review	Xiaotie Deng Ningyuan Li David Mguni Jun Wang Yaodong Yang	2023/1
Self-Supervised MAFENN for Classifying Low-labeled Distorted Images over Mobile Fading Channels	IEEE Transactions on Mobile Computing	Yang Li Fanglei Sun Jingchen Hu Chang Liu Fan Wu ...	2023/12/19
Multi-embodiment Legged Robot Control as a Sequence Modeling Problem		Chen Yu Weinan Zhang Hang Lai Zheng Tian Laurent Kneip ...	2023/5/29
Debiased recommendation with user feature balancing	ACM TOIS: ACM Transactions on Information Systems	Mengyue Yang Guohao Cai Jiarui Jin Zhenhua Dong Xiuqiang He ...	2022/1/16
An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination		Xue Yan Jiaxian Guo Xingzhou Lou Jun Wang Haifeng Zhang ...	2023/8
Online PCA in Converging Self-consistent Field Equations	Advances in Neural Information Processing Systems	Xihan Li Xiang Chen Rasul Tutunov Haitham Bou Ammar Lei Wang ...	2024/2/13
Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach	arXiv preprint arXiv:2312.11865	Weiyu Ma Qirui Mi Xue Yan Yuqiao Wu Runji Lin ...	2023/12/19
MANSA: Learning Fast and Slow in Multi-Agent Systems		David Henry Mguni Haojun Chen Taher Jafferjee Jianhong Wang Longfei Yue ...	2023/7/3
Offline Pre-trained Multi-agent Decision Transformer	arXiv preprint arXiv:2112.02845	Linghui Meng Muning Wen Yaodong Yang Chenyang Le Xiyun Li ...	2021/12/6
Rectifying unfairness in recommendation feedback loop	SIGIR 2023: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval	Mengyue Yang Jun Wang Jean-Francois Ton	2023
Invariant Learning via Probability of Sufficient and Necessary Causes	Advances in Neural Information Processing Systems	Mengyue Yang Yonggang Zhang Zhen Fang Yali Du Furui Liu ...	2024/2/13
Large sequence models for sequential decision-making: a survey		Muning Wen Runji Lin Hanjing Wang Yaodong Yang Ying Wen ...	2023/12
A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems		Oliver Slumbers David Henry Mguni Stefano B Blumberg Stephen Marcus McAleer Yaodong Yang ...	2023/6/15
GEO: A Computational Design Framework for Automotive Exterior Facelift	ACM Transactions on Knowledge Discovery from Data	Jingmin Huang Bowei Chen Zhi Yan Iadh Ounis Jun Wang	2023/3/1