Jun Wang
University College London
H-index: 62
Europe-United Kingdom
Top articles of Jun Wang
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning | arXiv preprint arXiv:2402.17453 | Siyuan Guo Cheng Deng Ying Wen Hechang Chen Yi Chang | 2024/2/27 |
Natural Language Reinforcement Learning | arXiv preprint arXiv:2402.07157 | Xidong Feng Ziyu Wan Mengyue Yang Ziyan Wang Girish A Koushiks | 2024/2/11 |
Entropy-Regularized Token-Level Policy Optimization for Large Language Models | arXiv preprint arXiv:2402.06700 | Muning Wen Cheng Deng Jun Wang Weinan Zhang Ying Wen | 2024/2/9 |
A survey on algorithms for Nash equilibria in finite normal-form games | Hanyu Li Wenhan Huang Zhijian Duan David Henry Mguni Kun Shao | 2024/2/1 | |
Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training | NeurIPS2023 FMDM workshop | Xidong Feng* Ziyu Wan* Muning Wen Ying Wen Weinan Zhang | 2023/9/29 |
Token-level Direct Preference Optimization | International Conference on Machine Learning (ICML 2024) | Yongcheng Zeng Guoqing Liu Weiyu Ma Ning Yang Haifeng Zhang | 2024/4/18 |
On the complexity of computing markov perfect equilibrium in general-sum stochastic games | National Science Review | Xiaotie Deng Ningyuan Li David Mguni Jun Wang Yaodong Yang | 2023/1 |
Self-Supervised MAFENN for Classifying Low-labeled Distorted Images over Mobile Fading Channels | IEEE Transactions on Mobile Computing | Yang Li Fanglei Sun Jingchen Hu Chang Liu Fan Wu | 2023/12/19 |
Multi-embodiment Legged Robot Control as a Sequence Modeling Problem | Chen Yu Weinan Zhang Hang Lai Zheng Tian Laurent Kneip | 2023/5/29 | |
Debiased recommendation with user feature balancing | ACM TOIS: ACM Transactions on Information Systems | Mengyue Yang Guohao Cai Jiarui Jin Zhenhua Dong Xiuqiang He | 2022/1/16 |
An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination | Xue Yan Jiaxian Guo Xingzhou Lou Jun Wang Haifeng Zhang | 2023/8 | |
Online PCA in Converging Self-consistent Field Equations | Advances in Neural Information Processing Systems | Xihan Li Xiang Chen Rasul Tutunov Haitham Bou Ammar Lei Wang | 2024/2/13 |
Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach | arXiv preprint arXiv:2312.11865 | Weiyu Ma Qirui Mi Xue Yan Yuqiao Wu Runji Lin | 2023/12/19 |
MANSA: Learning Fast and Slow in Multi-Agent Systems | David Henry Mguni Haojun Chen Taher Jafferjee Jianhong Wang Longfei Yue | 2023/7/3 | |
Offline Pre-trained Multi-agent Decision Transformer | arXiv preprint arXiv:2112.02845 | Linghui Meng Muning Wen Yaodong Yang Chenyang Le Xiyun Li | 2021/12/6 |
Rectifying unfairness in recommendation feedback loop | SIGIR 2023: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval | Mengyue Yang Jun Wang Jean-Francois Ton | 2023 |
Invariant Learning via Probability of Sufficient and Necessary Causes | Advances in Neural Information Processing Systems | Mengyue Yang Yonggang Zhang Zhen Fang Yali Du Furui Liu | 2024/2/13 |
Large sequence models for sequential decision-making: a survey | Muning Wen Runji Lin Hanjing Wang Yaodong Yang Ying Wen | 2023/12 | |
A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems | Oliver Slumbers David Henry Mguni Stefano B Blumberg Stephen Marcus McAleer Yaodong Yang | 2023/6/15 | |
GEO: A Computational Design Framework for Automotive Exterior Facelift | ACM Transactions on Knowledge Discovery from Data | Jingmin Huang Bowei Chen Zhi Yan Iadh Ounis Jun Wang | 2023/3/1 |