Deheng Ye
Nanyang Technological University
H-index: 18
Asia-Singapore
Top articles of Deheng Ye
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
More agents is all you need | arXiv preprint arXiv:2402.05120 | Junyou Li Qin Zhang Yangbin Yu Qiang Fu Deheng Ye | 2024/2/3 |
Affordable Generative Agents | arXiv preprint arXiv:2402.02053 | Yangbin Yu Qin Zhang Junyou Li Qiang Fu Deheng Ye | 2024/2/3 |
HGAttack: Transferable Heterogeneous Graph Adversarial Attack | arXiv preprint arXiv:2401.09945 | He Zhao Zhiwei Zeng Yongwei Wang Deheng Ye Chunyan Miao | 2024/1/18 |
Mutual-Information Regularized Multi-Agent Policy Iteration | Advances in Neural Information Processing Systems | Deheng Ye Zongqing Lu | 2024/2/13 |
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks | Advances in Neural Information Processing Systems | Yun Qu Boyuan Wang Jianzhun Shao Yuhang Jiang Chen Chen | 2024/2/13 |
A survey on transformers in reinforcement learning | arXiv preprint arXiv:2301.03044 | Wenzhe Li Hao Luo Zichuan Lin Chongjie Zhang Zongqing Lu | 2023/1/8 |
Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization | IEEE Transactions on Neural Networks and Learning Systems | Tiantian Zhang Zichuan Lin Yuxing Wang Deheng Ye Qiang Fu | 2023/6/7 |
Replay-enhanced Continual Reinforcement Learning | Transactions on Machine Learning Research | Tiantian Zhang Kevin Zehua Shen Zichuan Lin Bo Yuan Xueqian Wang | 2023/11/20 |
SeeHow: Workflow Extraction from Programming Screencasts through Action-Aware Video Analytics | Dehai Zhao Zhenchang Xing Xin Xia Deheng Ye Xiwei Xu | 2023/5/14 | |
Llm-based agent society investigation: Collaboration and confrontation in avalon gameplay | arXiv preprint arXiv:2310.14985 | Yihuai Lan Zhiqiang Hu Lei Wang Yang Wang Deheng Ye | 2023/10/23 |
Deploying Offline Reinforcement Learning with Human Feedback | arXiv preprint arXiv:2303.07046 | Ziniu Li Ke Xu Liu Liu Lanqing Li Deheng Ye | 2023/3/13 |
Rltf: Reinforcement learning from unit test feedback | arXiv preprint arXiv:2307.04349 | Jiate Liu Yiqin Zhu Kaiwen Xiao Qiang Fu Xiao Han | 2023/7/10 |
Sample dropout: A simple yet effective variance reduction technique in deep policy optimization | arXiv preprint arXiv:2302.02299 | Zichuan Lin Xiapeng Wu Mingfei Sun Deheng Ye Qiang Fu | 2023/2/5 |
Future-conditioned unsupervised pretraining for decision transformer | Zhihui Xie Zichuan Lin Deheng Ye Qiang Fu Yang Wei | 2023/7/3 | |
HoK3v3: an Environment for Generalization in Heterogeneous Multi-agent Reinforcement Learning | Lin Liu Jianzhun Shao Xinkai Chen Yun Qu Boyuan Wang | 2023/12/12 | |
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning | arXiv preprint arXiv:2301.08442 | Haoxuan Pan Deheng Ye Xiaoming Duan Qiang Fu Wei Yang | 2023/1/20 |
RLogist: fast observation strategy on whole-slide images with deep reinforcement learning | Proceedings of the AAAI Conference on Artificial Intelligence | Boxuan Zhao Jun Zhang Deheng Ye Jian Cao Xiao Han | 2023/6/26 |
Master–Slave Deep Architecture for Top- Multiarmed Bandits With Nonlinear Bandit Feedback and Diversity Constraints | IEEE Transactions on Neural Networks and Learning Systems | Hanchi Huang Li Shen Deheng Ye Wei Liu | 2023/11/24 |
More centralized training, still decentralized execution: Multi-agent conditional policy factorization | arXiv preprint arXiv:2209.12681 | Jiangxing Wang Deheng Ye Zongqing Lu | 2022/9/26 |
Revisiting discrete soft actor-critic | arXiv preprint arXiv:2209.10081 | Haibin Zhou Zichuan Lin Junyou Li Qiang Fu Wei Yang | 2022/9/21 |