Dhawal Gupta
University of Alberta
H-index: 6
North America-Canada
Top articles of Dhawal Gupta
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
From Past to Future: Rethinking Eligibility Traces | Proceedings of the AAAI Conference on Artificial Intelligence | Dhawal Gupta Scott M Jordan Shreyas Chaudhari Bo Liu Philip S Thomas | 2024/3/24 |
Behavior Alignment via Reward Function Optimization | Advances in Neural Information Processing Systems | Dhawal Gupta Yash Chandak Scott M. Jordan Philip S Thomas Bruno C da Silva | 2024/2/13 |
Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF | arXiv preprint arXiv:2309.09055 | Simeng Sun Dhawal Gupta Mohit Iyyer | 2023/9/16 |
Coagent Networks: Generalized and Scaled | arXiv preprint arXiv:2305.09838 | James E Kostas Scott M Jordan Yash Chandak Georgios Theocharous Dhawal Gupta | 2023/5/16 |
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management | arXiv preprint arXiv:2302.10850 | Dhawal Gupta Yinlam Chow Mohammad Ghavamzadeh Craig Boutilier | 2023/2/21 |
A Mixture-of-Expert Approach to RL-based Dialogue Management | arXiv preprint arXiv:2206.00059 | Yinlam Chow Aza Tulepbergenov Ofir Nachum MoonKyung Ryu Mohammad Ghavamzadeh | 2022/5/31 |
A unified dialogue management strategy for multi-intent dialogue conversations in multiple languages | Transactions on Asian and Low-Resource Language Information Processing | Tulika Saha Dhawal Gupta Sriparna Saha Pushpak Bhattacharyya | 2021/9/20 |
Structural Credit Assignment in Neural Networks using Reinforcement Learning | Advances in Neural Information Processing Systems | Dhawal Gupta Gabor Mihucz Matthew Schlegel James Kostas Philip S Thomas | 2021/12/6 |
A hierarchical approach for efficient multi-intent dialogue policy learning | Multimedia Tools and Applications | Tulika Saha Dhawal Gupta Sriparna Saha Pushpak Bhattacharyya | 2020/7/2 |
Applicability of Momentum in the Methods of Temporal Learning | Dhawal Gupta | 2020/3/6 | |
Emotion Aided Dialogue Act Classification for Task-Independent Conversations in a Multi-modal Framework | Cognitive Computation | Tulika Saha Dhawal Gupta Sriparna Saha Pushpak Bhattacharyya | 2020/1/22 |
Towards integrated dialogue policy learning for multiple domains and intents using Hierarchical Deep Reinforcement Learning | Expert Systems with Applications | Tulika Saha Dhawal Gupta Sriparna Saha Pushpak Bhattacharyya | 2020/12/30 |
Gradient Temporal-Difference Learning with Regularized Corrections | Sina Ghiassian Andrew Patterson Shivam Garg Dhawal Gupta Adam White | 2020/11/21 |