
Dhawal Gupta

University of Alberta

H-index: 6

North America-Canada

About Dhawal Gupta

Dhawal Gupta is a researcher at the University of Alberta with an h-index of 6 (also 6 when counting only citations since 2020). He specializes in reinforcement learning, machine learning, robotics, and optimal control.

His recent articles include:

From Past to Future: Rethinking Eligibility Traces

Behavior Alignment via Reward Function Optimization

Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF

Coagent Networks: Generalized and Scaled

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management

A Mixture-of-Expert Approach to RL-based Dialogue Management

A unified dialogue management strategy for multi-intent dialogue conversations in multiple languages

Structural Credit Assignment in Neural Networks using Reinforcement Learning

Dhawal Gupta Information

University	University of Alberta
Position	Graduate Student
Citations(all)	117
Citations(since 2020)	117
Cited By	10
hIndex(all)	6
hIndex(since 2020)	6
i10Index(all)	3
i10Index(since 2020)	3

Dhawal Gupta Skills & Research Interests

Reinforcement Learning

Machine Learning

Robotics

Optimal Control

Top articles of Dhawal Gupta

Title	Journal	Author(s)	Publication Date
From Past to Future: Rethinking Eligibility Traces	Proceedings of the AAAI Conference on Artificial Intelligence	Dhawal Gupta, Scott M Jordan, Shreyas Chaudhari, Bo Liu, Philip S Thomas, ...	2024/3/24
Behavior Alignment via Reward Function Optimization	Advances in Neural Information Processing Systems	Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S Thomas, Bruno C da Silva	2024/2/13
Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF	arXiv preprint arXiv:2309.09055	Simeng Sun, Dhawal Gupta, Mohit Iyyer	2023/9/16
Coagent Networks: Generalized and Scaled	arXiv preprint arXiv:2305.09838	James E Kostas, Scott M Jordan, Yash Chandak, Georgios Theocharous, Dhawal Gupta, ...	2023/5/16
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management	arXiv preprint arXiv:2302.10850	Dhawal Gupta, Yinlam Chow, Mohammad Ghavamzadeh, Craig Boutilier	2023/2/21
A Mixture-of-Expert Approach to RL-based Dialogue Management	arXiv preprint arXiv:2206.00059	Yinlam Chow, Aza Tulepbergenov, Ofir Nachum, MoonKyung Ryu, Mohammad Ghavamzadeh, ...	2022/5/31
A unified dialogue management strategy for multi-intent dialogue conversations in multiple languages	Transactions on Asian and Low-Resource Language Information Processing	Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya	2021/9/20
Structural Credit Assignment in Neural Networks using Reinforcement Learning	Advances in Neural Information Processing Systems	Dhawal Gupta, Gabor Mihucz, Matthew Schlegel, James Kostas, Philip S Thomas, ...	2021/12/6
A hierarchical approach for efficient multi-intent dialogue policy learning	Multimedia Tools and Applications	Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya	2020/7/2
Applicability of Momentum in the Methods of Temporal Learning		Dhawal Gupta	2020/3/6
Emotion Aided Dialogue Act Classification for Task-Independent Conversations in a Multi-modal Framework	Cognitive Computation	Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya	2020/1/22
Towards integrated dialogue policy learning for multiple domains and intents using Hierarchical Deep Reinforcement Learning	Expert Systems with Applications	Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya	2020/12/30
Gradient Temporal-Difference Learning with Regularized Corrections		Sina Ghiassian, Andrew Patterson, Shivam Garg, Dhawal Gupta, Adam White, ...	2020/11/21