Subbarao Kambhampati
Arizona State University
H-index: 63
North America-United States
Top articles of Subbarao Kambhampati
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks | arXiv preprint arXiv:2402.08115 | Kaya Stechly Karthik Valmeekam Subbarao Kambhampati | 2024/2/12 |
Theory of Mind abilities of Large Language Models in Human-Robot Interaction: An Illusion? | Mudit Verma Siddhant Bhambri Subbarao Kambhampati | 2024/3/11 | |
Learning from ambiguous demonstrations with self-explanation guided reinforcement learning | Proceedings of the AAAI Conference on Artificial Intelligence | Yantian Zha Lin Guan Subbarao Kambhampati | 2024/3/24 |
" Task Success" is not Enough: Investigating the Use of Video-Language Models as Behavior Critics for Catching Undesirable Agent Behaviors | arXiv preprint arXiv:2402.04210 | Lin Guan Yifan Zhou Denis Liu Yantian Zha Heni Ben Amor | 2024/2/6 |
‘Why didn’t you allocate this task to them?’Negotiation-Aware Task Allocation and Contrastive Explanation Generation | Proceedings of the AAAI Conference on Artificial Intelligence | Zahra Zahedi Sailik Sengupta Subbarao Kambhampati | 2024/3/24 |
Planbench: An extensible benchmark for evaluating large language models on planning and reasoning about change | Advances in Neural Information Processing Systems | Karthik Valmeekam Matthew Marquez Alberto Olmo Sarath Sreedharan Subbarao Kambhampati | 2024/2/13 |
LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks | arXiv preprint arXiv:2402.01817 | Subbarao Kambhampati Karthik Valmeekam Lin Guan Kaya Stechly Mudit Verma | 2024/2/2 |
On the Pitfalls of Learning to Cooperate with Self Play Agents Checkpointed to Capture Humans of Diverse Skill Levels | Upasana Biswas Lin Guan Subbarao Kambhampati | 2024/3/11 | |
Methods and Mechanisms for Interactive Novelty Handling in Adversarial Environments | arXiv preprint arXiv:2302.14208 | Tung Thai Ming Shen Mayank Garg Ayush Kalani Nakul Vaidya | 2023/2/28 |
GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problems | arXiv preprint arXiv:2310.12397 | Kaya Stechly Matthew Marquez Subbarao Kambhampati | 2023/10/19 |
Benchmarking Multi-Agent Preference-based Reinforcement Learning for Human-AI Teaming | arXiv preprint arXiv:2312.14292 | Siddhant Bhambri Mudit Verma Anil Murthy Subbarao Kambhampati | 2023/12/21 |
A State Augmentation based approach to Reinforcement Learning from Human Preferences | arXiv preprint arXiv:2302.08734 | Mudit Verma Subbarao Kambhampati | 2023/2/17 |
Trust-aware planning: Modeling trust evolution in iterated human-robot interaction | Zahra Zahedi Mudit Verma Sarath Sreedharan Subbarao Kambhampati | 2023/3/13 | |
Preference proxies: Evaluating large language models in capturing human preferences in human-AI tasks | Mudit Verma Siddhant Bhambri Subbarao Kambhampati | 2023/10/4 | |
Exploiting Action Distances for Reward Learning from Human Preferences | Mudit Verma Siddhant Bhambri Subbarao Kambhampati | 2023/10/13 | |
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning | arXiv preprint arXiv:2302.08738 | Mudit Verma Siddhant Bhambri Subbarao Kambhampati | 2023/2/17 |
Generalizing action justification and causal links to policies | Proceedings of the International Conference on Automated Planning and Scheduling | Sarath Sreedharan Christian Muise Subbarao Kambhampati | 2023/7/1 |
Data Driven Reward Initialization for Preference based Reinforcement Learning | arXiv preprint arXiv:2302.08733 | Mudit Verma Subbarao Kambhampati | 2023/2/17 |
Leveraging pre-trained large language models to construct and utilize world models for model-based task planning | Advances in Neural Information Processing Systems | Lin Guan Karthik Valmeekam Sarath Sreedharan Subbarao Kambhampati | 2023/12/15 |
Can Large Language Models Really Improve by Self-critiquing Their Own Plans? | arXiv preprint arXiv:2310.08118 | Karthik Valmeekam Matthew Marquez Subbarao Kambhampati | 2023/10/12 |