Peter Vamplew
Federation University Australia
H-index: 29
Oceania-Australia
Top articles of Peter Vamplew
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Elastic Step DQN: A novel multi-step algorithm to alleviate overestimation in Deep Q-Networks | Neurocomputing | Adrian Ly Richard Dazeley Peter Vamplew Francisco Cruz Sunil Aryal | 2024/4/1 |
Value function interference and greedy action selection in value-based multi-objective reinforcement learning | arXiv preprint arXiv:2402.06266 | Peter Vamplew Cameron Foale Richard Dazeley | 2024/2/9 |
Utility-Based Reinforcement Learning: Unifying Single-objective and Multi-objective Reinforcement Learning | arXiv preprint arXiv:2402.02665 | Peter Vamplew Cameron Foale Conor F Hayes Patrick Mannion Enda Howley | 2024/2/5 |
An Empirical Investigation of Value-Based Multi-objective Reinforcement Learning for Stochastic Environments | arXiv preprint arXiv:2401.03163 | Kewen Ding Peter Vamplew Cameron Foale Richard Dazeley | 2024/1/6 |
Scalar reward is not enough: JAAMAS track | Peter Vamplew Benjamin J Smith Johan Källström Gabriel De Oliveira Ramos Roxana Radulescu | 2023 | |
AI apology: interactive multi-objective reinforcement learning for human-aligned AI | Neural Computing and Applications | Hadassah Harland Richard Dazeley Bahareh Nakisa Francisco Cruz Peter Vamplew | 2023/8 |
A Brief Guide to Multi-Objective Reinforcement Learning and Planning: JAAMAS Track | Conor F Hayes Roxana Rădulescu Eugenio Bargiacchi Johan Kallstrom Matthew Macfarlane | 2023/5/30 | |
Explainable reinforcement learning for broad-xai: a conceptual framework and survey | Neural Computing and Applications | Richard Dazeley Peter Vamplew Francisco Cruz | 2023/8 |
Elastic step DDPG: Multi-step reinforcement learning for improved sample efficiency | Adrian Ly Richard Dazeley Peter Vamplew Francisco Cruz Sunil Aryal | 2023/6/18 | |
A NetHack Learning Environment Language Wrapper for Autonomous Agents | Journal of Open Research Software | Nikolaj Goodger Peter Vamplew Cameron Foale Richard Dazeley | 2023/6/13 |
Intent-aligned AI systems deplete human agency: the need for agency foundations research in AI safety | arXiv preprint arXiv:2305.19223 | Catalin Mitelut Ben Smith Peter Vamplew | 2023/5/30 |
A practical guide to multi-objective reinforcement learning and planning | Autonomous Agents and Multi-Agent Systems | Conor F Hayes Roxana Rădulescu Eugenio Bargiacchi Johan Källström Matthew Macfarlane | 2022/4 |
Discrete-to-deep reinforcement learning methods | Neural Computing and Applications | Budi Kurniawan Peter Vamplew Michael Papasimeon Richard Dazeley Cameron Foale | 2022/2 |
Evaluating Human-like Explanations for Robot Actions in Reinforcement Learning Scenarios | Francisco Cruz Charlotte Young Richard Dazeley Peter Vamplew | 2022/10/23 | |
The impact of environmental stochasticity on value-based multiobjective reinforcement learning | Neural Computing and Applications | Peter Vamplew Cameron Foale Richard Dazeley | 2022/2/3 |
Broad-persistent Advice for Interactive Reinforcement Learning Scenarios | arXiv preprint arXiv:2210.05187 | Francisco Cruz Adam Bignold Hung Son Nguyen Richard Dazeley Peter Vamplew | 2022/10/11 |
Human engagement providing evaluative and informative advice for interactive reinforcement learning | Neural Computing and Applications | Adam Bignold Francisco Cruz Richard Dazeley Peter Vamplew Cameron Foale | 2023/9 |
Scalar reward is not enough: A response to Silver, Singh, Precup and Sutton (2021) | Autonomous Agents and Multi-Agent Systems | Peter Vamplew Benjamin J Smith Johan Källström Gabriel Ramos Roxana Rădulescu | 2022/10 |
An online scalarization multi-objective reinforcement learning algorithm: TOPSIS Q-learning | The Knowledge Engineering Review | Mohammad Mirzanejad Morteza Ebrahimi Peter Vamplew Hadi Veisi | 2022/1 |
A Low-Level Hybrid Intrusion Detection System Based on Hardware Performance Counters | Ansam Khraisat Iqbal Gondal Peter Vamplew Joarder Kamruzzaman | 2022/6/30 |