Martha White
University of Alberta
H-index: 31
North America-Canada
Top articles of Martha White
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Compound Returns Reduce Variance in Reinforcement Learning | arXiv preprint arXiv:2402.03903 | Brett Daley Martha White Marlos C Machado | 2024/2/6 |
Tuning for the Unknown: Revisiting Evaluation Strategies for Lifelong RL | arXiv preprint arXiv:2404.02113 | Golnaz Mesbahi Olya Mastikhina Parham Mohammad Panahi Martha White Adam White | 2024/4/2 |
Investigating the properties of neural network representations in reinforcement learning | Artificial Intelligence | Han Wang Erfan Miahi Martha White Marlos C Machado Zaheer Abbas | 2024/3/1 |
Investigating the Histogram Loss in Regression | arXiv preprint arXiv:2402.13425 | Ehsan Imani Kai Luedemann Sam Scholnick-Hughes Esraa Elelimy Martha White | 2024/2/20 |
What to do when your discrete optimization is the size of a neural network? | arXiv preprint arXiv:2402.10339 | Hugo Silva Martha White | 2024/2/15 |
General Munchausen Reinforcement Learning with Tsallis Kullback-Leibler Divergence | Advances in Neural Information Processing Systems | Lingwei Zhu Zheng Chen Matthew Schlegel Martha White | 2024/2/13 |
Off-policy actor-critic with emphatic weightings | Journal of Machine Learning Research | Eric Graves Ehsan Imani Raksha Kumaraswamy Martha White | 2023 |
Empirical Design in Reinforcement Learning | arXiv preprint arXiv:2304.01315 | Andrew Patterson Samuel Neumann Martha White Adam White | 2023/4/3 |
GVFs in the real world: making predictions online for water treatment | Machine Learning | Muhammad Kamran Janjua Haseeb Shah Martha White Erfan Miahi Marlos C Machado | 2023/11/8 |
The In-Sample Softmax for Offline Reinforcement Learning | arXiv preprint arXiv:2302.14372 | Chenjun Xiao Han Wang Yangchen Pan Adam White Martha White | 2023/2/28 |
Value Bonuses using Ensemble Errors for Exploration in Reinforcement Learning | Abdul Wahab Raksha Kumaraswamy Martha White | 2023/10/13 | |
Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence | arXiv preprint arXiv:2301.11476 | Lingwei Zhu Zheng Chen Matthew Schlegel Martha White | 2023/1/27 |
Trajectory-aware eligibility traces for off-policy reinforcement learning | Brett Daley Martha White Christopher Amato Marlos C Machado | 2023/7/3 | |
Online real-time recurrent learning using sparse connections and selective learning | arXiv preprint arXiv:2302.05326 | Khurram Javed Haseeb Shah Rich Sutton Martha White | 2023/1/20 |
Exploiting action impact regularity and exogenous state variables for offline reinforcement learning | Journal of Artificial Intelligence Research | Vincent Liu James R Wright Martha White | 2023/5/11 |
Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks | Journal of Machine Learning Research | Khurram Javed Haseeb Shah Richard S Sutton Martha White | 2023 |
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments | Vincent Liu Yash Chandak Philip Thomas Martha White | 2023/4/11 | |
A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning | Journal of Machine Learning Research | Andrew Patterson Adam White Martha White | 2022 |
Goal-Space Planning with Subgoal Models | arXiv preprint arXiv:2206.02902 | Chunlok Lo Gabor Mihucz Adam White Farzane Aminmansour Martha White | 2022/6/6 |
Intermediate Machine Learning | University of Alberta course notes | Martha White | 2022/12/8 |