Doina Precup
McGill University
H-index: 63
North America-Canada
Top articles of Doina Precup
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
On the Privacy of Selection Mechanisms with Gaussian Noise | arXiv preprint arXiv:2402.06137 | Jonathan Lebensold Doina Precup Borja Balle | 2024/2/9 |
Mixtures of Experts Unlock Parameter Scaling for Deep RL | arXiv preprint arXiv:2402.08609 | Johan Obando-Ceron Ghada Sokar Timon Willi Clare Lyle Jesse Farebrother | 2024/2/13 |
On learning history-based policies for controlling Markov decision processes | Gandharv Patil Aditya Mahajan Doina Precup | 2024/4/18 | |
QGFN: Controllable Greediness with Action Values | arXiv preprint arXiv:2402.05234 | Elaine Lau Stephen Zhewen Lu Ling Pan Doina Precup Emmanuel Bengio | 2024/2/7 |
Prediction and Control in Continual Reinforcement Learning | Advances in Neural Information Processing Systems | Nishanth Anand Doina Precup | 2024/2/13 |
Mudiff: Unified diffusion for complete molecule generation | Chenqing Hua Sitao Luan Minkai Xu Zhitao Ying Jie Fu | 2024/4/17 | |
Code as Reward: Empowering Reinforcement Learning with VLMs | arXiv preprint arXiv:2402.04764 | David Venuto Sami Nur Islam Martin Klissarov Doina Precup Sherry Yang | 2024/2/7 |
A definition of continual reinforcement learning | Advances in Neural Information Processing Systems | David Abel André Barreto Benjamin Van Roy Doina Precup Hado P van Hasselt | 2024/2/13 |
Cryceleb: a speaker verification dataset based on infant cry sounds | David Budaghyan Charles C Onu Arsenii Gorin Cem Subakan Doina Precup | 2024/4/14 | |
Effective Protein-Protein Interaction Exploration with PPIretrieval | arXiv preprint arXiv:2402.03675 | Chenqing Hua Connor Coley Guy Wolf Doina Precup Shuangjia Zheng | 2024/2/6 |
For sale: State-action representation learning for deep reinforcement learning | Advances in Neural Information Processing Systems | Scott Fujimoto Wei-Di Chang Edward Smith Shixiang Shane Gu Doina Precup | 2024/2/13 |
Offline Multitask Representation Learning for Reinforcement Learning | arXiv preprint arXiv:2403.11574 | Haque Ishfaq Thanh Nguyen-Tang Songtao Feng Raman Arora Mengdi Wang | 2024/3/18 |
Device-free localization methods within smart indoor environments | 2024/1/25 | ||
When Do Graph Neural Networks Help with Node Classification? Investigating the Homophily Principle on Node Distinguishability | Sitao Luan Chenqing Hua Minkai Xu Qincheng Lu Jiaqi Zhu | 2023/9/22 | |
Discrete Probabilistic Inference as Control in Multi-path Environments | arXiv preprint arXiv:2402.10309 | Tristan Deleu Padideh Nouri Nikolay Malkin Doina Precup Yoshua Bengio | 2024/2/15 |
Conditions on Preference Relations that Guarantee the Existence of Optimal Policies | Jonathan Colaço Carr Prakash Panangaden Doina Precup | 2024/4/18 | |
Policy Gradient Methods in the Presence of Symmetries and State Abstractions | Journal of Machine Learning Research | Prakash Panangaden Sahand Rezaei-Shoshtari Rosie Zhao David Meger Doina Precup | 2024 |
Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search | arXiv preprint arXiv:2311.03583 | Abbas Mehrabian Ankit Anand Hyunjik Kim Nicolas Sonnerat Matej Balog | 2023/11/6 |
Combining Spatial and Temporal Abstraction in Planning for Better Generalization | arXiv preprint arXiv:2310.00229 | Mingde Zhao Safa Alver Harm van Seijen Romain Laroche Doina Precup | 2023/9/30 |
Accelerating exploration and representation learning with offline pre-training | arXiv preprint arXiv:2304.00046 | Bogdan Mazoure Jake Bruce Doina Precup Rob Fergus Ankit Anand | 2023/3/31 |