Gergely Neu
Universidad Pompeu Fabra
H-index: 26
Europe-Spain
Top articles of Gergely Neu
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Importance-weighted offline learning done right | Germano Gabbianelli Gergely Neu Matteo Papini | 2024/3/15 | |
Optimisic Information Directed Sampling | arXiv preprint arXiv:2402.15411 | Gergely Neu Matteo Papini Ludovic Schwartz | 2024/2/23 |
Dealing with unbounded gradients in stochastic saddle-point optimization | arXiv preprint arXiv:2402.13903 | Gergely Neu Nneka Okolo | 2024/2/21 |
Seconder of the vote of thanks to Waudby-Smith and Ramdas and contribution to the Discussion of ‘Estimating means of bounded random variables by betting’ | Journal of the Royal Statistical Society Series B: Statistical Methodology | Gergely Neu | 2024/2 |
Adversarial Contextual Bandits Go Kernelized | Proceedings of Machine Learning Research vol | Gergely Neu Julia Olkhovskaya Sattar Vakili | 2024 |
On the Hardness of Learning from Censored and Nonstationary Demand | INFORMS Journal on Optimization | Gábor Lugosi Mihalis G Markakis Gergely Neu | 2023/9/28 |
Optimistic Planning by Regularized Dynamic Programming | Antoine Moulin Gergely Neu | 2023/2/27 | |
Efficient Global Planning in Large MDPs via Stochastic Primal-Dual Optimization | Gergely Neu Nneka Okolo | 2023/2/13 | |
Online-to-PAC Conversions: Generalization Bounds via Regret Analysis | arXiv preprint arXiv:2305.19674 | Gábor Lugosi Gergely Neu | 2023/5/31 |
Online Learning with Off-Policy Feedback | Germano Gabbianelli Gergely Neu Matteo Papini | 2023/2/13 | |
Offline Primal-Dual Reinforcement Learning for Linear MDPs | arXiv preprint arXiv:2305.12944 | Germano Gabbianelli Gergely Neu Nneka Okolo Matteo Papini | 2023/5/22 |
First-and Second-Order Bounds for Adversarial Linear Contextual Bandits | Advances in Neural Information Processing Systems | Julia Olkhovskaya Jack Mayo Tim van Erven Gergely Neu Chen-Yu Wei | 2023/12 |
Nonstochastic Contextual Combinatorial Bandits | Lukas Zierahn Dirk van der Hoeven Nicolò Cesa-Bianchi Gergely Neu | 2023/4/11 | |
Lifting the information ratio: An information-theoretic analysis of thompson sampling for contextual bandits | Advances in Neural Information Processing Systems | Gergely Neu Julia Olkhovskaya Matteo Papini Ludovic Schwartz | 2022/12/6 |
Sufficient Exploration for Convex Q-learning | arXiv preprint arXiv:2210.09409 | Fan Lu Prashant Mehta Sean Meyn Gergely Neu | 2022/10/17 |
Generalization Bounds via Convex Analysis | Gábor Lugosi Gergely Neu | 2022/7 | |
Convex analytic theory for convex Q-learning | Fan Lu Prashant G Mehta Sean P Meyn Gergely Neu | 2022/12/6 | |
Proximal point imitation learning | Advances in Neural Information Processing Systems | Luca Viano Angeliki Kamoutsi Gergely Neu Igor Krawczuk Volkan Cevher | 2022/12/6 |
Learning to maximize global influence from local observations | arXiv preprint arXiv:2109.11909 | Gábor Lugosi Gergely Neu Julia Olkhovskaya | 2021/9/24 |
Information-Theoretic Generalization Bounds for Stochastic Gradient Descent | Gergely Neu Gintare Karolina Dziugaite Mahdi Haghifam Daniel M. Roy | 2021/8 |