Matthijs T. J. Spaan

Matthijs T. J. Spaan

Technische Universiteit Delft

H-index: 35

Europe-Netherlands

About Matthijs T. J. Spaan

Matthijs T. J. Spaan, With an exceptional h-index of 35 and a recent h-index of 23 (since 2020), a distinguished researcher at Technische Universiteit Delft,

His recent articles reflect a diverse array of research interests and contributions to the field:

When Do Off-Policy and On-Policy Policy Gradient Methods Align?

4.17 E-MCTS: Deep Exploration in Model-Based Reinforcement Learning by Planning with Epistemic Uncertainty

Bayesian Ensembles for Exploration in Deep Q-Learning

Scalable safe policy improvement via monte carlo tree search

Cem: Constrained entropy maximization for task-agnostic safe exploration

Diverse Projection Ensembles for Distributional Reinforcement Learning

Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in Reinforcement Learning

The Role of Diverse Replay for Generalisation in Reinforcement Learning

Matthijs T. J. Spaan Information

University

Position

___

Citations(all)

5696

Citations(since 2020)

2553

Cited By

3995

hIndex(all)

35

hIndex(since 2020)

23

i10Index(all)

79

i10Index(since 2020)

49

Email

University Profile Page

Technische Universiteit Delft

Google Scholar

View Google Scholar Profile

Top articles of Matthijs T. J. Spaan

Title

Journal

Author(s)

Publication Date

When Do Off-Policy and On-Policy Policy Gradient Methods Align?

arXiv preprint arXiv:2402.12034

Davide Mambelli

Stephan Bongers

Onno Zoeter

Matthijs TJ Spaan

Frans A Oliehoek

2024/2/19

4.17 E-MCTS: Deep Exploration in Model-Based Reinforcement Learning by Planning with Epistemic Uncertainty

Scalable Analysis of Probabilistic Models and Programs

Matthijs Spaan

2024/2

Bayesian Ensembles for Exploration in Deep Q-Learning

Pascal Van der Vaart

Neil Yorke-Smith

Matthijs TJ Spaan

2024/5/6

Scalable safe policy improvement via monte carlo tree search

Alberto Castellini

Federico Bianchi

Edoardo Zorzi

Thiago D Simao

Alessandro Farinelli

...

2023/7/3

Cem: Constrained entropy maximization for task-agnostic safe exploration

Proceedings of the AAAI Conference on Artificial Intelligence

Qisong Yang

Matthijs TJ Spaan

2023/6/26

Diverse Projection Ensembles for Distributional Reinforcement Learning

arXiv preprint arXiv:2306.07124

Moritz A Zanger

Wendelin Böhmer

Matthijs TJ Spaan

2023/6/12

Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in Reinforcement Learning

Miguel Suau

Matthijs TJ Spaan

Frans A Oliehoek

2023/10/13

The Role of Diverse Replay for Generalisation in Reinforcement Learning

arXiv preprint arXiv:2306.05727

Max Weltevrede

Matthijs TJ Spaan

Wendelin Böhmer

2023/6/9

Reinforcement Learning by Guided Safe Exploration

ECAI 2023

Qisong Yang

Thiago D Simão

Nils Jansen

Simon H Tindemans

Matthijs TJ Spaan

2023/7/26

Safety-constrained reinforcement learning with a distributional safety critic

Machine Learning

Qisong Yang

Thiago D Simão

Simon H Tindemans

Matthijs TJ Spaan

2023/3

Bayesian Deep Q-Learning via Sequential Monte Carlo

Pascal Van der Vaart

Matthijs TJ Spaan

Neil Yorke-Smith

2023/7/20

Refined Risk Management in Safe Reinforcement Learning with a Distributional Safety Critic

Qisong Yang

Thiago D Simão

Simon H Tindemans

Matthijs TJ Spaan

2022

Influence-augmented local simulators: A scalable solution for fast deep rl in large networked systems

ICML 2022

M Suau

J He

MTJ Spaan

FA Oliehoek

2022

Distributed influence-augmented local simulators for parallel MARL in large networked systems

NeurIPS 2022

Miguel Suau

Jinke He

Mustafa Mert Çelikok

Matthijs TJ Spaan

Frans A Oliehoek

2022/7/1

Training and transferring safe policies in reinforcement learning

Qisong Yang

T Simão

Nils Jansen

S Tindemans

M Spaan

2022

Speeding up deep reinforcement learning through influence-augmented local simulators

Miguel Suau

Jinke He

Matthijs TJ Spaan

Frans A Oliehoek

2022/5/9

E-MCTS: Deep Exploration in Model-Based Reinforcement Learning by Planning with Epistemic Uncertainty

arXiv preprint arXiv:2210.13455

Yaniv Oren

Matthijs TJ Spaan

Wendelin Böhmer

2022/10/21

Back to the Future: Solving Hidden Parameter MDPs with Hindsight

Canmanie Ponnambalam

Danial Kamran

Thiago D Simão

Frans A Oliehoek

Matthijs TJ Spaan

2022

Large-scale collaborative vehicle routing

Annals of Operations Research

Johan Los

Frederik Schulte

Margaretha Gansterer

Richard F Hartl

Matthijs TJ Spaan

...

2022/4/8

A modern perspective on safe automated driving for different traffic dynamics using constrained reinforcement learning

Danial Kamran

TD Simão

Q Yang

CT Ponnambalam

Johannes Fischer

...

2022/10

See List of Professors in Matthijs T. J. Spaan University(Technische Universiteit Delft)

Co-Authors

H-index: 64
Shimon Whiteson

Shimon Whiteson

University of Oxford

H-index: 51
Pascal Poupart

Pascal Poupart

University of Waterloo

H-index: 37
Frans Groen

Frans Groen

Universiteit van Amsterdam

H-index: 35
Luis Merino

Luis Merino

Universidad Pablo de Olavide

H-index: 35
Frans A. Oliehoek

Frans A. Oliehoek

Technische Universiteit Delft

H-index: 34
Christopher Amato

Christopher Amato

North Eastern University

academic-engine