Michael Gimelfarb at University of Toronto

University	University of Toronto
Position	Mechanical and Industrial Engineering ; Vector Institute
Citations(all)	76
Citations(since 2020)	74
Cited By	14
hIndex(all)	5
hIndex(since 2020)	5
i10Index(all)	3
i10Index(since 2020)	3
Email	Access Email
University Profile Page	University of Toronto
Google Scholar	View Google Scholar Profile

The 2023 International Planning Competition

2024/4/5

Ayal Taitler

H-Index: 3

Gregor Behnke

H-Index: 16

Michael Gimelfarb

H-Index: 1

Florian Pommerening

H-Index: 11

Scott Sanner

H-Index: 26

Enrico Scala

H-Index: 7

Jendrik Seipp

H-Index: 13

Constraint-Generation Policy Optimization (CGPO): Nonlinear Programming for Policy Optimization in Mixed Discrete-Continuous MDPs

arXiv preprint arXiv:2401.12243

2024/1/20

Michael Gimelfarb

H-Index: 1

Ayal Taitler

H-Index: 3

Scott Sanner

H-Index: 26

Thompson Sampling for Parameterized Markov Decision Processes with Uninformative Actions

arXiv preprint arXiv:2305.07844

2023/5/13

Michael Gimelfarb

H-Index: 1

Who Should I Trust?: Uncertainty and Risk for Knowledge Transfer from Multiple Sources in Reinforcement Learning Domains

2023

Michael Gimelfarb

H-Index: 1

Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization

arXiv preprint arXiv:2210.03802

2022/10/7

Jihwan Jeong

H-Index: 2

Xiaoyu Wang

H-Index: 14

Michael Gimelfarb

H-Index: 1

Hyunwoo Kim

H-Index: 6

Baher Abdulhai

H-Index: 25

Scott Sanner

H-Index: 26

pyrddlgym: From rddl to gym environments

arXiv preprint arXiv:2211.05939

2022/11/11

Ayal Taitler

H-Index: 3

Michael Gimelfarb

H-Index: 1

Jihwan Jeong

H-Index: 2

Sriram Gopalakrishnan

H-Index: 0

Scott Sanner

H-Index: 26

A Distributional Framework for Risk-Sensitive End-to-End Planning in Continuous MDPs

Proceedings of the AAAI Conference on Artificial Intelligence

2022/6/28

Jihwan Jeong

H-Index: 2

Scott Sanner

H-Index: 26

End-to-End Risk-Aware Planning by Gradient Descent

2021

Jihwan Jeong

H-Index: 2

Michael Gimelfarb

H-Index: 1

Scott Sanner

H-Index: 26

Risk-Aware Transfer in Reinforcement Learning using Successor Features

Advances in Neural Information Processing Systems

2021/12/6

Michael Gimelfarb

H-Index: 1

Scott Sanner

H-Index: 26

Chi-Guhn Lee

H-Index: 17

Bayesian Experience Reuse for Learning from Multiple Demonstrators

arXiv preprint arXiv:2006.05725

2020/6/10

Michael Gimelfarb

H-Index: 1

Scott Sanner

H-Index: 26

Chi-Guhn Lee

H-Index: 17

Contextual policy transfer in reinforcement learning domains via deep mixtures-of-experts

2021/12/1

Michael Gimelfarb

H-Index: 1

Scott Sanner

H-Index: 26

Chi-Guhn Lee

H-Index: 17

Michael Gimelfarb

University of Toronto

About Michael Gimelfarb

Michael Gimelfarb Information

Michael Gimelfarb Skills & Research Interests

Top articles of Michael Gimelfarb

The 2023 International Planning Competition

Ayal Taitler

Gregor Behnke

Michael Gimelfarb

Florian Pommerening

Scott Sanner

Enrico Scala

Jendrik Seipp

Constraint-Generation Policy Optimization (CGPO): Nonlinear Programming for Policy Optimization in Mixed Discrete-Continuous MDPs

Michael Gimelfarb

Ayal Taitler

Scott Sanner

Thompson Sampling for Parameterized Markov Decision Processes with Uninformative Actions

Michael Gimelfarb

Who Should I Trust?: Uncertainty and Risk for Knowledge Transfer from Multiple Sources in Reinforcement Learning Domains

Michael Gimelfarb

Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization

Jihwan Jeong

Xiaoyu Wang

Michael Gimelfarb

Hyunwoo Kim

Baher Abdulhai

Scott Sanner

pyrddlgym: From rddl to gym environments

Ayal Taitler

Michael Gimelfarb

Jihwan Jeong

Sriram Gopalakrishnan

Scott Sanner

A Distributional Framework for Risk-Sensitive End-to-End Planning in Continuous MDPs

Jihwan Jeong

Scott Sanner

End-to-End Risk-Aware Planning by Gradient Descent

Jihwan Jeong

Michael Gimelfarb

Scott Sanner

Risk-Aware Transfer in Reinforcement Learning using Successor Features

Michael Gimelfarb

Scott Sanner

Chi-Guhn Lee

Bayesian Experience Reuse for Learning from Multiple Demonstrators

Michael Gimelfarb

Scott Sanner

Chi-Guhn Lee

Contextual policy transfer in reinforcement learning domains via deep mixtures-of-experts

Michael Gimelfarb

Scott Sanner

Chi-Guhn Lee

Co-Authors

Scott Sanner

Baher Abdulhai

Chi-Guhn Lee

Sriram Gopalakrishnan

Ayal Taitler

Jihwan Jeong