Beining Han
Tsinghua University
H-index: 6
Asia-China
Top articles of Beining Han
Infinite photorealistic worlds using procedural generation
2023
Off-policy reinforcement learning with delayed rewards
2022/6/28
Towards understanding cooperative multi-agent q-learning with value factorization
2021/12/6
Learning domain invariant representations in goal-conditioned block mdps
Advances in Neural Information Processing Systems
2021/12/6
On the estimation bias in double q-learning
2021/12/6
Dop: Off-policy multi-agent decomposed policy gradients
2020/9/28