Jesse Mu
Stanford University
H-Index: 11
North America-United States
Top articles of Jesse Mu
Sleeper agents: Training deceptive LLMs that persist through safety training
arXiv preprint arXiv:2401.05566
2024/1/10
Learning to compress prompts with gist tokens
Advances in Neural Information Processing Systems
2023/12/10
Jesse Mu
H-Index: 5
Xiang Li
H-Index: 19
Characterizing tradeoffs between teaching via language and demonstrations in multi-agent systems
arXiv preprint arXiv:2305.11374
2023/5/19
Jesse Mu
H-Index: 5
In the ZONE: Measuring difficulty and progression in curriculum generation
2022
Improving policy learning via language dynamics distillation
Advances in Neural Information Processing Systems
2022/12/6
Active learning helps pretrained models learn the intended task
Advances in Neural Information Processing Systems
2022/12/6
Improving intrinsic exploration with language abstractions
Advances in Neural Information Processing Systems
2022/12/6
STaR: Self-taught reasoner bootstrapping reasoning with reasoning
2022
Eric Zelikman
H-Index: 2
Jesse Mu
H-Index: 5
Emergent communication of generalizations
Advances in Neural Information Processing Systems
2021/12/6
Jesse Mu
H-Index: 5
Calibrate your listeners! Robust communication-based training for pragmatic speakers
Findings of EMNLP 2021
2021/10/11
Rose E Wang
H-Index: 2
Jesse Mu
H-Index: 5
Multi-party referential communication in complex strategic games
2021/10/1
Learning to refer informatively by amortizing pragmatic reasoning
arXiv preprint arXiv:2006.00418
2020/5/31
Jesse Mu
H-Index: 5
Compositional explanations of neurons
Advances in Neural Information Processing Systems
2020
Jesse Mu
H-Index: 5
Jacob Andreas
H-Index: 24