Nathan Lambert
University of California, Berkeley
H-index: 14
North America-United States
Top articles of Nathan Lambert
Social Choice for AI Alignment: Dealing with Diverse Human Feedback
arXiv preprint arXiv:2404.10271
2024/4/16
RewardBench: Evaluating reward models for language modeling
arXiv preprint arXiv:2403.13787
2024/3/20
A Survey on Data Selection for Language Models
2024/2/26
OLMo: Accelerating the science of language models
arXiv preprint arXiv:2402.00838
2024/2/1
Yizhong Wang
H-Index: 5
David Atkinson
H-Index: 9
Khyathi Raghavi Chandu
H-Index: 8
Yanai Elazar
H-Index: 7
Yuling Gu
H-Index: 1
Tushar Khot
H-Index: 20
Aakanksha Naik
H-Index: 4
Valentina Pyatkin
H-Index: 1
Abhilasha Ravichander
H-Index: 9
Will Smith
H-Index: 2
Emma Strubell
H-Index: 13
Mitchell Wortsman
H-Index: 5
Nathan Lambert
H-Index: 5
Luke Zettlemoyer
H-Index: 60
Jesse Dodge
H-Index: 12
Hannaneh Hajishirzi
H-Index: 32
BLISS: Interplanetary exploration with swarms of low-cost spacecraft
Acta Astronautica
2024/2/1
Lydia Lee
H-Index: 3
Nathan Lambert
H-Index: 5
Dolma: An Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
arXiv preprint arXiv:2402.00159
2024/1/31
David Atkinson
H-Index: 9
Ben Bogin
H-Index: 8
Yanai Elazar
H-Index: 7
Valentin Hofmann
H-Index: 3
Sachin Kumar
H-Index: 9
Li Lucy
H-Index: 3
Nathan Lambert
H-Index: 5
Aakanksha Naik
H-Index: 4
Abhilasha Ravichander
H-Index: 9
Emma Strubell
H-Index: 13
Luke Zettlemoyer
H-Index: 60
Hannaneh Hajishirzi
H-Index: 32
Jesse Dodge
H-Index: 12
The alignment ceiling: Objective mismatch in reinforcement learning from human feedback
arXiv preprint arXiv:2311.00168
2023/10/31
Nathan Lambert
H-Index: 5
Zephyr: Direct distillation of LM alignment
arXiv preprint arXiv:2310.16944
2023/10/25
Nathan Lambert
H-Index: 5
Shengyi Huang
H-Index: 2
The History and Risks of Reinforcement Learning and Human Feedback
arXiv preprint arXiv:2310.13595
2023/10/20
Nathan Lambert
H-Index: 5
Thomas Krendl Gilbert
H-Index: 3
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning
arXiv preprint arXiv:2310.06253
2023/10/10
Reward reports for reinforcement learning
2023/8/8
Confidence-building measures for artificial intelligence: Workshop proceedings
arXiv preprint arXiv:2308.00862
2023/8/1
Camels in a changing climate: Enhancing LM adaptation with Tulu 2
arXiv preprint arXiv:2311.10702
2023/11/17
Measuring data
arXiv preprint arXiv:2212.05129
2022/12/9
Nathan Lambert
H-Index: 5
Angelina McMillan-Major
H-Index: 2
Investigating compounding prediction errors in learned dynamics models
arXiv preprint arXiv:2203.09637
2022/3/17
Nathan Lambert
H-Index: 5
Kristofer Pister
H-Index: 41
Choices, Risks, and Reward Reports: Charting Public Policy for Reinforcement Learning Systems
Center for Long Term Cybersecurity Whitepaper Series
2022/2/11
The challenges of exploration for offline reinforcement learning
arXiv preprint arXiv:2201.11861
2022/1/27
Synergy of Prediction and Control in Model-based Reinforcement Learning
2022
Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning
2021/12/15
Nathan Lambert
H-Index: 5
Kristofer Pister
H-Index: 41
Predicting Flying Robot Dynamics with Deep Learning
Journal of Student Research
2021/11/22
Nathan Lambert
H-Index: 5