Nathan Lambert

Nathan Lambert

University of California, Berkeley

H-index: 14

North America-United States

About Nathan Lambert

Nathan Lambert, With an exceptional h-index of 14 and a recent h-index of 14 (since 2020), a distinguished researcher at University of California, Berkeley, specializes in the field of Reinforcement Learning, Machine Learning, Robotics, Responsible AI.

His recent articles reflect a diverse array of research interests and contributions to the field:

Social Choice for AI Alignment: Dealing with Diverse Human Feedback

Rewardbench: Evaluating reward models for language modeling

A Survey on Data Selection for Language Models

Olmo: Accelerating the science of language models

BLISS: Interplanetary exploration with swarms of low-cost spacecraft

Dolma: An Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

The alignment ceiling: Objective mismatch in reinforcement learning from human feedback

Zephyr: Direct distillation of lm alignment

Nathan Lambert Information

University

Position

___

Citations(all)

928

Citations(since 2020)

922

Cited By

94

hIndex(all)

14

hIndex(since 2020)

14

i10Index(all)

19

i10Index(since 2020)

19

Email

University Profile Page

Google Scholar

Nathan Lambert Skills & Research Interests

Reinforcement Learning

Machine Learning

Robotics

Responsible AI

Top articles of Nathan Lambert

Social Choice for AI Alignment: Dealing with Diverse Human Feedback

arXiv preprint arXiv:2404.10271

2024/4/16

Rewardbench: Evaluating reward models for language modeling

arXiv preprint arXiv:2403.13787

2024/3/20

BLISS: Interplanetary exploration with swarms of low-cost spacecraft

Acta Astronautica

2024/2/1

Lydia Lee
Lydia Lee

H-Index: 3

Nathan Lambert
Nathan Lambert

H-Index: 5

The alignment ceiling: Objective mismatch in reinforcement learning from human feedback

arXiv preprint arXiv:2311.00168

2023/10/31

Nathan Lambert
Nathan Lambert

H-Index: 5

Zephyr: Direct distillation of lm alignment

arXiv preprint arXiv:2310.16944

2023/10/25

Nathan Lambert
Nathan Lambert

H-Index: 5

Shengyi Huang
Shengyi Huang

H-Index: 2

The History and Risks of Reinforcement Learning and Human Feedback

arXiv preprint arXiv:2310.13595

2023/10/20

Nathan Lambert
Nathan Lambert

H-Index: 5

Thomas Krendl Gilbert
Thomas Krendl Gilbert

H-Index: 3

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

arXiv preprint arXiv:2310.06253

2023/10/10

Reward reports for reinforcement learning

2023/8/8

Confidence-building measures for artificial intelligence: Workshop proceedings

arXiv preprint arXiv:2308.00862

2023/8/1

Camels in a changing climate: Enhancing lm adaptation with tulu 2

arXiv preprint arXiv:2311.10702

2023/11/17

Measuring data

arXiv preprint arXiv:2212.05129

2022/12/9

Nathan Lambert
Nathan Lambert

H-Index: 5

Angelina Mcmillan-Major
Angelina Mcmillan-Major

H-Index: 2

Investigating compounding prediction errors in learned dynamics models

arXiv preprint arXiv:2203.09637

2022/3/17

Nathan Lambert
Nathan Lambert

H-Index: 5

Kristofer Pister
Kristofer Pister

H-Index: 41

Choices, Risks, and Reward Reports: Charting Public Policy for Reinforcement Learning Systems

Center for Long Term Cybersecurity Whitepaper Series

2022/2/11

The challenges of exploration for offline reinforcement learning

arXiv preprint arXiv:2201.11861

2022/1/27

Synergy of Prediction and Control in Model-based Reinforcement Learning

2022

Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning

2021/12/15

Nathan Lambert
Nathan Lambert

H-Index: 5

Kristofer Pister
Kristofer Pister

H-Index: 41

Predicting Flying Robot Dynamics with Deep Learning

Journal of Student Research

2021/11/22

Nathan Lambert
Nathan Lambert

H-Index: 5

See List of Professors in Nathan Lambert University(University of California, Berkeley)

Co-Authors

academic-engine