Nathan Lambert

University of California, Berkeley

H-index: 14

North America-United States

About Nathan Lambert

Nathan Lambert is a researcher at the University of California, Berkeley, with an h-index of 14 overall and 14 since 2020. He specializes in reinforcement learning, machine learning, robotics, and responsible AI.

His recent articles include:

Social Choice for AI Alignment: Dealing with Diverse Human Feedback

RewardBench: Evaluating reward models for language modeling

A Survey on Data Selection for Language Models

OLMo: Accelerating the science of language models

BLISS: Interplanetary exploration with swarms of low-cost spacecraft

Dolma: An Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

The alignment ceiling: Objective mismatch in reinforcement learning from human feedback

Zephyr: Direct distillation of LM alignment

Nathan Lambert Information

University	University of California, Berkeley
Citations(all)	928
Citations(since 2020)	922
Cited By	94
hIndex(all)	14
hIndex(since 2020)	14
i10Index(all)	19
i10Index(since 2020)	19

Nathan Lambert Skills & Research Interests

Reinforcement Learning

Machine Learning

Robotics

Responsible AI

Top articles of Nathan Lambert

Social Choice for AI Alignment: Dealing with Diverse Human Feedback

arXiv preprint arXiv:2404.10271

2024/4/16

Vincent Conitzer

H-Index: 37

Rachel Freedman

H-Index: 1

Nathan Lambert

H-Index: 5

Eric Pacuit

H-Index: 18

Stuart Russell

H-Index: 54

RewardBench: Evaluating reward models for language modeling

arXiv preprint arXiv:2403.13787

2024/3/20

Nathan Lambert

H-Index: 5

Valentina Pyatkin

H-Index: 1

Nouha Dziri

H-Index: 3

Sachin Kumar

H-Index: 9

Yejin Choi

H-Index: 53

Hannaneh Hajishirzi

H-Index: 32

A Survey on Data Selection for Language Models

2024/2/26

Alon Albalak

H-Index: 0

Yanai Elazar

H-Index: 7

Sang Michael Xie

H-Index: 8

Shayne Longpre

H-Index: 5

Nathan Lambert

H-Index: 5

Xinyi Wang

H-Index: 3

Bairu Hou

H-Index: 1

Liangming Pan

H-Index: 6

Colin Raffel

H-Index: 29

Tatsunori Hashimoto

H-Index: 17

William Yang Wang

H-Index: 32

OLMo: Accelerating the science of language models

arXiv preprint arXiv:2402.00838

2024/2/1

Yizhong Wang

H-Index: 5

David Atkinson

H-Index: 9

Khyathi Raghavi Chandu

H-Index: 8

Yanai Elazar

H-Index: 7

Yuling Gu

H-Index: 1

Tushar Khot

H-Index: 20

Aakanksha Naik

H-Index: 4

Valentina Pyatkin

H-Index: 1

Abhilasha Ravichander

H-Index: 9

Will Smith

H-Index: 2

Emma Strubell

H-Index: 13

Mitchell Wortsman

H-Index: 5

Nathan Lambert

H-Index: 5

Luke Zettlemoyer

H-Index: 60

Jesse Dodge

H-Index: 12

Hannaneh Hajishirzi

H-Index: 32

BLISS: Interplanetary exploration with swarms of low-cost spacecraft

Acta Astronautica

2024/2/1

Lydia Lee

H-Index: 3

Nathan Lambert

H-Index: 5

Dolma: An Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

arXiv preprint arXiv:2402.00159

2024/1/31

David Atkinson

H-Index: 9

Ben Bogin

H-Index: 8

Yanai Elazar

H-Index: 7

Valentin Hofmann

H-Index: 3

Sachin Kumar

H-Index: 9

Li Lucy

H-Index: 3

Nathan Lambert

H-Index: 5

Aakanksha Naik

H-Index: 4

Abhilasha Ravichander

H-Index: 9

Emma Strubell

H-Index: 13

Luke Zettlemoyer

H-Index: 60

Hannaneh Hajishirzi

H-Index: 32

Jesse Dodge

H-Index: 12

The alignment ceiling: Objective mismatch in reinforcement learning from human feedback

arXiv preprint arXiv:2311.00168

2023/10/31

Nathan Lambert

H-Index: 5

Zephyr: Direct distillation of LM alignment

arXiv preprint arXiv:2310.16944

2023/10/25

Nathan Lambert

H-Index: 5

Shengyi Huang

H-Index: 2

The History and Risks of Reinforcement Learning and Human Feedback

arXiv preprint arXiv:2310.13595

2023/10/20

Nathan Lambert

H-Index: 5

Thomas Krendl Gilbert

H-Index: 3

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

arXiv preprint arXiv:2310.06253

2023/10/10

Ran Wei

H-Index: 5

Nathan Lambert

H-Index: 5

Alfredo Garcia

H-Index: 3

Reward reports for reinforcement learning

2023/8/8

Thomas Krendl Gilbert

H-Index: 3

Nathan Lambert

H-Index: 5

Sarah Dean

H-Index: 34

Soham Mehta

H-Index: 6

Confidence-building measures for artificial intelligence: Workshop proceedings

arXiv preprint arXiv:2308.00862

2023/8/1

Sarah Shoker

H-Index: 3

Andrew Reddie

H-Index: 5

Ritwik Gupta

H-Index: 3

Nathan Lambert

H-Index: 5

Robert Trager

H-Index: 10

Jessica Young

H-Index: 2

Camels in a changing climate: Enhancing LM adaptation with Tulu 2

arXiv preprint arXiv:2311.10702

2023/11/17

Yizhong Wang

H-Index: 5

Valentina Pyatkin

H-Index: 1

Nathan Lambert

H-Index: 5

Joel Jang

H-Index: 1

David Wadden

H-Index: 7

Hannaneh Hajishirzi

H-Index: 32

Measuring data

arXiv preprint arXiv:2212.05129

2022/12/9

Nathan Lambert

H-Index: 5

Angelina McMillan-Major

H-Index: 2

Investigating compounding prediction errors in learned dynamics models

arXiv preprint arXiv:2203.09637

2022/3/17

Nathan Lambert

H-Index: 5

Kristofer Pister

H-Index: 41

Choices, Risks, and Reward Reports: Charting Public Policy for Reinforcement Learning Systems

Center for Long-Term Cybersecurity Whitepaper Series

2022/2/11

Thomas Krendl Gilbert

H-Index: 3

Sarah Dean

H-Index: 34

Nathan Lambert

H-Index: 5

The challenges of exploration for offline reinforcement learning

arXiv preprint arXiv:2201.11861

2022/1/27

Nathan Lambert

H-Index: 5

William Whitney

H-Index: 7

Vibhavari Dasagi

H-Index: 4

Synergy of Prediction and Control in Model-based Reinforcement Learning

2022

Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning

2021/12/15

Nathan Lambert

H-Index: 5

Kristofer Pister

H-Index: 41

Predicting Flying Robot Dynamics with Deep Learning

Journal of Student Research

2021/11/22

Nathan Lambert

H-Index: 5

Co-Authors

H-index: 78

Kristofer Pister
University of California, Berkeley

H-index: 14

Sarah Dean
University of California, Berkeley

H-index: 12

Daniel S. Drew
Stanford University

H-index: 11

Thomas Krendl Gilbert
University of California, Berkeley

H-index: 8

Craig B. Schindler
University of California, Berkeley

H-index: 6

Lydia Lee
University of California, Berkeley
