Yonatan Belinkov at Technion - Israel Institute of Technology

University	Technion - Israel Institute of Technology
Position	___
Citations(all)	8836
Citations(since 2020)	8248
Cited By	2585
hIndex(all)	43
hIndex(since 2020)	40
i10Index(all)	76
i10Index(since 2020)	74
Email	Access Email
University Profile Page	Technion - Israel Institute of Technology
Google Scholar	View Google Scholar Profile

Effect of tokenization on transformers for biological sequences

Bioinformatics

2024/4/12

Tal Pupko

H-Index: 40

Yonatan Belinkov

H-Index: 26

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

arXiv preprint arXiv:2403.19647

2024/3/28

Yonatan Belinkov

H-Index: 26

David Bau

H-Index: 20

Aaron Mueller

H-Index: 5

BetaAlign: a deep learning approach for multiple sequence alignment

bioRxiv

2024/3/27

Oren Avram

H-Index: 5

Yonatan Belinkov

H-Index: 26

Tal Pupko

H-Index: 40

Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms

arXiv preprint arXiv:2403.17806

2024/3/26

Sandro Pezzelle

H-Index: 7

Yonatan Belinkov

H-Index: 26

Accelerating the Global Aggregation of Local Explanations

Proceedings of the AAAI Conference on Artificial Intelligence

2024/3/24

Yonatan Belinkov

H-Index: 26

Benny Kimelfeld

H-Index: 19

Concept-Best-Matching: Evaluating Compositionality in Emergent Communication

arXiv preprint arXiv:2403.14705

2024/3/17

Yonatan Belinkov

H-Index: 26

Ron Meir

H-Index: 20

Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines

arXiv preprint arXiv:2403.05846

2024/3/9

Yonatan Belinkov

H-Index: 26

A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry

arXiv preprint arXiv:2402.17371

2024/2/27

Benny Kimelfeld

H-Index: 19

Yonatan Belinkov

H-Index: 26

Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking

arXiv preprint arXiv:2402.14811

2024/2/22

Tamar Rott Shaham

H-Index: 4

Yonatan Belinkov

H-Index: 26

David Bau

H-Index: 20

Backward Lens: Projecting Language Model Gradients into the Vocabulary Space

arXiv preprint arXiv:2402.12865

2024/2/20

Yonatan Belinkov

H-Index: 26

Mor Geva

H-Index: 7

Lior Wolf

H-Index: 50

Unified concept editing in diffusion models

2024

Rohit Gandikota

H-Index: 1

Yonatan Belinkov

H-Index: 26

David Bau

H-Index: 20

When language models fall in love: Animacy processing in transformer language models

arXiv preprint arXiv:2310.15004

2023/10/23

Yonatan Belinkov

H-Index: 26

Sandro Pezzelle

H-Index: 7

Linearity of relation decoding in transformer language models

arXiv preprint arXiv:2308.09124

2023/8/17

Evan Hernandez

H-Index: 1

Kevin Meng

H-Index: 2

Jacob Andreas

H-Index: 24

Yonatan Belinkov

H-Index: 26

David Bau

H-Index: 20

Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias

arXiv preprint arXiv:2308.00225

2023/8/1

Gabriel Stanovsky

H-Index: 13

Nir Rosenfeld

H-Index: 7

Yonatan Belinkov

H-Index: 26

Generating benchmarks for factuality evaluation of language models

arXiv preprint arXiv:2307.06908

2023/7/13

Ori Ram

H-Index: 2

Yoav Levine

H-Index: 7

Yonatan Belinkov

H-Index: 26

Omri Abend

H-Index: 15

Amnon Shashua

H-Index: 46

Emergent quantized communication

Proceedings of the AAAI Conference on Artificial Intelligence

2023/6/26

Ron Meir

H-Index: 20

Yonatan Belinkov

H-Index: 26

Refact: Updating text-to-image models by editing the text encoder

arXiv preprint arXiv:2306.00738

2023/6/1

Yonatan Belinkov

H-Index: 26

Understanding arithmetic reasoning in language models using causal mediation analysis

arXiv preprint arXiv:2305.15054

2023/5/24

Yonatan Belinkov

H-Index: 26

Shielded representations: Protecting sensitive attributes through iterative gradient-based projection

arXiv preprint arXiv:2305.10204

2023/5/17

Yonatan Belinkov

H-Index: 26

ContraSim--A Similarity Measure Based on Contrastive Learning

arXiv preprint arXiv:2303.16992

2023/3/29

Yonatan Belinkov

H-Index: 26

Yonatan Belinkov

Technion - Israel Institute of Technology

About Yonatan Belinkov

Yonatan Belinkov Information

Yonatan Belinkov Skills & Research Interests

Top articles of Yonatan Belinkov

Effect of tokenization on transformers for biological sequences

Tal Pupko

Yonatan Belinkov

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

Yonatan Belinkov

David Bau

Aaron Mueller

BetaAlign: a deep learning approach for multiple sequence alignment

Oren Avram

Yonatan Belinkov

Tal Pupko

Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms

Sandro Pezzelle

Yonatan Belinkov

Accelerating the Global Aggregation of Local Explanations

Yonatan Belinkov

Benny Kimelfeld

Concept-Best-Matching: Evaluating Compositionality in Emergent Communication

Yonatan Belinkov

Ron Meir

Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines

Yonatan Belinkov

A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry

Benny Kimelfeld

Yonatan Belinkov

Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking

Tamar Rott Shaham

Yonatan Belinkov

David Bau

Backward Lens: Projecting Language Model Gradients into the Vocabulary Space

Yonatan Belinkov

Mor Geva

Lior Wolf

Unified concept editing in diffusion models

Rohit Gandikota

Yonatan Belinkov

David Bau

When language models fall in love: Animacy processing in transformer language models

Yonatan Belinkov

Sandro Pezzelle

Linearity of relation decoding in transformer language models

Evan Hernandez

Kevin Meng

Jacob Andreas

Yonatan Belinkov

David Bau

Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias

Gabriel Stanovsky

Nir Rosenfeld

Yonatan Belinkov

Generating benchmarks for factuality evaluation of language models

Ori Ram

Yoav Levine

Yonatan Belinkov

Omri Abend

Amnon Shashua

Emergent quantized communication

Ron Meir

Yonatan Belinkov

Refact: Updating text-to-image models by editing the text encoder

Yonatan Belinkov

Understanding arithmetic reasoning in language models using causal mediation analysis

Yonatan Belinkov

Shielded representations: Protecting sensitive attributes through iterative gradient-based projection

Yonatan Belinkov

ContraSim--A Similarity Measure Based on Contrastive Learning

Yonatan Belinkov

Co-Authors

James Glass

Yoav Goldberg

Stuart Shieber

Benjamin Van Durme

Amir Globerson

David Bau