Yonatan Belinkov

About Yonatan Belinkov

Yonatan Belinkov is a researcher at Technion - Israel Institute of Technology, with an h-index of 43 overall and 40 since 2020. He specializes in Natural Language Processing, Model Interpretability, and Artificial Intelligence.

His recent articles include:

Effect of tokenization on transformers for biological sequences

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

BetaAlign: a deep learning approach for multiple sequence alignment

Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms

Accelerating the Global Aggregation of Local Explanations

Concept-Best-Matching: Evaluating Compositionality in Emergent Communication

Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines

A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry

Yonatan Belinkov Information

University: Technion - Israel Institute of Technology

Citations (all): 8836
Citations (since 2020): 8248
Cited by: 2585
h-index (all): 43
h-index (since 2020): 40
i10-index (all): 76
i10-index (since 2020): 74

Yonatan Belinkov Skills & Research Interests

Natural Language Processing

Model Interpretability

Artificial Intelligence

Top articles of Yonatan Belinkov

Effect of tokenization on transformers for biological sequences

Bioinformatics

2024/4/12

Tal Pupko

H-Index: 40

Yonatan Belinkov

H-Index: 26

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

arXiv preprint arXiv:2403.19647

2024/3/28

BetaAlign: a deep learning approach for multiple sequence alignment

bioRxiv

2024/3/27

Oren Avram

H-Index: 5

Yonatan Belinkov

H-Index: 26

Tal Pupko

H-Index: 40

Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms

arXiv preprint arXiv:2403.17806

2024/3/26

Sandro Pezzelle

H-Index: 7

Yonatan Belinkov

H-Index: 26

Accelerating the Global Aggregation of Local Explanations

Proceedings of the AAAI Conference on Artificial Intelligence

2024/3/24

Yonatan Belinkov

H-Index: 26

Benny Kimelfeld

H-Index: 19

Concept-Best-Matching: Evaluating Compositionality in Emergent Communication

arXiv preprint arXiv:2403.14705

2024/3/17

Yonatan Belinkov

H-Index: 26

Ron Meir

H-Index: 20

Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines

arXiv preprint arXiv:2403.05846

2024/3/9

Yonatan Belinkov

H-Index: 26

A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry

arXiv preprint arXiv:2402.17371

2024/2/27

Benny Kimelfeld

H-Index: 19

Yonatan Belinkov

H-Index: 26

Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking

arXiv preprint arXiv:2402.14811

2024/2/22

Backward Lens: Projecting Language Model Gradients into the Vocabulary Space

arXiv preprint arXiv:2402.12865

2024/2/20

Yonatan Belinkov

H-Index: 26

Mor Geva

H-Index: 7

Lior Wolf

H-Index: 50

Unified concept editing in diffusion models

2024

When language models fall in love: Animacy processing in transformer language models

arXiv preprint arXiv:2310.15004

2023/10/23

Yonatan Belinkov

H-Index: 26

Sandro Pezzelle

H-Index: 7

Linearity of relation decoding in transformer language models

arXiv preprint arXiv:2308.09124

2023/8/17

Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias

arXiv preprint arXiv:2308.00225

2023/8/1

Generating benchmarks for factuality evaluation of language models

arXiv preprint arXiv:2307.06908

2023/7/13

Emergent quantized communication

Proceedings of the AAAI Conference on Artificial Intelligence

2023/6/26

Ron Meir

H-Index: 20

Yonatan Belinkov

H-Index: 26

Refact: Updating text-to-image models by editing the text encoder

arXiv preprint arXiv:2306.00738

2023/6/1

Yonatan Belinkov

H-Index: 26

Understanding arithmetic reasoning in language models using causal mediation analysis

arXiv preprint arXiv:2305.15054

2023/5/24

Yonatan Belinkov

H-Index: 26

Shielded representations: Protecting sensitive attributes through iterative gradient-based projection

arXiv preprint arXiv:2305.10204

2023/5/17

Yonatan Belinkov

H-Index: 26

ContraSim--A Similarity Measure Based on Contrastive Learning

arXiv preprint arXiv:2303.16992

2023/3/29

Yonatan Belinkov

H-Index: 26
