Yonatan Belinkov
Technion - Israel Institute of Technology
H-index: 43
Asia-Israel
Top articles of Yonatan Belinkov
Effect of tokenization on transformers for biological sequences
Bioinformatics
2024/4/12
Tal Pupko
H-Index: 40
Yonatan Belinkov
H-Index: 26
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
arXiv preprint arXiv:2403.19647
2024/3/28
BetaAlign: a deep learning approach for multiple sequence alignment
bioRxiv
2024/3/27
Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms
arXiv preprint arXiv:2403.17806
2024/3/26
Sandro Pezzelle
H-Index: 7
Yonatan Belinkov
H-Index: 26
Accelerating the Global Aggregation of Local Explanations
Proceedings of the AAAI Conference on Artificial Intelligence
2024/3/24
Yonatan Belinkov
H-Index: 26
Benny Kimelfeld
H-Index: 19
Concept-Best-Matching: Evaluating Compositionality in Emergent Communication
arXiv preprint arXiv:2403.14705
2024/3/17
Yonatan Belinkov
H-Index: 26
Ron Meir
H-Index: 20
Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
arXiv preprint arXiv:2403.05846
2024/3/9
Yonatan Belinkov
H-Index: 26
A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry
arXiv preprint arXiv:2402.17371
2024/2/27
Benny Kimelfeld
H-Index: 19
Yonatan Belinkov
H-Index: 26
Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking
arXiv preprint arXiv:2402.14811
2024/2/22
Backward Lens: Projecting Language Model Gradients into the Vocabulary Space
arXiv preprint arXiv:2402.12865
2024/2/20
Unified concept editing in diffusion models
2024
When language models fall in love: Animacy processing in transformer language models
arXiv preprint arXiv:2310.15004
2023/10/23
Yonatan Belinkov
H-Index: 26
Sandro Pezzelle
H-Index: 7
Linearity of relation decoding in transformer language models
arXiv preprint arXiv:2308.09124
2023/8/17
Evan Hernandez
H-Index: 1
Kevin Meng
H-Index: 2
Jacob Andreas
H-Index: 24
Yonatan Belinkov
H-Index: 26
David Bau
H-Index: 20
Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias
arXiv preprint arXiv:2308.00225
2023/8/1
Generating benchmarks for factuality evaluation of language models
arXiv preprint arXiv:2307.06908
2023/7/13
Ori Ram
H-Index: 2
Yoav Levine
H-Index: 7
Yonatan Belinkov
H-Index: 26
Omri Abend
H-Index: 15
Amnon Shashua
H-Index: 46
Emergent quantized communication
Proceedings of the AAAI Conference on Artificial Intelligence
2023/6/26
Ron Meir
H-Index: 20
Yonatan Belinkov
H-Index: 26
Refact: Updating text-to-image models by editing the text encoder
arXiv preprint arXiv:2306.00738
2023/6/1
Yonatan Belinkov
H-Index: 26
Understanding arithmetic reasoning in language models using causal mediation analysis
arXiv preprint arXiv:2305.15054
2023/5/24
Yonatan Belinkov
H-Index: 26
Shielded representations: Protecting sensitive attributes through iterative gradient-based projection
arXiv preprint arXiv:2305.10204
2023/5/17
Yonatan Belinkov
H-Index: 26
ContraSim--A Similarity Measure Based on Contrastive Learning
arXiv preprint arXiv:2303.16992
2023/3/29
Yonatan Belinkov
H-Index: 26