Simeng Sun
University of Massachusetts Amherst
H-index: 9
North America-United States
Top articles of Simeng Sun
RULER: What's the Real Context Size of Your Long-Context Language Models?
arXiv preprint arXiv:2404.06654
2024/4/9
Simeng Sun
H-Index: 3
Fei Jia
H-Index: 11
Towards Effective Modeling of Long-Range Context
2024/1
Simeng Sun
H-Index: 3
Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF
arXiv preprint arXiv:2309.09055
2023/9/16
PEARL: Prompting large language models to plan and execute actions over long documents
arXiv preprint arXiv:2305.14564
2023/5/23
How does in-context learning help prompt tuning?
arXiv preprint arXiv:2302.11521
2023/2/22
Efficiently Upgrading Multilingual Machine Translation Models to Support More Languages
2023/2/7
Simeng Sun
H-Index: 3
James Cross
H-Index: 14
TopicGPT: A prompt-based topic modeling framework
arXiv preprint arXiv:2311.01449
2023/11/2
How Much Do Modifications to Transformer Language Models Affect Their Ability to Learn Linguistic Knowledge?
2022/5
Simeng Sun
H-Index: 3
Mohit Iyyer
H-Index: 21
ChapterBreak: A Challenge Dataset for Long-Range Language Models
2022/4/22
Simeng Sun
H-Index: 3
Mohit Iyyer
H-Index: 21
Alternative Input Signals Ease Transfer in Multilingual Machine Translation
2022
Do Long-Range Language Models Actually Use Long-Range Context?
2021/9/19
IGA: An intent-guided authoring assistant
2021/4/14
Revisiting simple neural probabilistic language models
2021/4/8
Simeng Sun
H-Index: 3
Mohit Iyyer
H-Index: 21
Energy-based reranking: Improving neural machine translation using energy-based models
arXiv preprint arXiv:2009.13267
2020/9/20
Hard-coded Gaussian attention for neural machine translation
2020/5/2