Maosong Sun
Tsinghua University
H-index: 90
Asia-China
Top articles of Maosong Sun
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Personality-affected Emotion Generation in Dialog Systems | ACM Transactions on Information Systems | Zhiyuan Wen, Jiannong Cao, Jiaxing Shen, Ruosong Yang, Shuaiqi Liu | 2024/3 |
Advancing LLM Reasoning Generalists with Preference Trees | arXiv preprint arXiv:2404.02078 | Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang | 2024/4/2 |
Investigate-Consolidate-Exploit: A General Strategy for Inter-Task Agent Self-Evolution | arXiv preprint arXiv:2401.13996 | Cheng Qian, Shihao Liang, Yujia Qin, Yining Ye, Xin Cong | 2024/1/25 |
Exploring Perceptual Limitation of Multimodal Large Language Models | arXiv preprint arXiv:2402.07384 | Jiarui Zhang, Jinyi Hu, Mahyar Khayatkhoei, Filip Ilievski, Maosong Sun | 2024/2/12 |
LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks | arXiv preprint arXiv:2402.11455 | Hanqing Wang, Bowen Ping, Shuo Wang, Xu Han, Yun Chen | 2024/2/18 |
ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models | arXiv preprint arXiv:2402.13516 | Chenyang Song, Xu Han, Zhengyan Zhang, Shengding Hu, Xiyu Shi | 2024/2/21 |
Unified View of Grokking, Double Descent and Emergent Abilities: A Perspective from Circuits Competition | arXiv preprint arXiv:2402.15175 | Yufei Huang, Shengding Hu, Xu Han, Zhiyuan Liu, Maosong Sun | 2024/2/23 |
Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment | arXiv preprint arXiv:2402.19085 | Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding, Jiexin Wang | 2024/2/29 |
Generating chord progression from melody with flexible harmonic rhythm and controllable harmonic density | EURASIP Journal on Audio, Speech, and Music Processing | Shangda Wu, Yue Yang, Zhaowen Wang, Xiaobing Li, Maosong Sun | 2024/1/15 |
Robust and Scalable Model Editing for Large Language Models | arXiv preprint arXiv:2403.17431 | Yingfa Chen, Zhengyan Zhang, Xu Han, Chaojun Xiao, Zhiyuan Liu | 2024/3/26 |
InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory | arXiv preprint arXiv:2402.04617 | Chaojun Xiao, Pengle Zhang, Xu Han, Guangxuan Xiao, Yankai Lin | 2024/2/7 |
Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents | arXiv preprint arXiv:2402.09205 | Cheng Qian, Bingxiang He, Zhong Zhuang, Jia Deng, Yujia Qin | 2024/2/14 |
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting | arXiv preprint arXiv:2402.13720 | Weilin Zhao, Yuxiang Huang, Xu Han, Chaojun Xiao, Zhiyuan Liu | 2024/2/21 |
OMGEval: An Open Multilingual Generative Evaluation Benchmark for Large Language Models | arXiv preprint arXiv:2402.13524 | Yang Liu, Meng Xu, Shuo Wang, Liner Yang, Haoyu Wang | 2024/2/21 |
Beyond Language Models: Byte Models are Digital World Simulators | arXiv preprint arXiv:2402.19155 | Shangda Wu, Xu Tan, Zili Wang, Rui Wang, Xiaobing Li | 2024/2/29 |
DebugBench: Evaluating Debugging Capability of Large Language Models | arXiv preprint arXiv:2401.04621 | Runchu Tian, Yining Ye, Yujia Qin, Xin Cong, Yankai Lin | 2024/1/9 |
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images | arXiv preprint arXiv:2403.11703 | Ruyi Xu, Yuan Yao, Zonghao Guo, Junbo Cui, Zanlin Ni | 2024/3/18 |
LEGENT: Open Platform for Embodied Agents | arXiv preprint arXiv:2404.18243 | Zhili Cheng, Zhitong Wang, Jinyi Hu, Shengding Hu, An Liu | 2024/4/28 |
UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset | arXiv preprint arXiv:2402.04588 | Haoyu Wang, Shuo Wang, Yukun Yan, Xujia Wang, Zhiyu Yang | 2024/2/7 |
H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer Training | Advances in Neural Information Processing Systems | Yuzhong Wang, Xu Han, Weilin Zhao, Guoyang Zeng, Zhiyuan Liu | 2024/2/13 |