Tianxiang Sun (孙天祥)
Fudan University
H-index: 14
Asia-China
Top articles of Tianxiang Sun (孙天祥)
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Turn Waste into Worth: Rectifying Top- Router of MoE | arXiv preprint arXiv:2402.12399 | Zhiyuan Zeng Qipeng Guo Zhaoye Fei Zhangyue Yin Yunhua Zhou | 2024/2/17 |
LLM can Achieve Self-Regulation via Hyperparameter Aware Generation | arXiv preprint arXiv:2402.11251 | Siyin Wang Shimin Li Tianxiang Sun Jinlan Fu Qinyuan Cheng | 2024/2/17 |
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance | arXiv preprint arXiv:2403.16952 | Jiasheng Ye Peiju Liu Tianxiang Sun Yunhua Zhou Jun Zhan | 2024/3/25 |
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning | arXiv preprint arXiv:2401.13621 | Xinghao Wang Junliang He Pengyu Wang Yunhua Zhou Tianxiang Sun | 2024/1/24 |
In-Memory Learning: A Declarative Learning Framework for Large Language Models | arXiv preprint arXiv:2403.02757 | Bo Wang Tianxiang Sun Hang Yan Siyin Wang Qingyuan Cheng | 2024/3/5 |
Can AI Assistants Know What They Don't Know? | arXiv preprint arXiv:2401.13275 | Qinyuan Cheng Tianxiang Sun Xiangyang Liu Wenwei Zhang Zhangyue Yin | 2024/1/24 |
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling | arXiv preprint arXiv:2402.12226 | Jun Zhan Junqi Dai Jiasheng Ye Yunhua Zhou Dong Zhang | 2024/2/19 |
Agent Alignment in Evolving Social Norms | arXiv preprint arXiv:2401.04620 | Shimin Li Tianxiang Sun Xipeng Qiu | 2024/1/9 |
Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT | arXiv preprint arXiv:2402.12201 | Zhengfu He Xuyang Ge Qiong Tang Tianxiang Sun Qinyuan Cheng | 2024/2/19 |
Competition for gradient-free tuning of large language models: approaches, results, current challenges and future directions | National Science Review | Tingfeng Cao Liang Chen Dixiang Zhang Tianxiang Sun Zhengfu He | 2023/6 |
Llatrieval: Llm-verified retrieval for verifiable generation | arXiv preprint arXiv:2311.07838 | Xiaonan Li Changtai Zhu Linyang Li Zhangyue Yin Tianxiang Sun | 2023/11/14 |
CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors | arXiv preprint arXiv:2305.05711 | Peng Li Tianxiang Sun Qiong Tang Hang Yan Yuanbin Wu | 2023/5/9 |
Flames: Benchmarking value alignment of chinese large language models | arXiv preprint arXiv:2311.06899 | Kexin Huang Xiangyang Liu Qianyu Guo Tianxiang Sun Jiawei Sun | 2023/11/12 |
Improving Contrastive Learning of Sentence Embeddings from AI Feedback | arXiv preprint arXiv:2305.01918 | Qinyuan Cheng Xiaogui Yang Tianxiang Sun Linyang Li Xipeng Qiu | 2023/5/3 |
Evaluating hallucinations in chinese large language models | arXiv preprint arXiv:2310.03368 | Qinyuan Cheng Tianxiang Sun Wenwei Zhang Siyin Wang Xiangyang Liu | 2023/10/5 |
Origin tracing and detecting of llms | arXiv preprint arXiv:2304.14072 | Linyang Li Pengyu Wang Ke Ren Tianxiang Sun Xipeng Qiu | 2023/4/27 |
Secrets of rlhf in large language models part i: Ppo | arXiv preprint arXiv:2307.04964 | Rui Zheng Shihan Dou Songyang Gao Yuan Hua Wei Shen | 2023/7/11 |
Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning | arXiv preprint arXiv:2210.07565 | Tianxiang Sun Zhengfu He Qin Zhu Xipeng Qiu Xuanjing Huang | 2022/10/14 |
Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts | arXiv preprint arXiv:2210.11292 | Xiangyang Liu Tianxiang Sun Xuanjing Huang Xipeng Qiu | 2022/10/20 |
BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation | arXiv preprint arXiv:2210.07626 | Tianxiang Sun Junliang He Xipeng Qiu Xuanjing Huang | 2022/10/14 |