Yang You
National University of Singapore
H-index: 24
Asia-Singapore
Top articles of Yang You
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Response length perception and sequence scheduling: An llm-empowered llm inference pipeline | Advances in Neural Information Processing Systems | Zangwei Zheng Xiaozhe Ren Fuzhao Xue Yang Luo Xin Jiang | 2024/2/13 |
Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning | arXiv preprint arXiv:2402.15751 | Yong Liu Zirui Zhu Chaoyu Gong Minhao Cheng Cho-Jui Hsieh | 2024/2/24 |
How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning? | arXiv preprint arXiv:2404.12866 | Yang Luo Zangwei Zheng Zirui Zhu Yang You | 2024/4/19 |
AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference | arXiv preprint arXiv:2401.10652 | Xuanlei Zhao Shenggan Cheng Guangyang Lu Jiarui Fang Haotian Zhou | 2024/1/19 |
Two Trades is not Baffled: Condense Graph via Crafting Rational Gradient Matching | arXiv preprint arXiv:2402.04924 | Tianle Zhang Yuchen Zhang Kun Wang Kai Wang Beining Yang | 2024/2/7 |
Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization | arXiv preprint arXiv:2403.00798 | Zirui Zhu Yong Liu Zangwei Zheng Huifeng Guo Yang You | 2024/2/23 |
Summarizing Stream Data for Memory-Constrained Online Continual Learning | Proceedings of the AAAI Conference on Artificial Intelligence | Jianyang Gu Kai Wang Wei Jiang Yang You | 2024/3/24 |
Must: Maximizing Latent Capacity of Spatial Transcriptomics Data | arXiv preprint arXiv:2401.07543 | Zelin Zang Liangyu Li Yongjie Xu Chenrui Duan Kai Wang | 2024/1/15 |
Navigating Complexity: Toward Lossless Graph Condensation via Expanding Window Matching | arXiv preprint arXiv:2402.05011 | Yuchen Zhang Tianle Zhang Kai Wang Ziyao Guo Yuxuan Liang | 2024/2/7 |
Neural Network Diffusion | arXiv preprint arXiv:2402.13144 | Kai Wang Zhaopan Xu Yukun Zhou Zelin Zang Trevor Darrell | 2024/2/20 |
DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers | arXiv preprint arXiv:2403.10266 | Xuanlei Zhao Shenggan Cheng Zangwei Zheng Zheming Yang Ziming Liu | 2024/3/15 |
KMT-PLL: K-Means Cross-Attention Transformer for Partial Label Learning | IEEE Transactions on Neural Networks and Learning Systems | Jinfu Fan Linqing Huang Chaoyu Gong Yang You Min Gan | 2024/1/9 |
RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents | arXiv preprint arXiv:2402.03610 | Tomoyuki Kagaya Thong Jing Yuan Yuxuan Lou Jayashree Karlekar Sugiri Pranata | 2024/2/6 |
Does graph distillation see like vision dataset counterpart? | Beining Yang Kai Wang Qingyun Sun Cheng Ji Xingcheng Fu | 2023 | |
FastFold: Optimizing AlphaFold Training and Inference on GPU Clusters | Shenggan Cheng Xuanlei Zhao Guangyang Lu Jiarui Fang Tian Zheng | 2024/3/2 | |
Openmoe: An early effort on open mixture-of-experts language models | arXiv preprint arXiv:2402.01739 | Fuzhao Xue Zian Zheng Yao Fu Jinjie Ni Zangwei Zheng | 2024/1/29 |
To repeat or not to repeat: Insights from scaling llm under token-crisis | Advances in Neural Information Processing Systems | Fuzhao Xue Yao Fu Wangchunshu Zhou Zangwei Zheng Yang You | 2024/2/13 |
Self-filling evidential clustering for partial multi-view data | Expert Systems with Applications | Chaoyu Gong Yang You | 2024/3/1 |
Sparse Reconstructive Evidential Clustering for Multi-View Data | IEEE/CAA Journal of Automatica Sinica | Chaoyu Gong Yang You | 2024/1/29 |
An evidential multi-target domain adaptation method based on weighted fusion for cross-domain pattern classification | IEEE Transactions on Neural Networks and Learning Systems | Linqing Huang Wangbo Zhao Yong Liu Duo Yang Alan Wee-Chung Liew | 2023/5 |