Tuo Zhao
Georgia Institute of Technology
H-index: 39
North America-United States
Top articles of Tuo Zhao
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Gear: An efficient kv cache compression recipefor near-lossless generative inference of llm | arXiv preprint arXiv:2403.05527 | Hao Kang Qingru Zhang Souvik Kundu Geonhwa Jeong Zaoxing Liu | 2024/3/8 |
To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DRO | arXiv preprint arXiv:2404.04575 (ICML) | Zi-Hao Qiu Siqi Guo Mao Xu Tuo Zhao Lijun Zhang | 2024/4/6 |
Stochastic Constrained Decentralized Optimization for Machine Learning with Fewer Data Oracles: a Gradient Sliding Approach | arXiv preprint arXiv:2404.02511 | Hoang Huy Nguyen Yan Li Tuo Zhao | 2024/4/3 |
HART: Efficient Adaptation via Regularized Autoregressive Parameter Generation | Chen Liang Nikos Karampatziakis Tuo Zhao Weizhu Chen | 2023/10/13 | |
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms | Advances in Neural Information Processing Systems | Shenao Zhang Boyi Liu Zhaoran Wang Tuo Zhao | 2024/2/13 |
HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers | arXiv preprint arXiv:2302.09632 | Chen Liang Haoming Jiang Zheng Li Xianfeng Tang Bin Yin | 2023/2/19 |
LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation | arXiv preprint arXiv:2306.11222 (ICML) | Yixiao Li Yifan Yu Qingru Zhang Chen Liang Pengcheng He | 2023/6/20 |
Deep Reinforcement Learning from Hierarchical Weak Preference Feedback | arXiv preprint arXiv:2309.02632 | Alexander Bukharin Yixiao Li Pengcheng He Weizhu Chen Tuo Zhao | 2023/9/6 |
LoftQ: Lora-fine-tuning-aware quantization for large language models | arXiv preprint arXiv:2310.08659 (ICLR) | Yixiao Li Yifan Yu Chen Liang Pengcheng He Nikos Karampatziakis | 2023/10/12 |
Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult | arXiv preprint arXiv:2310.17087 | Yuqing Wang Zhenghao Xu Tuo Zhao Molei Tao | 2023/10/26 |
HadSkip: Homotopic and Adaptive Layer Skipping of Pre-trained Language Models for Efficient Inference | Haoyu Wang Yaqing Wang Tianci Liu Tuo Zhao Jing Gao | 2023/12/1 | |
Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data | Minshuo Chen Kaixuan Huang Tuo Zhao Mengdi Wang | 2023/7/3 | |
County augmented transformer for COVID-19 state hospitalizations prediction | Scientific Reports | Siawpeng Er Shihao Yang Tuo Zhao | 2023/6/20 |
LightToken: A Task and Model-agnostic Lightweight Token Embedding Framework for Pre-trained Language Models | Haoyu Wang Ruirui Li Haoming Jiang Zhengyang Wang Xianfeng Tang | 2023/8/6 | |
Module-wise Adaptive Distillation for Multimodality Foundation Models | NeurIPS 2023, arXiv preprint arXiv:2310.04550 | Chen Liang Jiahui Yu Ming-Hsuan Yang Matthew Brown Yin Cui | 2023/10/6 |
Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point Process with Uncertainty Quantification | arXiv preprint arXiv:2310.16310 | Zichong Li Qunzhi Xu Zhenghao Xu Yajun Mei Tuo Zhao | 2023/10/25 |
High dimensional binary classification under label shift: phase transition and regularization | Sampling Theory, Signal Processing, and Data Analysis | Jiahui Cheng Minshuo Chen Hao Liu Tuo Zhao Wenjing Liao | 2023/12 |
Machine Learning Force Fields with Data Cost Aware Training | Alexander Bukharin Tianyi Liu Shengjie Wang Simiao Zuo Weihao Gao | 2023/7/3 | |
Provable benefits of policy learning from human preferences in contextual bandit problems | arXiv preprint arXiv:2307.12975 | Xiang Ji Huazheng Wang Minshuo Chen Tuo Zhao Mengdi Wang | 2023/7/24 |
Sample Complexity of Neural Policy Mirror Descent for Policy Optimization on Low-Dimensional Manifolds | arXiv preprint arXiv:2309.13915 | Zhenghao Xu Xiang Ji Minshuo Chen Mengdi Wang Tuo Zhao | 2023/9/25 |