Kurt Keutzer
University of California, Berkeley
H-index: 101
North America-United States
Top articles of Kurt Keutzer
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
A Dataset and Benchmark for Copyright Protection from Text-to-Image Diffusion Models | arXiv preprint arXiv:2403.12052 | Rui Ma Qiang Zhou Bangjun Xiao Yizhu Jin Daquan Zhou | 2024/1/4 |
Large language models are visual reasoning coordinators | Advances in Neural Information Processing Systems | Liangyu Chen Bo Li Sheng Shen Jingkang Yang Chunyuan Li | 2024/2/13 |
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement | arXiv preprint arXiv:2403.15042 | Nicholas Lee Thanakul Wattanawong Sehoon Kim Karttikeya Mangalam Sheng Shen | 2024/3/22 |
Multitask vision-language prompt tuning | WACV 2024 | Sheng Shen* Shijia Yang* Tianjun Zhang* Bohan Zhai Joseph E Gonzalez | 2022/11/21 |
Towards foundation models for scientific machine learning: Characterizing scaling and transfer behavior | Advances in Neural Information Processing Systems | Shashank Subramanian Peter Harrington Kurt Keutzer Wahid Bhimji Dmitriy Morozov | 2024/2/13 |
Q-slam: Quadric representations for monocular slam | arXiv preprint arXiv:2403.08125 | Chensheng Peng Chenfeng Xu Yue Wang Mingyu Ding Heng Yang | 2024/3/12 |
Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning | arXiv preprint arXiv:2404.08985 | Yijiang Liu Rongyu Zhang Huanrui Yang Kurt Keutzer Yuan Du | 2024/4/13 |
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization | arXiv preprint arXiv:2401.18079 | Coleman Hooper Sehoon Kim Hiva Mohammadzadeh Michael W Mahoney Yakun Sophia Shao | 2024/1/31 |
LLM Inference Unveiled: Survey and Roofline Model Insights | arXiv preprint arXiv:2402.16363 | Zhihang Yuan Yuzhang Shang Yang Zhou Zhen Dong Chenhao Xue | 2024/2/26 |
LLoCO: Learning Long Contexts Offline | arXiv preprint arXiv:2404.07979 | Sijun Tan Xiuyu Li Shishir Patil Ziyang Wu Tianjun Zhang | 2024/4/11 |
Learned Best-Effort LLM Serving | arXiv preprint arXiv:2401.07886 | Siddharth Jha Coleman Hooper Xiaoxuan Liu Sehoon Kim Kurt Keutzer | 2024/1/15 |
Magic-Me: Identity-Specific Video Customized Diffusion | arXiv preprint arXiv:2402.09368 | Ze Ma Daquan Zhou Chun-Hsiao Yeh Xue-She Wang Xiuyu Li | 2024/2/14 |
Ai and memory wall | IEEE Micro | Amir Gholami Zhewei Yao Sehoon Kim Coleman Hooper Michael W Mahoney | 2024/3/25 |
VeCAF: VLM-empowered Collaborative Active Finetuning with Training Objective Awareness | arXiv preprint arXiv:2401.07853 | Rongyu Zhang Zefan Cai Huanrui Yang Zidong Liu Denis Gudovskiy | 2024/1/15 |
Speculative decoding with big little decoder | Advances in Neural Information Processing Systems | Sehoon Kim Karttikeya Mangalam Suhong Moon Jitendra Malik Michael W Mahoney | 2024/2/13 |
Efficient Deweahter Mixture-of-Experts with Uncertainty-Aware Feature-Wise Linear Modulation | Proceedings of the AAAI Conference on Artificial Intelligence | Rongyu Zhang Yulin Luo Jiaming Liu Huanrui Yang Zhen Dong | 2024/3/24 |
Scaling vision-language models with sparse mixture of experts | EMNLP 2023 Findings | Sheng Shen Zhewei Yao Chunyuan Li Trevor Darrell Kurt Keutzer | 2023/3/13 |
Treating Models Better for Language-agnostic Understanding | Brian Yu Kurt Keutzer John DeNero | 2023/5/12 | |
SANA: Sensitivity-Aware Neural Architecture Adaptation for Uniform Quantization | Applied Sciences | Mingfei Guo Zhen Dong Kurt Keutzer | 2023/9/15 |
Sparse Refinement for Efficient High-Resolution Semantic Segmentation | Zhijian Liu Zhuoyang Zhang Shang Yang Haotian Tang Chenfeng Xu | 2023/10/13 |