Jungwook Choi
Hanyang University
H-index: 27
Asia-South Korea
Top articles of Jungwook Choi
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Token-scaled logit distillation for ternary weight generative language models | Advances in Neural Information Processing Systems | Minsoo Kim Sihwa Lee Janghwan Lee Sukjin Hong Du-Seong Chang | 2024/2/13 |
Searching Optimal Floating-Point Format for Sub-8-Bit Large Language Model Inference | | Youngdeok Hwang Janghwan Lee Jiwoong Park Jieun Lim Jungwook Choi | 2024/1/28 |
Lightweight Error Correction for In-Storage Acceleration of Large Language Model Inference | | Jinwoo Jeong Byungmin Ahn Dongmin Shin Jungwook Choi | 2024/1/28 |
System-aware selective quantization for performance optimized distributed deep learning | | | 2023/1/10 |
Hybrid floating point representation for deep learning acceleration | | | 2023/4/4 |
Enhancing computation efficiency in large language models through weight and activation quantization | arXiv preprint arXiv:2311.05161 | Jangwhan Lee Minsoo Kim Seungcheol Baek Seok Joong Hwang Wonyong Sung | 2023/11/9 |
Statistics-aware weight quantization | | | 2023/1/10 |
Reusing an operand received from a first-in-first-out (FIFO) buffer according to an operand specifier value specified in a predefined field of an instruction | | | 2023/4/4 |
Range-Invariant Approximation of Non-Linear Operations for Efficient BERT Fine-Tuning | | Janghyeon Kim Janghwan Lee Jungwook Choi JeongHo Han Sangheon Lee | 2023/7/9 |
Mixed precision capable hardware for tuning a machine learning model | | | 2023/3/14 |
Architecture-Aware Optimization of Layer Fusion for Latency-Optimal CNN Inference | | Minyong Yoon Jungwook Choi | 2023/6/11 |
Teacher intervention: Improving convergence of quantization aware training for ultra-low precision transformers | arXiv preprint arXiv:2302.11812 | Minsoo Kim Kyuhong Shim Seongmin Park Wonyong Sung Jungwook Choi | 2023/2/23 |
Finding optimal numerical format for sub-8-bit post-training quantization of vision transformers | | Janghwan Lee Youngdeok Hwang Jungwook Choi | 2023/6/4 |
Exploring Attention Map Reuse for Efficient Transformer Neural Networks | arXiv preprint arXiv:2301.12444 | Kyuhong Shim Jungwook Choi Wonyong Sung | 2023/1/29 |
PillarAcc: Sparse PointPillars Accelerator for Real-Time Point Cloud 3D Object Detection on Edge Devices | arXiv preprint arXiv:2305.07522 | Minjae Lee Hyungmin Kim Seongmin Park Minyong Yoon Janghwan Lee | 2023/5/12 |
Power-Efficient Deep Neural Network Accelerator Minimizing Global Buffer Access without Data Transfer between Neighboring Multiplier—Accumulator Units | Electronics | Jeonghyeok Lee Sangwook Han Seungwon Choi Jungwook Choi | 2022/6/25 |
Understanding and improving knowledge distillation for quantization-aware training of large transformer encoders | arXiv preprint arXiv:2211.11014 | Minsoo Kim Sihwa Lee Sukjin Hong Du-Seong Chang Jungwook Choi | 2022/11/20 |
Learning from distinctive candidates to optimize reduced-precision convolution program on tensor cores | arXiv preprint arXiv:2202.06819 | Junkyeong Choi Hyucksung Kwon Woongkyu Lee Jungwook Choi Jieun Lim | 2022/2/11 |
Understanding and Optimizing INT4 Convolution for Accelerated DNN Inference on Tensor Cores | | Junkyeong Choi Hyucksung Kwon Woongkyu Lee Jieun Lim Jungwook Choi | 2022/11/2 |
Optimizing Exponent Bias for Sub-8bit Floating-Point Inference of Fine-tuned Transformers | | Janghwan Lee Jungwook Choi | 2022/6/13 |