Chin-Hui Lee
Georgia Institute of Technology
H-index: 85
North America-United States
Top articles of Chin-Hui Lee
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
The multimodal information based speech processing (MISP) 2023 challenge: Audio-visual target speaker extraction | | Shilong Wu Chenxi Wang Hang Chen Yusheng Dai Chenyue Zhang | 2024/4/14 |
Optimizing Audio-Visual Speech Enhancement Using Multi-Level Distortion Measures for Audio-Visual Speech Recognition | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Hang Chen Qing Wang Jun Du Bao-Cai Yin Jia Pan | 2024/4/25 |
Boosting End-to-End Multilingual Phoneme Recognition Through Exploiting Universal Speech Attributes Constraints | | Hao Yen Sabato Marco Siniscalchi Chin-Hui Lee | 2024/4/14 |
Collaborative Viseme Subword and End-to-end Modeling for Word-level Lip Reading | IEEE Transactions on Multimedia | Hang Chen Qing Wang Jun Du Gen-Shun Wan Shi-Fu Xiong | 2024/4/17 |
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition | arXiv preprint arXiv:2403.04245 | Yusheng Dai Hang Chen Jun Du Ruoyu Wang Shihao Chen | 2024/3/7 |
Improving Multi-Modal Emotion Recognition Using Entropy-Based Fusion and Pruning-Based Network Architecture Optimization | | Haotian Wang Jun Du Yusheng Dai Chin-Hui Lee Yuling Ren | 2024/4/14 |
Bayesian adaptive learning to latent variables via Variational Bayes and Maximum a Posteriori | arXiv preprint arXiv:2401.13766 | Hu Hu Sabato Marco Siniscalchi Chin-Hui Lee | 2024/1/24 |
A Spatial Long-Term Iterative Mask Estimation Approach for Multi-Channel Speaker Diarization and Speech Recognition | | Feng Ma Yanhui Tu Maokui He Ruoyu Wang Shutong Niu | 2024/4/14 |
Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture | | Gaobin Yang Maokui He Shutong Niu Ruoyu Wang Yanyan Yue | 2024/4/14 |
Correlated Multi-Level Speech Enhancement for Robust Real-World ASR Applications Using Mask-Waveform-Feature Optimization | | Hang Chen Jun Du Zhe Wang Chenxi Wang Yuling Ren | 2023/10/31 |
Joint Speech and Noise Estimation Using SNR-Adaptive Target Learning for Deep-Learning-Based Speech Enhancement | | Xiaoran Li Zilu Guo Jun Du Chin-Hui Lee Yu Gao | 2023/12/8 |
Using iterative adaptation and dynamic mask for child speech extraction under real-world multilingual conditions | Speech Communication | Shi Cheng Jun Du Shutong Niu Alejandrina Cristia Xin Wang | 2023/7/1 |
Incorporating visual information reconstruction into progressive learning for optimizing audio-visual speech enhancement | | Chen-Yue Zhang Hang Chen Jun Du Bao-Cai Yin Jia Pan | 2023/6/4 |
A Multiple-Teacher Pruning Based Self-Distillation (MT-PSD) Approach to Model Compression for Audio-Visual Wake Word Spotting | | Haotian Wang Jun Du Hengshun Zhou Chin-Hui Lee Yuling Ren | 2023 |
A four-stage data augmentation approach to resnet-conformer based acoustic modeling for sound event localization and detection | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Qing Wang Jun Du Hua-Xin Wu Jia Pan Feng Ma | 2023/3/13 |
Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning | arXiv preprint arXiv:2309.09270 | Zilu Guo Jun Du Chin-Hui Lee | 2023/9/17 |
Joint Time-Domain and Frequency-Domain Progressive Learning for Single-Channel Speech Enhancement and Recognition | | Gongzhen Zou Jun Du Shutong Niu Hang Chen Yuling Ren | 2023/12/8 |
Variance-preserving-based interpolation diffusion models for speech enhancement | arXiv preprint arXiv:2306.08527 | Zilu Guo Jun Du Chin-Hui Lee Yu Gao Wenbin Zhang | 2023/6/14 |
Incorporating Lip Features into Audio-Visual Multi-Speaker DOA Estimation by Gated Fusion | | Ya Jiang Hang Chen Jun Du Qing Wang Chin-Hui Lee | 2023/6/4 |
QDM-SSD: quality-aware dynamic masking for separation-based speaker diarization | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Shu-Tong Niu Jun Du Lei Sun Yu Hu Chin-Hui Lee | 2023/2/13 |