Haizhou Li
National University of Singapore
H-index: 76
Asia-Singapore
Top articles of Haizhou Li
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement | ICASSP 2024 | Qiquan Zhang Meng Ge Hongxu Zhu Eliathamby Ambikairajah Qi Song | 2024/1/18 |
LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism | Yu Chen Xinyuan Qian Zexu Pan Kainan Chen Haizhou Li | 2024/4/14 | |
Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio | ICASSP 2024 | Yi Ma Kong Aik Lee Ville Hautamäki Meng Ge Haizhou Li | 2024/1/5 |
Enhancing Real-World Active Speaker Detection with Multi-Modal Extraction Pre-Training | arXiv preprint arXiv:2404.00861 | Ruijie Tao Xinyuan Qian Rohan Kumar Das Xiaoxue Gao Jiadong Wang | 2024/4/1 |
Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Rui Liu Berrak Sisman Guanglai Gao Haizhou Li | 2024/4/1 |
A comprehensive analysis of the effectiveness of large language models as automatic dialogue evaluators | Chen Zhang Luis Fernando D'Haro Yiming Chen Malu Zhang Haizhou Li | 2024/2 | |
Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks | arXiv preprint arXiv:2402.15725 | Duo Ma Xianghu Yue Junyi Ao Xiaoxue Gao Haizhou Li | 2024/2/24 |
Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition | Shuai Wang Qibing Bai Qi Liu Jianwei Yu Zhengyang Chen | 2024/4/14 | |
Speaker Extraction with Detection of Presence and Absence of Target Speakers | Ke Zhang Marvin Borsdorf Zexu Pan Haizhou Li Yangjie Wei | 2023 | |
Quantify Health-Related Atomic Knowledge in Chinese Medical Large Language Models: A Computational Analysis | arXiv preprint arXiv:2310.11722 | Yaxin Fan Feng Jiang Peifeng Li Haizhou Li | 2023/10/18 |
XAnet: Cross-Attention Between EEG of Left and Right Brain for Auditory Attention Decoding | Saurav Pahuja Siqi Cai Tanja Schultz Haizhou Li | 2023/4/24 | |
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification | arXiv preprint arXiv:2312.03620 | Tianchi Liu Kong Aik Lee Qiongqiong Wang Haizhou Li | 2023/12/6 |
Speaker recognition with two-step multi-modal deep cleansing | Ruijie Tao Kong Aik Lee Zhan Shi Haizhou Li | 2023/6/4 | |
Dynamic Transformers Provide a False Sense of Efficiency | arXiv preprint arXiv:2305.12228 | Yiming Chen Simin Chen Zexin Li Wei Yang Cong Liu | 2023/5/20 |
High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units | arXiv preprint arXiv:2306.17005 | Junchen Lu Berrak Sisman Mingyang Zhang Haizhou Li | 2023/6/29 |
HuatuoGPT, towards Taming Language Model to Be a Doctor | Findings of EMNLP 2023 | Hongbo Zhang Junying Chen Feng Jiang Fei Yu Zhihong Chen | 2023/5/24 |
NeuroHeed: Neuro-Steered Speaker Extraction using EEG Signals | arXiv preprint arXiv:2307.14303 | Zexu Pan Marvin Borsdorf Siqi Cai Tanja Schultz Haizhou Li | 2023/7/26 |
Speech-Aware Multi-Domain Dialogue State Generation with ASR Error Correction Modules | Ridong Jiang Wei Shi Bin Wang Chen Zhang Yan Zhang | 2023/9 | |
USED: Universal Speaker Extraction and Diarization | arXiv preprint arXiv:2309.10674 | Junyi Ao Mehmet Sinan Yıldırım Meng Ge Shuai Wang Ruijie Tao | 2023/9/19 |
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert | Jiadong Wang Xinyuan Qian Malu Zhang Robby T Tan Haizhou Li | 2023 |