Zhen-Hua Ling(凌震华)
University of Science and Technology of China
H-index: 47
Asia-China
Top articles of Zhen-Hua Ling(凌震华)
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding | arXiv preprint arXiv:2402.10533 | Yang Ai Xiao-Hang Jiang Ye-Xin Lu Hui-Peng Du Zhen-Hua Ling | 2024/2/16 |
Adversarial speech for voice privacy protection from Personalized Speech generation | arXiv preprint arXiv:2401.11857 | Shihao Chen Liping Chen Jie Zhang KongAik Lee Zhenhua Ling | 2024/1/22 |
Dynamic facial expression recognition with pseudo‐label guided multi‐modal pre‐training | IET Computer Vision | Bing Yin Shi Yin Cong Liu Yanyong Zhang Changfeng Xi | 2024/2/1 |
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction | arXiv preprint arXiv:2401.06387 | Ye-Xin Lu Yang Ai Hui-Peng Du Zhen-Hua Ling | 2024/1/12 |
Considering Temporal Connection between Turns for Conversational Speech Synthesis | Kangdi Mei Zhaoci Liu Huipeng Du Hengyu Li Yang Ai | 2024/4/14 | |
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Rui-Chen Zheng Yang Ai Zhen-Hua Ling | 2024/2/1 |
Modeling Pseudo-Speaker Uncertainty in Voice Anonymization | Liping Chen Kong Aik Lee Wu Guo Zhen-Hua Ling | 2024/4/14 | |
Model Editing Can Hurt General Abilities of Large Language Models | arXiv preprint arXiv:2401.04700 | Jia-Chen Gu Hao-Xiang Xu Jun-Yu Ma Pan Lu Zhen-Hua Ling | 2024/1/9 |
Neighboring Perturbations of Knowledge Editing on Large Language Models | arXiv preprint arXiv:2401.17623 | Jun-Yu Ma Jia-Chen Gu Ningyu Zhang Zhen-Hua Ling | 2024/1/31 |
Multiscale Matching Driven by Cross-Modal Similarity Consistency for Audio-Text Retrieval | Qian Wang Jia-Chen Gu Zhen-Hua Ling | 2024/4/14 | |
Low-Latency Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Yang Ai Zhen-Hua Ling | 2024/4/4 |
Corrective Retrieval Augmented Generation | arXiv preprint arXiv:2401.15884 | Shi-Qi Yan Jia-Chen Gu Yun Zhu Zhen-Hua Ling | 2024/1/29 |
Learning WHO Saying WHAT to WHOM in Multi-Party Conversations | Jia-Chen Gu Zhuosheng Zhang Zhen-Hua Ling | 2023/11 | |
APNet2: High-quality and High-efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra | Hui-Peng Du Ye-Xin Lu Yang Ai Zhen-Hua Ling | 2023/12/8 | |
The USTC-NERCSLIP System for the Track 1.2 of Audio Deepfake Detection (ADD 2023) Challenge | Haochen Wu Zhuhai Li Luzhen Xu Zhentao Zhang Wenting Zhao | 2023 | |
Neural Speech Phase Prediction Based on Parallel Estimation Architecture and Anti-Wrapping Losses | Yang Ai Zhen-Hua Ling | 2023/6/4 | |
Long-frame-shift Neural Speech Phase Prediction with Spectral Continuity Enhancement and Interpolation Error Compensation | IEEE Signal Processing Letters | Yang Ai Ye-Xin Lu Zhen-Hua Ling | 2023/8/17 |
USTC-NELSLIP at SemEval-2023 Task 2: Statistical Construction and Dual Adaptation of Gazetteer for Multilingual Complex NER | Jun-Yu Ma Jia-Chen Gu Jiajun Qi Zhen-Hua Ling Quan Liu | 2023/5/4 | |
Is ChatGPT a Good Multi-Party Conversation Solver? | Chao-Hong Tan Jia-Chen Gu Zhen-Hua Ling | 2023/10/25 | |
Symbolization, Prompt, and Classification: A Framework for Implicit Speaker Identification in Novels | Yue Chen Tianwei He Hongbin Zhou Jia-Chen Gu Heng Lu | 2023/12 |