Jun Du
University of Science and Technology of China
H-index: 44
Asia-China
Top articles of Jun Du
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Translating language characters in media content | 2021/6/8 | ||
Improving Multi-Modal Emotion Recognition Using Entropy-Based Fusion and Pruning-Based Network Architecture Optimization | Haotian Wang Jun Du Yusheng Dai Chin-Hui Lee Yuling Ren | 2024/4/14 | |
SEMv2: Table separation line detection based on instance segmentation | Pattern Recognition | Zhenrong Zhang Pengfei Hu Jiefeng Ma Jun Du Jianshu Zhang | 2024/5/1 |
Deep learning-based size prediction for optical trapped nanoparticles and extracellular vesicles from limited bandwidth camera detection | Biomedical Optics Express | Derrick Boateng Kaiqin Chu Zachary J Smith Jun Du Yichuan Dai | 2024/1/1 |
Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture | Gaobin Yang Maokui He Shutong Niu Ruoyu Wang Yanyan Yue | 2024/4/14 | |
Optimizing Audio-Visual Speech Enhancement Using Multi-Level Distortion Measures for Audio-Visual Speech Recognition | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Hang Chen Qing Wang Jun Du Bao-Cai Yin Jia Pan | 2024/4/25 |
The multimodal information based speech processing (misp) 2023 challenge: Audio-visual target speaker extraction | Shilong Wu Chenxi Wang Hang Chen Yusheng Dai Chenyue Zhang | 2024/4/14 | |
Collaborative Viseme Subword and End-to-end Modeling for Word-level Lip Reading | IEEE Transactions on Multimedia | Hang Chen Qing Wang Jun Du Gen-Shun Wan Shi-Fu Xiong | 2024/4/17 |
Multitask frame-level learning for few-shot sound event detection | arXiv preprint arXiv:2403.11091 | Liang Zou Genwei Yan Ruoyu Wang Jun Du Meng Lei | 2024/3/17 |
A Spatial Long-Term Iterative Mask Estimation Approach for Multi-Channel Speaker Diarization and Speech Recognition | Feng Ma Yanhui Tu Maokui He Ruoyu Wang Shutong Niu | 2024/4/14 | |
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition | arXiv preprint arXiv:2403.04245 | Yusheng Dai Hang Chen Jun Du Ruoyu Wang Shihao Chen | 2024/3/7 |
Implicit Enhancement of Target Speaker in Speaker-Adaptive ASR through Efficient Joint Optimization | Minghui Wu Haitao Tang Jiahuan Fan Ruoyu Wang Hang Chen | 2024/4/14 | |
Summary on the Multimodal Information Based Speech Processing (MISP) 2022 Challenge | Hang Chen Shilong Wu Yusheng Dai Zhe Wang Jun Du | 2023/6/4 | |
Incorporating Lip Features into Audio-Visual Multi-Speaker DOA Estimation by Gated Fusion | Ya Jiang Hang Chen Jun Du Qing Wang Chin-Hui Lee | 2023/6/4 | |
Improving audio-visual speech recognition by lip-subword correlation based visual pre-training and cross-modal fusion encoder | Yusheng Dai Hang Chen Jun Du Xiaofei Ding Ning Ding | 2023/7/10 | |
Group, Contrast and Recognize: A Self-supervised Method for Chinese Character Recognition | Xinzhe Jiang Jun Du Pengfei Hu Mobai Xue Jiefeng Ma | 2023/8/19 | |
Handwritten Chemical Structure Image to Structure-Specific Markup Using Random Conditional Guided Decoder | Jinshui Hu Hao Wu Mingjun Chen Chenyu Liu Jiajia Wu | 2023/10/26 | |
A Study on Domain Adaptation for Audio-Visual Speech Enhancement | Chenxi Wang Hang Chen Jun Du Chenyue Zhang Yuling Ren | 2023/12/8 | |
USTC-iFLYTEK at DocILE: a multi-modal approach using domain-specific GraphDoc | Working Notes of CLEF | Yan Wang Jun Du Jiefeng Ma Pengfei Hu Zhenrong Zhang | 2023 |
A four-stage data augmentation approach to resnet-conformer based acoustic modeling for sound event localization and detection | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Qing Wang Jun Du Hua-Xin Wu Jia Pan Feng Ma | 2023/3/13 |