Jun Du

About Jun Du

Jun Du, With an exceptional h-index of 44 and a recent h-index of 42 (since 2020), a distinguished researcher at University of Science and Technology of China, specializes in the field of Speech Signal Processing, Audio Signal Processing, Pattern Recognition.

His recent articles reflect a diverse array of research interests and contributions to the field:

Translating language characters in media content

Improving Multi-Modal Emotion Recognition Using Entropy-Based Fusion and Pruning-Based Network Architecture Optimization

SEMv2: Table separation line detection based on instance segmentation

Deep learning-based size prediction for optical trapped nanoparticles and extracellular vesicles from limited bandwidth camera detection

Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture

Optimizing Audio-Visual Speech Enhancement Using Multi-Level Distortion Measures for Audio-Visual Speech Recognition

The multimodal information based speech processing (misp) 2023 challenge: Audio-visual target speaker extraction

Collaborative Viseme Subword and End-to-end Modeling for Word-level Lip Reading

Jun Du Information

University

Position

Associate Professor NEL-SLIP

Citations(all)

9142

Citations(since 2020)

7226

Cited By

4304

hIndex(all)

44

hIndex(since 2020)

42

i10Index(all)

131

i10Index(since 2020)

113

Email

University Profile Page

University of Science and Technology of China

Google Scholar

View Google Scholar Profile

Jun Du Skills & Research Interests

Speech Signal Processing

Audio Signal Processing

Pattern Recognition

Top articles of Jun Du

Title

Journal

Author(s)

Publication Date

Translating language characters in media content

2021/6/8

Improving Multi-Modal Emotion Recognition Using Entropy-Based Fusion and Pruning-Based Network Architecture Optimization

Haotian Wang

Jun Du

Yusheng Dai

Chin-Hui Lee

Yuling Ren

...

2024/4/14

SEMv2: Table separation line detection based on instance segmentation

Pattern Recognition

Zhenrong Zhang

Pengfei Hu

Jiefeng Ma

Jun Du

Jianshu Zhang

...

2024/5/1

Deep learning-based size prediction for optical trapped nanoparticles and extracellular vesicles from limited bandwidth camera detection

Biomedical Optics Express

Derrick Boateng

Kaiqin Chu

Zachary J Smith

Jun Du

Yichuan Dai

2024/1/1

Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture

Gaobin Yang

Maokui He

Shutong Niu

Ruoyu Wang

Yanyan Yue

...

2024/4/14

Optimizing Audio-Visual Speech Enhancement Using Multi-Level Distortion Measures for Audio-Visual Speech Recognition

IEEE/ACM Transactions on Audio, Speech, and Language Processing

Hang Chen

Qing Wang

Jun Du

Bao-Cai Yin

Jia Pan

...

2024/4/25

The multimodal information based speech processing (misp) 2023 challenge: Audio-visual target speaker extraction

Shilong Wu

Chenxi Wang

Hang Chen

Yusheng Dai

Chenyue Zhang

...

2024/4/14

Collaborative Viseme Subword and End-to-end Modeling for Word-level Lip Reading

IEEE Transactions on Multimedia

Hang Chen

Qing Wang

Jun Du

Gen-Shun Wan

Shi-Fu Xiong

...

2024/4/17

Multitask frame-level learning for few-shot sound event detection

arXiv preprint arXiv:2403.11091

Liang Zou

Genwei Yan

Ruoyu Wang

Jun Du

Meng Lei

...

2024/3/17

A Spatial Long-Term Iterative Mask Estimation Approach for Multi-Channel Speaker Diarization and Speech Recognition

Feng Ma

Yanhui Tu

Maokui He

Ruoyu Wang

Shutong Niu

...

2024/4/14

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition

arXiv preprint arXiv:2403.04245

Yusheng Dai

Hang Chen

Jun Du

Ruoyu Wang

Shihao Chen

...

2024/3/7

Implicit Enhancement of Target Speaker in Speaker-Adaptive ASR through Efficient Joint Optimization

Minghui Wu

Haitao Tang

Jiahuan Fan

Ruoyu Wang

Hang Chen

...

2024/4/14

Summary on the Multimodal Information Based Speech Processing (MISP) 2022 Challenge

Hang Chen

Shilong Wu

Yusheng Dai

Zhe Wang

Jun Du

...

2023/6/4

Incorporating Lip Features into Audio-Visual Multi-Speaker DOA Estimation by Gated Fusion

Ya Jiang

Hang Chen

Jun Du

Qing Wang

Chin-Hui Lee

2023/6/4

Improving audio-visual speech recognition by lip-subword correlation based visual pre-training and cross-modal fusion encoder

Yusheng Dai

Hang Chen

Jun Du

Xiaofei Ding

Ning Ding

...

2023/7/10

Group, Contrast and Recognize: A Self-supervised Method for Chinese Character Recognition

Xinzhe Jiang

Jun Du

Pengfei Hu

Mobai Xue

Jiefeng Ma

...

2023/8/19

Handwritten Chemical Structure Image to Structure-Specific Markup Using Random Conditional Guided Decoder

Jinshui Hu

Hao Wu

Mingjun Chen

Chenyu Liu

Jiajia Wu

...

2023/10/26

A Study on Domain Adaptation for Audio-Visual Speech Enhancement

Chenxi Wang

Hang Chen

Jun Du

Chenyue Zhang

Yuling Ren

...

2023/12/8

USTC-iFLYTEK at DocILE: a multi-modal approach using domain-specific GraphDoc

Working Notes of CLEF

Yan Wang

Jun Du

Jiefeng Ma

Pengfei Hu

Zhenrong Zhang

...

2023

A four-stage data augmentation approach to resnet-conformer based acoustic modeling for sound event localization and detection

IEEE/ACM Transactions on Audio, Speech, and Language Processing

Qing Wang

Jun Du

Hua-Xin Wu

Jia Pan

Feng Ma

...

2023/3/13

See List of Professors in Jun Du University(University of Science and Technology of China)