Xinyuan Qian
National University of Singapore
H-index: 10
Asia-Singapore
Top articles of Xinyuan Qian
Audio-Visual Target Speaker Extraction with Reverse Selective Auditory Attention
arXiv preprint arXiv:2404.18501
2024/4/29
GLMB 3D Speaker Tracking with Video-Assisted Multi-Channel Audio Optimization Functions
2024/4/14
Xinyuan Qian
H-Index: 4
Qiquan Zhang
H-Index: 5
Visually Guided Binaural Audio Generation with Cross-Modal Consistency
2024/4/14
LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism
2024/4/14
Enhancing Real-World Active Speaker Detection with Multi-Modal Extraction Pre-Training
arXiv preprint arXiv:2404.00861
2024/4/1
M3TTS: Multi-modal text-to-speech of multi-scale style control for dubbing
Pattern Recognition Letters
2024/2/10
Attention-Based End-to-End Differentiable Particle Filter for Audio Speaker Tracking
2023/9/8
Adapting Pre-Trained Self-Supervised Learning Model for Speech Recognition with Light-Weight Adapters
Electronics
2024/1/1
Audio-Visual Temporal Forgery Detection Using Embedding-Level Fusion and Multi-Dimensional Contrastive Loss
IEEE Transactions on Circuits and Systems for Video Technology
2023/10/23
Audio-visual speaker tracking: Progress, challenges, and future directions
arXiv preprint arXiv:2310.14778
2023/10/23
Yong Xu
H-Index: 18
Xinyuan Qian
H-Index: 4
Davide Berghi
H-Index: 0
Meng Cui
H-Index: 21
Wenwu Wang
H-Index: 29
Deep Cross-Modal Retrieval Between Spatial Image and Acoustic Speech
IEEE Transactions on Multimedia
2023/10/13
Audio Visual Speaker Localization from EgoCentric Views
arXiv preprint arXiv:2309.16308
2023/9/28
L F-TOUCH: A Wireless GelSight with Decoupled Tactile and Three-axis Force Sensing
IEEE Robotics and Automation Letters
2023/7/5
Self-Convolution for Automatic Speech Recognition
2023/6/4
Tian-Hao Zhang
H-Index: 9
Qi Liu
H-Index: 28
Xinyuan Qian
H-Index: 4
Feng Chen
H-Index: 15
Xu-Cheng Yin
H-Index: 16
Stream Attention Based U-Net for L3DAS23 Challenge
2023/6/4
Yanjie Fu
H-Index: 24
Junjie Li
H-Index: 4
Meng Ge
H-Index: 5
Longbiao Wang
H-Index: 15
Xinyuan Qian
H-Index: 4
Ripple sparse self-attention for monaural speech enhancement
2023/6/4
A miniaturised camera-based multi-modal tactile sensor
2023/5/29
Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding
arXiv preprint arXiv:2305.14049
2023/5/23
Tian-Hao Zhang
H-Index: 9
Qi Liu
H-Index: 28
Feng Chen
H-Index: 15
Xinyuan Qian
H-Index: 4
Xu-Cheng Yin
H-Index: 16
InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition
Network
2023/5
Tian-Hao Zhang
H-Index: 9
Qi Liu
H-Index: 28
Xinyuan Qian
H-Index: 4
Feng Chen
H-Index: 15
Xu-Cheng Yin
H-Index: 16
Device features based on linear transformation with parallel training data for replay speech detection
IEEE/ACM Transactions on Audio, Speech, and Language Processing
2023/4/17