Rohan Kumar Das
National University of Singapore
H-index: 29
Asia-Singapore
Top articles of Rohan Kumar Das
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Sound Event Detection: A Journey Through DCASE Challenge Series | APSIPA Transactions on Signal and Information Processing | Tanmay Khandelwal Rohan Kumar Das Eng Siong Chng | 2024 |
Device Feature based on Graph Fourier Transformation with Logarithmic Processing For Detection of Replay Speech Attacks | arXiv preprint arXiv:2404.17280 | Mingrui He Longting Xu Han Wang Mingjun Zhang Rohan Kumar Das | 2024/4/26 |
Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan | arXiv preprint arXiv:2404.09342 | Muhammad Saad Saeed Shah Nawaz Muhammad Salman Tahir Rohan Kumar Das Muhammad Zaigham Zaheer | 2024/4/14 |
Enhancing Real-World Active Speaker Detection with Multi-Modal Extraction Pre-Training | arXiv preprint arXiv:2404.00861 | Ruijie Tao Xinyuan Qian Rohan Kumar Das Xiaoxue Gao Jiadong Wang | 2024/4/1 |
Dual Knowledge Distillation for Efficient Sound Event Detection | Yang Xiao Rohan Kumar Das | 2024/2/5 | |
Adaptive-avg-pooling based Attention Vision Transformer for Face Anti-spoofing | Jichen Yang Fangfan Chen Rohan Kumar Das Zhengyu Zhu Shunsi Zhang | 2024/1/10 | |
A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds | Tanmay Khandelwal Rohan Kumar Das | 2023/8 | |
FMSG submission for DCASE 2023 challenge task 4 on sound event detection with weak labels and synthetic soundscapes | Proc. DCASE Challenge | Yang Xiao Tanmay Khandelwal Rohan Kumar Das | 2023/6 |
Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions | Tanmay Khandelwal Rohan Kumar Das Andrew Koh Eng Siong Chng | 2023/4/25 | |
Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Ruijie Tao Kong Aik Lee Rohan Kumar Das Ville Hautamäki Haizhou Li | 2023/4/20 |
Dynamic Thresholding on FixMatch with Weak and Strong Data Augmentations for Sound Event Detection | Tanmay Khandelwal Rohan Kumar Das | 2022/12/12 | |
Self-supervised Speaker Recognition with Loss-gated Learning | Ruijie Tao Kong Aik Lee Rohan Kumar Das Ville Hautamäki Haizhou Li | 2022/2 | |
On the Use of Absolute Threshold of Hearing-based Loss for Full-band Speech Enhancement | Rohith Mars Rohan Kumar Das | 2022/12 | |
Neural acoustic-phonetic approach for speaker verification with phonetic attention mask | IEEE Signal Processing Letters | Tianchi Liu Rohan Kumar Das Kong Aik Lee Haizhou Li | 2022/1/13 |
Is Your Baby Fine at Home? Baby Cry Sound Detection in Domestic Environments | Tanmay Khandelwal Rohan Kumar Das Eng Siong Chng | 2022/11/7 | |
A Novel Feature Based on Graph Signal Processing for Detection of Physical Access Attacks | Longting Xu Mianxin Tian Xing Guo Zhiyong Shan Jie Jia | 2022 | |
I4U System Description for NIST SRE'20 CTS Challenge | arXiv preprint arXiv:2211.01091 | Kong Aik Lee Tomi Kinnunen Daniele Colibro Claudio Vair Andreas Nautsch | 2022/11/2 |
FMSG-NTU Submission For DCASE 2022 TASK 4 on Sound Event Detection in Domestic Environments | Tanmay Khandelwal Rohan Kumar Das Andrew Koh Eng Siong Chng | 2022/6 | |
MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances | Tianchi Liu Rohan Kumar Das Kong Aik Lee Haizhou Li | 2022/2/3 | |
HLT-NUS DiCOVA 2021 Challenge System Report | Github. io | Rohan Kumar Das Maulik Madhavi Haizhou Li | 2021 |