Yuexian Zou

About Yuexian Zou

Yuexian Zou, With an exceptional h-index of 41 and a recent h-index of 35 (since 2020), a distinguished researcher at Peking University, specializes in the field of Machine Learning, Speech Processing, Image Processing.

His recent articles reflect a diverse array of research interests and contributions to the field:

Exploiting Auxiliary Caption for Video Grounding

Learn Suspected Anomalies from Event Prompts for Video Anomaly Detection

Towards Explainable Joint Models via Information Theory for Multiple Intent Detection and Slot Filling

Zeronlg: Aligning and autoencoding domains for zero-shot multimodal and multilingual natural language generation

Audiogpt: Understanding and generating speech, music, sound, and talking head

Visiongpt: Vision-language understanding agent using generalized multimodal framework

Retrieval is Accurate Generation

Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport

Yuexian Zou Information

University

Position

Shenzhen Graduate School

Citations(all)

6597

Citations(since 2020)

4914

Cited By

2667

hIndex(all)

41

hIndex(since 2020)

35

i10Index(all)

139

i10Index(since 2020)

114

Email

University Profile Page

Google Scholar

Yuexian Zou Skills & Research Interests

Machine Learning

Speech Processing

Image Processing

Top articles of Yuexian Zou

Title

Journal

Author(s)

Publication Date

Exploiting Auxiliary Caption for Video Grounding

Proceedings of the AAAI Conference on Artificial Intelligence

Hongxiang Li

Meng Cao

Xuxin Cheng

Yaowei Li

Zhihong Zhu

...

2024/3/24

Learn Suspected Anomalies from Event Prompts for Video Anomaly Detection

arXiv preprint arXiv:2403.01169

Chenchen Tao

Chong Wang

Yuexian Zou

Xiaohao Peng

Jiafei Wu

...

2024/3/2

Towards Explainable Joint Models via Information Theory for Multiple Intent Detection and Slot Filling

Proceedings of the AAAI Conference on Artificial Intelligence

Xianwei Zhuang

Xuxin Cheng

Yuexian Zou

2024/3/24

Zeronlg: Aligning and autoencoding domains for zero-shot multimodal and multilingual natural language generation

IEEE Transactions on Pattern Analysis and Machine Intelligence

Bang Yang

Fenglin Liu

Yuexian Zou

Xian Wu

Yaowei Wang

...

2024/2/29

Audiogpt: Understanding and generating speech, music, sound, and talking head

Proceedings of the AAAI Conference on Artificial Intelligence

Rongjie Huang

Mingze Li

Dongchao Yang

Jiatong Shi

Xuankai Chang

...

2024/3/24

Visiongpt: Vision-language understanding agent using generalized multimodal framework

arXiv preprint arXiv:2403.09027

Chris Kelly

Luhui Hu

Bang Yang

Yu Tian

Deshun Yang

...

2024/3/14

Retrieval is Accurate Generation

arXiv preprint arXiv:2402.17532

Bowen Cao

Deng Cai

Leyang Cui

Xuxin Cheng

Wei Bi

...

2024/2/27

Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport

Proceedings of the AAAI Conference on Artificial Intelligence

Xuxin Cheng

Zhihong Zhu

Hongxiang Li

Yaowei Li

Xianwei Zhuang

...

2024/3/24

WorldGPT: a Sora-inspired video AI agent as Rich world models from text and image inputs

arXiv preprint arXiv:2403.07944

Deshun Yang

Luhui Hu

Yu Tian

Zihao Li

Chris Kelly

...

2024/3/10

Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning

arXiv preprint arXiv:2401.17186

Bang Yang

Yong Dai

Xuxin Cheng

Yaowei Li

Asif Raza

...

2024/1/30

Aligner²: Enhancing Joint Multiple Intent Detection and Slot Filling via Adjustive and Forced Cross-Task Alignment

Proceedings of the AAAI Conference on Artificial Intelligence

Zhihong Zhu

Xuxin Cheng

Yaowei Li

Hongxiang Li

Yuexian Zou

2024/3/24

Dance with Labels: Dual-Heterogeneous Label Graph Interaction for Multi-intent Spoken Language Understanding

Zhihong Zhu

Xuxin Cheng

Hongxiang Li

Yaowei Li

Yuexian Zou

2024/3/4

Mrrl: Modifying the reference via reinforcement learning for non-autoregressive joint multiple intent detection and slot filling

Xuxin Cheng

Zhihong Zhu

Bowen Cao

Qichen Ye

Yuexian Zou

2023/12

Nadiffuse: Noise-aware diffusion-based model for speech enhancement

Wen Wang

Dongchao Yang

Qichen Ye

Bowen Cao

Yuexian Zou

2023/10/31

Multicapclip: Auto-encoding prompts for zero-shot multilingual visual captioning

arXiv preprint arXiv:2308.13218

Bang Yang

Fenglin Liu

Xian Wu

Yaowei Wang

Xu Sun

...

2023/8/25

Background-aware Modeling for Weakly Supervised Sound Event Detection

Proc. INTERSPEECH

Yifei Xin

Dongchao Yang

Yuexian Zou

2023

GhostT5: generate more features with cheap operations to improve textless spoken question answering

Proc. INTERSPEECH

Xuxin Cheng

Zhihong Zhu

Ziyu Yao

Hongxiang Li

Yaowei Li

...

2023

Diffsound: Discrete diffusion model for text-to-sound generation

IEEE/ACM Transactions on Audio, Speech, and Language Processing

Dongchao Yang

Jianwei Yu

Helin Wang

Wen Wang

Chao Weng

...

2023/4/28

Ssvmr: Saliency-based self-training for video-music retrieval

Xuxin Cheng

Zhihong Zhu

Hongxiang Li

Yaowei Li

Yuexian Zou

2023/6/4

Ftm: A frame-level timeline modeling method for temporal graph representation learning

Proceedings of the AAAI Conference on Artificial Intelligence

Bowen Cao

Qichen Ye

Weiyuan Xu

Yuexian Zou

2023/6/26

See List of Professors in Yuexian Zou University(Peking University)

Co-Authors

academic-engine