ProfessorsProfessors of University of Science and Technology of ChinaYucheng Zhao

Yucheng Zhao

University of Science and Technology of China

H-index: 8

Asia-China

About Yucheng Zhao

Yucheng Zhao, With an exceptional h-index of 8 and a recent h-index of 8 (since 2020), a distinguished researcher at University of Science and Technology of China, specializes in the field of Speech, Self-Supervised Learning, Transformer, Video Generation.

His recent articles reflect a diverse array of research interests and contributions to the field:

SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control

Stream Query Denoising for Vectorized HD Map Construction

VLM-Eval: A General Evaluation on Video Large Language Models

Attention-Guided Contrastive Masked Image Modeling for Transformer-Based Self-Supervised Learning

Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss

Streaming video model

Panacea: Panoramic and Controllable Video Generation for Autonomous Driving

Look before you match: Instance understanding matters in video object segmentation

Yucheng Zhao Information

University	University of Science and Technology of China
Position	___
Citations(all)	358
Citations(since 2020)	358
Cited By	2
hIndex(all)	8
hIndex(since 2020)	8
i10Index(all)	8
i10Index(since 2020)	8
Email	Access Email
University Profile Page	University of Science and Technology of China
Google Scholar	View Google Scholar Profile

Yucheng Zhao Skills & Research Interests

Speech

Self-Supervised Learning

Transformer

Video Generation

Top articles of Yucheng Zhao

Title	Journal	Author(s)	Publication Date
SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control	arXiv preprint arXiv:2403.19438	Binyuan Huang Yuqing Wen Yucheng Zhao Yaosi Hu Yingfei Liu ...	2024/3/28
Stream Query Denoising for Vectorized HD Map Construction		Shuo Wang Fan Jia Yingfei Liu Yucheng Zhao Zehui Chen ...	2024/1/17
VLM-Eval: A General Evaluation on Video Large Language Models	arXiv preprint arXiv:2311.11865	Shuailin Li Yuang Zhang Yucheng Zhao Qiuyue Wang Fan Jia ...	2023/11/20
Attention-Guided Contrastive Masked Image Modeling for Transformer-Based Self-Supervised Learning		Yucheng Zhan Yucheng Zhao Chong Luo Yueyi Zhang Xiaoyan Sun	2023/10/8
Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss		Zhiyuan Zhao Lijun Wu Chuanxin Tang Dacheng Yin Yucheng Zhao ...	2023/6/4
Streaming video model		Yucheng Zhao Chong Luo Chuanxin Tang Dongdong Chen Noel Codella ...	2023
Panacea: Panoramic and Controllable Video Generation for Autonomous Driving	arXiv preprint arXiv:2311.16813	Yuqing Wen Yucheng Zhao Yingfei Liu Fan Jia Yanhui Wang ...	2023/11/28
Look before you match: Instance understanding matters in video object segmentation		Junke Wang Dongdong Chen Zuxuan Wu Chong Luo Chuanxin Tang ...	2023
Adriver-i: A general world model for autonomous driving	arXiv preprint arXiv:2311.13549	Fan Jia Weixin Mao Yingfei Liu Yucheng Zhao Yuqing Wen ...	2023/11/22
Omnivl: One foundation model for image-language and video-language tasks	Advances in neural information processing systems	Junke Wang Dongdong Chen Zuxuan Wu Chong Luo Luowei Zhou ...	2022/12/6
Peripheral vision transformer	Advances in Neural Information Processing Systems	Juhong Min Yucheng Zhao Chong Luo Minsu Cho	2022/12/6
T2D: Spatiotemporal Feature Learning Based on Triple 2D Decomposition		Yucheng Zhao Chong Luo Chuanxin Tang Dongdong Chen Noel C Codella ...	2022/9/29
RetrieverTTS: Modeling decomposed factors for text-based speech insertion		Dacheng Yin Chuanxin Tang Yanqing Liu Xiaoqiang Wang Zhiyuan Zhao ...	2022
When shift operation meets vision transformer: An extremely simple alternative to attention mechanism	Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI)	Guangting Wang Yucheng Zhao Chuanxin Tang Chong Luo Wenjun Zeng	2022/1/26
Sparse MLP for image recognition: Is self-attention really necessary?	Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI)	Chuanxin Tang Yucheng Zhao Guangting Wang Chong Luo Wenxuan Xie ...	2021/9/12
Zero-shot text-to-speech for text-based insertion in audio narration	arXiv preprint arXiv:2109.05426	Chuanxin Tang Chong Luo Zhiyuan Zhao Dacheng Yin Yucheng Zhao ...	2021/9/12
A battle of network structures: An empirical study of cnn, transformer, and mlp	arXiv preprint arXiv:2108.13002	Yucheng Zhao Guangting Wang Chuanxin Tang Chong Luo Wenjun Zeng ...	2021/8/30
General-purpose speech representation learning through a self-supervised multi-granularity framework	arXiv preprint arXiv:2102.01930	Yucheng Zhao Dacheng Yin Chong Luo Zhiyuan Zhao Chuanxin Tang ...	2021/2/3
Multi-scale group transformer for long sequence modeling in speech separation		Yucheng Zhao Chong Luo Zheng-Jun Zha Wenjun Zeng	2021/1/7
Self-supervised visual representations learning by contrastive mask prediction		Yucheng Zhao Guangting Wang Chong Luo Wenjun Zeng Zheng-Jun Zha	2021/8/18