Yucheng Zhao
University of Science and Technology of China
H-index: 8
Asia-China
Top articles of Yucheng Zhao
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control | arXiv preprint arXiv:2403.19438 | Binyuan Huang Yuqing Wen Yucheng Zhao Yaosi Hu Yingfei Liu | 2024/3/28 |
Stream Query Denoising for Vectorized HD Map Construction | Shuo Wang Fan Jia Yingfei Liu Yucheng Zhao Zehui Chen | 2024/1/17 | |
VLM-Eval: A General Evaluation on Video Large Language Models | arXiv preprint arXiv:2311.11865 | Shuailin Li Yuang Zhang Yucheng Zhao Qiuyue Wang Fan Jia | 2023/11/20 |
Attention-Guided Contrastive Masked Image Modeling for Transformer-Based Self-Supervised Learning | Yucheng Zhan Yucheng Zhao Chong Luo Yueyi Zhang Xiaoyan Sun | 2023/10/8 | |
Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss | Zhiyuan Zhao Lijun Wu Chuanxin Tang Dacheng Yin Yucheng Zhao | 2023/6/4 | |
Streaming video model | Yucheng Zhao Chong Luo Chuanxin Tang Dongdong Chen Noel Codella | 2023 | |
Panacea: Panoramic and Controllable Video Generation for Autonomous Driving | arXiv preprint arXiv:2311.16813 | Yuqing Wen Yucheng Zhao Yingfei Liu Fan Jia Yanhui Wang | 2023/11/28 |
Look before you match: Instance understanding matters in video object segmentation | Junke Wang Dongdong Chen Zuxuan Wu Chong Luo Chuanxin Tang | 2023 | |
Adriver-i: A general world model for autonomous driving | arXiv preprint arXiv:2311.13549 | Fan Jia Weixin Mao Yingfei Liu Yucheng Zhao Yuqing Wen | 2023/11/22 |
Omnivl: One foundation model for image-language and video-language tasks | Advances in neural information processing systems | Junke Wang Dongdong Chen Zuxuan Wu Chong Luo Luowei Zhou | 2022/12/6 |
Peripheral vision transformer | Advances in Neural Information Processing Systems | Juhong Min Yucheng Zhao Chong Luo Minsu Cho | 2022/12/6 |
T2D: Spatiotemporal Feature Learning Based on Triple 2D Decomposition | Yucheng Zhao Chong Luo Chuanxin Tang Dongdong Chen Noel C Codella | 2022/9/29 | |
RetrieverTTS: Modeling decomposed factors for text-based speech insertion | Dacheng Yin Chuanxin Tang Yanqing Liu Xiaoqiang Wang Zhiyuan Zhao | 2022 | |
When shift operation meets vision transformer: An extremely simple alternative to attention mechanism | Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI) | Guangting Wang Yucheng Zhao Chuanxin Tang Chong Luo Wenjun Zeng | 2022/1/26 |
Sparse MLP for image recognition: Is self-attention really necessary? | Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI) | Chuanxin Tang Yucheng Zhao Guangting Wang Chong Luo Wenxuan Xie | 2021/9/12 |
Zero-shot text-to-speech for text-based insertion in audio narration | arXiv preprint arXiv:2109.05426 | Chuanxin Tang Chong Luo Zhiyuan Zhao Dacheng Yin Yucheng Zhao | 2021/9/12 |
A battle of network structures: An empirical study of cnn, transformer, and mlp | arXiv preprint arXiv:2108.13002 | Yucheng Zhao Guangting Wang Chuanxin Tang Chong Luo Wenjun Zeng | 2021/8/30 |
General-purpose speech representation learning through a self-supervised multi-granularity framework | arXiv preprint arXiv:2102.01930 | Yucheng Zhao Dacheng Yin Chong Luo Zhiyuan Zhao Chuanxin Tang | 2021/2/3 |
Multi-scale group transformer for long sequence modeling in speech separation | Yucheng Zhao Chong Luo Zheng-Jun Zha Wenjun Zeng | 2021/1/7 | |
Self-supervised visual representations learning by contrastive mask prediction | Yucheng Zhao Guangting Wang Chong Luo Wenjun Zeng Zheng-Jun Zha | 2021/8/18 |