Ruihua Song
Renmin University of China
H-index: 35
Asia-China
Top articles of Ruihua Song
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition | arXiv preprint arXiv:2401.18045 | Yihan Wu Soumi Maiti Yifan Peng Wangyou Zhang Chenda Li | 2024/1/31 |
Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion | arXiv preprint arXiv:2403.07312 | Wenhui Tan Bei Liu Junbo Zhang Ruihua Song Jianlong Fu | 2024/3/12 |
Characterized chatbot with personality | 2024/3/5 | ||
TeViS: Translating Text Synopses to Video Storyboards | Xu Gu Yuchong Sun Feiyue Ni Shizhe Chen Xihua Wang | 2023/10/26 | |
Difference between Multi-modal vs. Text Pre-trained Models in Embedding Text | Beijing Da Xue Xue Bao | Yuchong Sun Xiwei Cheng Ruihua Song Wanxiang Che Zhiwu Lu | 2023 |
Videodubber: Machine translation with speech-aware length control for video dubbing | Proceedings of the AAAI Conference on Artificial Intelligence | Yihan Wu Junliang Guo Xu Tan Chen Zhang Bohan Li | 2023/6/26 |
Going Beyond Closed Sets: A Multimodal Perspective for Video Emotion Analysis | Hao Pu Yuchong Sun Ruihua Song Xu Chen Hao Jiang | 2023/10/13 | |
Intelligent virtual assistants with llm-based process automation | arXiv preprint arXiv:2312.06677 | Yanchu Guan Dong Wang Zhixuan Chu Shiyu Wang Feiyue Ni | 2023/12/4 |
Expanding the Horizons: Exploring Further Steps in Open-Vocabulary Segmentation | Xihua Wang Lei Ji Kun Yan Yuchong Sun Ruihua Song | 2023/10/13 | |
Pave the way to grasp anything: Transferring foundation models for universal pick-place robots | arXiv preprint arXiv:2306.05716 | Jiange Yang Wenhui Tan Chuhao Jin Bei Liu Jianlong Fu | 2023/6/9 |
Joint Semantic and Strategy Matching for Persuasive Dialogue | Chuhao Jin Yutao Zhu Lingzhen Kong Shijie Li Xiao Zhang | 2023/12 | |
Parrot: Enhancing Multi-Turn Chat Models by Learning to Ask Questions | arXiv preprint arXiv:2310.07301 | Yuchong Sun Che Liu Jinwen Huang Ruihua Song Fuzheng Zhang | 2023/10/11 |
Recagent: A novel simulation paradigm for recommender systems | arXiv preprint arXiv:2306.02552 | Lei Wang Jingsen Zhang Xu Chen Yankai Lin Ruihua Song | 2023/6/5 |
What makes for good visual instructions? synthesizing complex visual reasoning instructions for visual instruction tuning | arXiv preprint arXiv:2311.01487 | Yifan Du Hangyu Guo Kun Zhou Wayne Xin Zhao Jinpeng Wang | 2023/11/2 |
ViCo: Engaging Video Comment Generation with Human Preference Rewards | arXiv preprint arXiv:2308.11171 | Yuchong Sun Bei Liu Xu Chen Ruihua Song Jianlong Fu | 2023/8/22 |
Alphablock: Embodied finetuning for vision-language reasoning in robot manipulation | arXiv preprint arXiv:2305.18898 | Chuhao Jin Wenhui Tan Jiange Yang Bei Liu Ruihua Song | 2023/5/30 |
TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real World | Hongpeng Lin Ludan Ruan Wenke Xia Peiyu Liu Jingyuan Wen | 2023/10/26 | |
ComedicSpeech: Text To Speech For Stand-up Comedies in Low-Resource Scenarios | arXiv preprint arXiv:2305.12200 | Yuyue Wang Huan Xiao Yihan Wu Ruihua Song | 2023/5/20 |
Show Me a Video: A Large-Scale Narrated Video Dataset for Coherent Story Illustration | IEEE Transactions on Multimedia | Yu Lu Feiyue Ni Haofan Wang Xiaofeng Guo Linchao Zhu | 2023/7/26 |
A roadmap for big model | arXiv preprint arXiv:2203.14101 | Sha Yuan Hanyu Zhao Shuai Zhao Jiahong Leng Yangxiao Liang | 2022/3/26 |