Peihao Chen
South China University of Technology
H-index: 10
Asia-China
Top articles of Peihao Chen
3D-VLA: A 3D Vision-Language-Action Generative World Model
arXiv preprint arXiv:2403.09631
2024/3/14
Peihao Chen
H-Index: 7
Jincheng Yang
H-Index: 4
Xin Yan
H-Index: 7
Yilun Du
H-Index: 7
Chuang Gan
H-Index: 37
Vesper: A compact and effective pretrained model for speech emotion recognition
IEEE Transactions on Affective Computing
2024/2/26
Peihao Chen
H-Index: 7
Xiangmin Xu
H-Index: 22
FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation
Advances in Neural Information Processing Systems
2024/2/13
MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World
arXiv preprint arXiv:2401.08577
2024/1/16
3D-LLM: Injecting the 3D world into large language models
Advances in Neural Information Processing Systems
2023/12/15
A Simple Knowledge Distillation Framework for Open-world Object Detection
arXiv preprint arXiv:2312.08653
2023/12/14
Ying Wei
H-Index: 16
Peihao Chen
H-Index: 7
DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning
arXiv preprint arXiv:2312.05783
2023/12/10
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
arXiv preprint arXiv:2311.03354
2023/11/6
A²Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models
arXiv preprint arXiv:2308.07997
2023/8/15
Detecting the open-world objects with the help of the Brain
arXiv preprint arXiv:2303.11623
2023/3/21
Ying Wei
H-Index: 16
Peihao Chen
H-Index: 7
Learning vision-and-language navigation from YouTube videos
2023
Masked motion encoding for self-supervised video representation learning
2023
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation
2022/10/14
Learning Active Camera for Multi-Object Navigation
2022/10/14
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning
Proceedings of the AAAI Conference on Artificial Intelligence
2021/5/18
Peihao Chen
H-Index: 7
Deng Huang
H-Index: 4
Runhao Zeng
H-Index: 5
Mingkui Tan
H-Index: 31
Chuang Gan
H-Index: 37
Generating visually aligned sound from videos
IEEE Transactions on Image Processing
2020/7/28
Peihao Chen
H-Index: 7
Yang Zhang
H-Index: 3
Mingkui Tan
H-Index: 31
Deng Huang
H-Index: 4
Chuang Gan
H-Index: 37
Location-aware graph convolutional networks for video question answering
Proceedings of the AAAI Conference on Artificial Intelligence
2020/4/3
Deng Huang
H-Index: 4
Peihao Chen
H-Index: 7
Runhao Zeng
H-Index: 5
Mingkui Tan
H-Index: 31
Chuang Gan
H-Index: 37
Dense regression network for video grounding
2020
Foley music: Learning to generate music from videos
2020