Xiaodan Liang
Sun Yat-Sen University
H-index: 75
Asia-China
Top articles of Xiaodan Liang
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
MMTryon: Multi-Modal Multi-Reference Control for High-Quality Fashion Generation | arXiv preprint arXiv:2405.00448 | Xujie Zhang Ente Lin Xiu Li Yuxuan Luo Michael Kampffmeyer | 2024/5/1 |
RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments | Mengxue Qu Yu Wu Wu Liu Xiaodan Liang Jingkuan Song | 2023/8 | |
Towards detailed text-to-motion synthesis via basic-to-advanced hierarchical diffusion model | Proceedings of the AAAI conference on artificial intelligence | Zhenyu Xie Yang Wu Xuehao Gao Zhongqian Sun Wei Yang | 2023/12/18 |
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving | arXiv preprint arXiv:2404.16771 | Jiehui Huang Xiao Dong Wenhui Song Hanhui Li Jun Zhou | 2024/4/25 |
GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World Data | arXiv preprint arXiv:2402.06198 | Haoyuan Li Yanpeng Zhou Yihan Zeng Hang Xu Xiaodan Liang | 2024/2/9 |
Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation | arXiv preprint arXiv:2403.08426 | Zicheng Zhang Tong Zhang Yi Zhu Jianzhuang Liu Xiaodan Liang | 2024/3/13 |
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection | arXiv preprint arXiv:2404.09216 | Lewei Yao Renjie Pi Jianhua Han Xiaodan Liang Hang Xu | 2024/2/27 |
MapGPT: Map-Guided Prompting for Unified Vision-and-Language Navigation | arXiv preprint arXiv:2401.07314 | Jiaqi Chen Bingqian Lin Ran Xu Zhenhua Chai Xiaodan Liang | 2024/1/14 |
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning | arXiv preprint arXiv:2403.07376 | Bingqian Lin Yunshuang Nie Ziming Wei Jiaqi Chen Shikui Ma | 2024/3/12 |
MLP Can Be A Good Transformer Learner | arXiv preprint arXiv:2404.05657 | Sihao Lin Pumeng Lyu Dongrui Liu Tao Tang Xiaodan Liang | 2024/4/8 |
AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis | arXiv preprint arXiv:2402.17483 | Tao Tang Guangrun Wang Yixing Lao Peng Chen Jie Liu | 2024/2/27 |
PTUS: Photo-Realistic Talking Upper-Body Synthesis via 3D-Aware Motion Decomposition Warping | Proceedings of the AAAI Conference on Artificial Intelligence | Luoyang Lin Zutao Jiang Xiaodan Liang Liqian Ma Michael C Kampffmeyer | 2024/3/24 |
3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands | arXiv preprint arXiv:2401.00979 | Xuan Huang Hanhui Li Zejun Yang Zhisheng Wang Xiaodan Liang | 2024/1/2 |
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data | arXiv preprint arXiv:2402.08957 | Yinya Huang Xiaohan Lin Zhengying Liu Qingxing Cao Huajian Xin | 2024/2/14 |
Monocular 3D Hand Mesh Recovery via Dual Noise Estimation | Proceedings of the AAAI Conference on Artificial Intelligence | Hanhui Li Xiaojian Lin Xuan Huang Zejun Yang Zhisheng Wang | 2024/3/24 |
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models | arXiv preprint arXiv:2401.00988 | Xinpeng Ding Jinahua Han Hang Xu Xiaodan Liang Wei Zhang | 2024/1/2 |
Discourse-aware graph networks for textual logical reasoning | IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) | Yinya Huang Lemao Liu Kun Xu Meng Fang Liang Lin | 2023/5/26 |
Towards causality-aware inferring: a sequential discriminative approach for medical diagnosis | IEEE Transactions on Pattern Analysis and Machine Intelligence | Junfan Lin Keze Wang Ziliang Chen Xiaodan Liang Liang Lin | 2023/7/5 |
Dq-lore: Dual queries with low rank approximation re-ranking for in-context learning | arXiv preprint arXiv:2310.02954 | Jing Xiong Zixuan Li Chuanyang Zheng Zhijiang Guo Yichun Yin | 2023/10/4 |
3d-togo: Towards text-guided cross-category 3d object generation | Proceedings of the AAAI Conference on Artificial Intelligence | Zutao Jiang Guansong Lu Xiaodan Liang Jihua Zhu Wei Zhang | 2023/6/26 |