Zhengyuan Yang
University of Rochester
H-index: 25
North America-United States
Top articles of Zhengyuan Yang
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Design2Code: How Far Are We From Automating Front-End Engineering? | arXiv preprint arXiv:2403.03163 | Chenglei Si Yanzhe Zhang Zhengyuan Yang Ruibo Liu Diyi Yang | 2024/3/5 |
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis | arXiv preprint arXiv:2401.17093 | Zecheng Tang Chenfei Wu Zekai Zhang Mingheng Ni Shengming Yin | 2024/1/30 |
Bring Metric Functions into Diffusion Models | arXiv preprint arXiv:2401.02414 | Jie An Zhengyuan Yang Jianfeng Wang Linjie Li Zicheng Liu | 2024/1/4 |
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs | arXiv preprint arXiv:2404.16375 | An Yan Zhengyuan Yang Junda Wu Wanrong Zhu Jianwei Yang | 2024/4/25 |
COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training | arXiv preprint arXiv:2401.00849 | Alex Jinpeng Wang Linjie Li Kevin Qinghong Lin Jianfeng Wang Kevin Lin | 2024/1/1 |
Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition | arXiv preprint arXiv:2403.12339 | Jielin Qiu William Han Winfred Wang Zhengyuan Yang Linjie Li | 2024/3/19 |
Learning 3D Photography Videos via Self-supervised Diffusion on Single Images | arXiv preprint arXiv:2302.10781 | Xiaodong Wang Chenfei Wu Shengming Yin Minheng Ni Jianfeng Wang | 2023/2/21 |
Disco: Disentangled control for referring human dance generation in real world | arXiv e-prints | Tan Wang Linjie Li Kevin Lin Chung-Ching Lin Zhengyuan Yang | 2023/6 |
Weakly supervised semantic parsing | 2023/9/14 | ||
Interfacing Foundation Models' Embeddings | arXiv preprint arXiv:2312.07532 | Xueyan Zou Linjie Li Jianfeng Wang Jianwei Yang Mingyu Ding | 2023/12/12 |
DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design | arXiv preprint arXiv:2310.15144 | Kevin Lin Zhengyuan Yang Linjie Li Jianfeng Wang Lijuan Wang | 2023/10/23 |
Equivariant similarity for vision-language foundation models | Tan Wang Kevin Lin Linjie Li Chung-Ching Lin Zhengyuan Yang | 2023 | |
Idea2img: Iterative self-refinement with gpt-4v (ision) for automatic image design and generation | arXiv preprint arXiv:2310.08541 | Zhengyuan Yang Jianfeng Wang Linjie Li Kevin Lin Chung-Ching Lin | 2023/10/12 |
Mm-vet: Evaluating large multimodal models for integrated capabilities | arXiv preprint arXiv:2308.02490 | Weihao Yu Zhengyuan Yang Linjie Li Jianfeng Wang Kevin Lin | 2023/8/4 |
Diagnostic benchmark and iterative inpainting for layout-guided image generation | arXiv preprint arXiv:2304.06671 | Jaemin Cho Linjie Li Zhengyuan Yang Zhe Gan Lijuan Wang | 2023/4/13 |
Mm-narrator: Narrating long-form videos with multimodal in-context learning | arXiv preprint arXiv:2311.17435 | Chaoyi Zhang Kevin Lin Zhengyuan Yang Jianfeng Wang Linjie Li | 2023/11/29 |
ReCo: Region-Controlled Text-to-Image Generation | IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) | Zhengyuan Yang Jianfeng Wang Zhe Gan Linjie Li Kevin Lin | 2023 |
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation | arXiv preprint arXiv:2303.12346 | Shengming Yin Chenfei Wu Huan Yang Jianfeng Wang Xiaodong Wang | 2023/3/22 |
Openleaf: Open-domain interleaved image-text generation and evaluation | arXiv preprint arXiv:2310.07749 | Jie An Zhengyuan Yang Linjie Li Jianfeng Wang Kevin Lin | 2023/10/11 |
Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models | arXiv preprint arXiv:2307.14648 | Xin Yuan Linjie Li Jianfeng Wang Zhengyuan Yang Kevin Lin | 2023/7/27 |