Zhengyuan Yang

University of Rochester

H-index: 25

North America-United States

About Zhengyuan Yang

Zhengyuan Yang is a researcher at the University of Rochester with an h-index of 25 overall and 25 since 2020. He specializes in Computer Vision, Multimedia, Vision + Language, and Multimodal research.

His recent articles include:

Design2Code: How Far Are We From Automating Front-End Engineering?

StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis

Bring Metric Functions into Diffusion Models

List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training

Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition

Learning 3D Photography Videos via Self-supervised Diffusion on Single Images

Disco: Disentangled control for referring human dance generation in real world

Zhengyuan Yang Information

University

Position

___

Citations (all): 3117
Citations (since 2020): 3101
Cited By: 283
h-index (all): 25
h-index (since 2020): 25
i10-index (all): 31
i10-index (since 2020): 31

Email

University Profile Page: University of Rochester

Zhengyuan Yang Skills & Research Interests

Computer Vision

Multimedia

Vision + Language

Multimodal

Top articles of Zhengyuan Yang

Title: Design2Code: How Far Are We From Automating Front-End Engineering?
Journal: arXiv preprint arXiv:2403.03163
Author(s): Chenglei Si, Yanzhe Zhang, Zhengyuan Yang, Ruibo Liu, Diyi Yang
Publication Date: 2024/3/5

Title: StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis
Journal: arXiv preprint arXiv:2401.17093
Author(s): Zecheng Tang, Chenfei Wu, Zekai Zhang, Mingheng Ni, Shengming Yin, ...
Publication Date: 2024/1/30

Title: Bring Metric Functions into Diffusion Models
Journal: arXiv preprint arXiv:2401.02414
Author(s): Jie An, Zhengyuan Yang, Jianfeng Wang, Linjie Li, Zicheng Liu, ...
Publication Date: 2024/1/4

Title: List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
Journal: arXiv preprint arXiv:2404.16375
Author(s): An Yan, Zhengyuan Yang, Junda Wu, Wanrong Zhu, Jianwei Yang, ...
Publication Date: 2024/4/25

Title: COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training
Journal: arXiv preprint arXiv:2401.00849
Author(s): Alex Jinpeng Wang, Linjie Li, Kevin Qinghong Lin, Jianfeng Wang, Kevin Lin, ...
Publication Date: 2024/1/1

Title: Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition
Journal: arXiv preprint arXiv:2403.12339
Author(s): Jielin Qiu, William Han, Winfred Wang, Zhengyuan Yang, Linjie Li, ...
Publication Date: 2024/3/19

Title: Learning 3D Photography Videos via Self-supervised Diffusion on Single Images
Journal: arXiv preprint arXiv:2302.10781
Author(s): Xiaodong Wang, Chenfei Wu, Shengming Yin, Minheng Ni, Jianfeng Wang, ...
Publication Date: 2023/2/21

Title: Disco: Disentangled control for referring human dance generation in real world
Journal: arXiv e-prints
Author(s): Tan Wang, Linjie Li, Kevin Lin, Chung-Ching Lin, Zhengyuan Yang, ...
Publication Date: 2023/6

Title: Weakly supervised semantic parsing
Publication Date: 2023/9/14

Title: Interfacing Foundation Models' Embeddings
Journal: arXiv preprint arXiv:2312.07532
Author(s): Xueyan Zou, Linjie Li, Jianfeng Wang, Jianwei Yang, Mingyu Ding, ...
Publication Date: 2023/12/12

Title: DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design
Journal: arXiv preprint arXiv:2310.15144
Author(s): Kevin Lin, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Lijuan Wang
Publication Date: 2023/10/23

Title: Equivariant similarity for vision-language foundation models
Author(s): Tan Wang, Kevin Lin, Linjie Li, Chung-Ching Lin, Zhengyuan Yang, ...
Publication Date: 2023

Title: Idea2img: Iterative self-refinement with GPT-4V(ision) for automatic image design and generation
Journal: arXiv preprint arXiv:2310.08541
Author(s): Zhengyuan Yang, Jianfeng Wang, Linjie Li, Kevin Lin, Chung-Ching Lin, ...
Publication Date: 2023/10/12

Title: Mm-vet: Evaluating large multimodal models for integrated capabilities
Journal: arXiv preprint arXiv:2308.02490
Author(s): Weihao Yu, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, ...
Publication Date: 2023/8/4

Title: Diagnostic benchmark and iterative inpainting for layout-guided image generation
Journal: arXiv preprint arXiv:2304.06671
Author(s): Jaemin Cho, Linjie Li, Zhengyuan Yang, Zhe Gan, Lijuan Wang, ...
Publication Date: 2023/4/13

Title: Mm-narrator: Narrating long-form videos with multimodal in-context learning
Journal: arXiv preprint arXiv:2311.17435
Author(s): Chaoyi Zhang, Kevin Lin, Zhengyuan Yang, Jianfeng Wang, Linjie Li, ...
Publication Date: 2023/11/29

Title: ReCo: Region-Controlled Text-to-Image Generation
Journal: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Author(s): Zhengyuan Yang, Jianfeng Wang, Zhe Gan, Linjie Li, Kevin Lin, ...
Publication Date: 2023

Title: NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
Journal: arXiv preprint arXiv:2303.12346
Author(s): Shengming Yin, Chenfei Wu, Huan Yang, Jianfeng Wang, Xiaodong Wang, ...
Publication Date: 2023/3/22

Title: Openleaf: Open-domain interleaved image-text generation and evaluation
Journal: arXiv preprint arXiv:2310.07749
Author(s): Jie An, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, ...
Publication Date: 2023/10/11

Title: Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models
Journal: arXiv preprint arXiv:2307.14648
Author(s): Xin Yuan, Linjie Li, Jianfeng Wang, Zhengyuan Yang, Kevin Lin, ...
Publication Date: 2023/7/27

Co-Authors

Jiebo Luo
University of Rochester
H-index: 121

Jinsong Su
Xiamen University
H-index: 33
