Shuhuai Ren

About Shuhuai Ren

Shuhuai Ren, With an exceptional h-index of 9 and a recent h-index of 9 (since 2020), a distinguished researcher at Peking University, specializes in the field of Deep Learning, Natural Language Processing.

His recent articles reflect a diverse array of research interests and contributions to the field:

TempCompass: Do Video LLMs Really Understand Videos?

PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain

LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?

Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality

TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding

MIT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning

Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition

TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Shuhuai Ren Information

University

Position

___

Citations(all)

769

Citations(since 2020)

768

Cited By

104

hIndex(all)

9

hIndex(since 2020)

9

i10Index(all)

8

i10Index(since 2020)

8

Email

University Profile Page

Google Scholar

Shuhuai Ren Skills & Research Interests

Deep Learning

Natural Language Processing

Top articles of Shuhuai Ren

Title

Journal

Author(s)

Publication Date

TempCompass: Do Video LLMs Really Understand Videos?

arXiv preprint arXiv:2403.00476

Yuanxin Liu

Shicheng Li

Yi Liu

Yuxiang Wang

Shuhuai Ren

...

2024/3/1

PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain

arXiv preprint arXiv:2402.15527

Liang Chen

Yichi Zhang

Shuhuai Ren

Haozhe Zhao

Zefan Cai

...

2024/2/21

LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?

arXiv preprint arXiv:2404.10763

Yuchi Wang

Shuhuai Ren

Rundong Gao

Linli Yao

Qingyan Guo

...

2024/4/16

Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality

arXiv preprint arXiv:2403.19221

Sishuo Chen

Lei Li

Shuhuai Ren

Rundong Gao

Yuanxin Liu

...

2024/3/28

TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding

arXiv preprint arXiv:2310.19060

Shuhuai Ren

Sishuo Chen

Shicheng Li

Xu Sun

Lu Hou

2023/10/29

MIT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning

arXiv preprint arXiv:2306.04387

Lei Li

Yuwei Yin

Shicheng Li

Liang Chen

Peiyi Wang

...

2023/6/7

Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition

Advances in Neural Information Processing Systems

Shuhuai Ren

Aston Zhang

Yi Zhu

Shuai Zhang

Shuai Zheng

...

2024/2/13

TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

arXiv preprint arXiv:2312.02051

Shuhuai Ren

Linli Yao

Shicheng Li

Xu Sun

Lu Hou

2023/12/4

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models

arXiv preprint arXiv:2311.17404

Shicheng Li

Lei Li

Shuhuai Ren

Yuanxin Liu

Yi Liu

...

2023/11/29

FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation

Advances in Neural Information Processing Systems

Yuanxin Liu

Lei Li

Shuhuai Ren

Rundong Gao

Shicheng Li

...

2024/2/13

Delving into the Openness of CLIP

arXiv preprint arXiv:2206.01986

Shuhuai Ren

Lei Li

Xuancheng Ren

Guangxiang Zhao

Xu Sun

2022/6/4

CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade

arXiv preprint arXiv:2012.14682

Lei Li

Yankai Lin

Deli Chen

Shuhuai Ren

Peng Li

...

2020/12/29

Dynamic Knowledge Distillation for Pre-trained Language Models

arXiv preprint arXiv:2109.11295

Lei Li

Yankai Lin

Shuhuai Ren

Peng Li

Jie Zhou

...

2021/9/23

Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification

arXiv preprint arXiv:2109.00523

Shuhuai Ren

Jinchao Zhang

Lei Li

Xu Sun

Jie Zhou

2021/9/1

Learning Relation Alignment for Calibrated Cross-modal Retrieval

arXiv preprint arXiv:2105.13868

Shuhuai Ren

Junyang Lin

Guangxiang Zhao

Rui Men

An Yang

...

2021/5/28

Cuge: A chinese language understanding and generation evaluation benchmark

arXiv preprint arXiv:2112.13610

Yuan Yao

Qingxiu Dong

Jian Guan

Boxi Cao

Zhengyan Zhang

...

2021/12/27

DCA: Diversified Co-Attention towards Informative Live Video Commenting

Zhihan Zhang

Zhiyi Yin

Shuhuai Ren

Xinhang Li

Shicheng Li

2020/10/1

See List of Professors in Shuhuai Ren University(Peking University)

Co-Authors

academic-engine