Yu Yang

University of California, Los Angeles

H-index: 9

North America-United States

About Yu Yang

Yu Yang is a researcher at the University of California, Los Angeles, with an h-index of 9 overall and 9 since 2020. Research areas: Data Selection, Efficient ML, and Trustworthy ML.

Yu Yang's recent articles include:

Identifying spurious biases early in training through the lens of simplicity bias

SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models

Robust Learning with Progressive Data Expansion Against Spurious Correlation

Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality

Eliminating spurious correlations from pre-trained models via data mixing

SIEVE: Multimodal Dataset Pruning Using Image Captioning Models

NeSSA: Near-Storage Data Selection for Accelerated Machine Learning Training

Towards Sustainable Learning: Coresets for Data-efficient Deep Learning

Yu Yang Information

University: University of California, Los Angeles

Position: ___

Citations (all): 530

Citations (since 2020): 510

Cited By: 157

h-index (all): 9

h-index (since 2020): 9

i10-index (all): 9

i10-index (since 2020): 9

Yu Yang Skills & Research Interests

Data Selection

Efficient ML

Trustworthy ML

Top articles of Yu Yang

Title: Identifying spurious biases early in training through the lens of simplicity bias
Author(s): Yu Yang, Eric Gan, Gintare Karolina Dziugaite, Baharan Mirzasoleiman
Publication Date: 2024/4/18

Title: SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models
Journal: arXiv preprint arXiv:2403.07384
Author(s): Yu Yang, Siddhartha Mishra, Jeffrey N Chiang, Baharan Mirzasoleiman
Publication Date: 2024/3/12

Title: Robust Learning with Progressive Data Expansion Against Spurious Correlation
Journal: arXiv preprint arXiv:2306.04949
Author(s): Yihe Deng*, Yu Yang*, Baharan Mirzasoleiman, Quanquan Gu
Publication Date: 2023/6/8

Title: Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality
Journal: arXiv preprint arXiv:2310.06982
Author(s): Xuxi Chen, Yu Yang, Zhangyang Wang, Baharan Mirzasoleiman
Publication Date: 2023/10/10

Title: Eliminating spurious correlations from pre-trained models via data mixing
Journal: arXiv preprint arXiv:2305.14521
Author(s): Yihao Xue, Ali Payani, Yu Yang, Baharan Mirzasoleiman
Publication Date: 2023/5/23

Title: SIEVE: Multimodal Dataset Pruning Using Image Captioning Models
Author(s): Anas Mahmoud, Mostafa Elhoushi, Amro Abbas, Yu Yang, Newsha Ardalani, ...
Publication Date: 2024/3/10

Title: NeSSA: Near-Storage Data Selection for Accelerated Machine Learning Training
Author(s): Neha Prakriya, Yu Yang, Baharan Mirzasoleiman, Cho-Jui Hsieh, Jason Cong
Publication Date: 2023

Title: Towards Sustainable Learning: Coresets for Data-efficient Deep Learning
Author(s): Yu Yang, Hao Kang, Baharan Mirzasoleiman
Publication Date: 2023/7

Title: CleanCLIP: Mitigating data poisoning attacks in multimodal contrastive learning
Author(s): Hritik Bansal, Nishad Singhi, Yu Yang, Fan Yin, Aditya Grover, ...
Publication Date: 2023

Title: Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning
Journal: Proceedings of the 40th International Conference on Machine Learning
Author(s): Yu Yang, Besmira Nushi, Hamid Palangi, Baharan Mirzasoleiman
Publication Date: 2023/7

Title: Towards mitigating spurious correlations in the wild: A benchmark & a more realistic dataset
Journal: arXiv preprint arXiv:2306.11957
Author(s): Siddharth Joshi, Yu Yang, Yihao Xue, Wenhan Yang, Baharan Mirzasoleiman
Publication Date: 2023/6/21

Title: Decoding data quality via synthetic corruptions: Embedding-guided pruning of code data
Journal: arXiv preprint arXiv:2312.02418
Author(s): Yu Yang, Aaditya K Singh, Mostafa Elhoushi, Anas Mahmoud, Kushal Tirumala, ...
Publication Date: 2023/12/5

Title: Enhancing fairness in face detection in computer vision systems by demographic bias mitigation
Author(s): Yu Yang, Aayush Gupta, Jianwei Feng, Prateek Singhal, Vivek Yadav, ...
Publication Date: 2022/7/26

Title: Not All Poisons are Created Equal: Robust Training Against Data Poisoning
Author(s): Yu Yang, Tian Yu Liu, Baharan Mirzasoleiman
Publication Date: 2022/6/28

Title: Explaining deep convolutional neural networks via latent visual-semantic filter attention
Author(s): Yu Yang, Seungbae Kim, Jungseock Joo
Publication Date: 2022

Title: Friendly noise against adversarial noise: a powerful defense against data poisoning attack
Journal: Advances in Neural Information Processing Systems
Author(s): Tian Yu Liu, Yu Yang, Baharan Mirzasoleiman
Publication Date: 2022/12/6
