Shuang Qiu

Shuang Qiu

University of Michigan

H-index: 13

North America-United States

About Shuang Qiu

Shuang Qiu, With an exceptional h-index of 13 and a recent h-index of 13 (since 2020), a distinguished researcher at University of Michigan, specializes in the field of machine learning, optimization, reinforcement learning.

His recent articles reflect a diverse array of research interests and contributions to the field:

Stochastic gradient descent without full data shuffle: with applications to in-database machine learning and deep learning systems

Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards

Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment

Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

Gradient-Variation Bound for Online Convex Optimization with Constraints

On the Value of Myopic Behavior in Policy Reuse

Optimistic Exploration with Learned Features Provably Solves Markov Decision Processes with Neural Dynamics

Castle in the sky: Dynamic sky replacement and harmonization in videos

Shuang Qiu Information

University

Position

___

Citations(all)

555

Citations(since 2020)

543

Cited By

112

hIndex(all)

13

hIndex(since 2020)

13

i10Index(all)

17

i10Index(since 2020)

16

Email

University Profile Page

Google Scholar

Shuang Qiu Skills & Research Interests

machine learning

optimization

reinforcement learning

Top articles of Shuang Qiu

Stochastic gradient descent without full data shuffle: with applications to in-database machine learning and deep learning systems

arXiv preprint arXiv:2206.05830

2022/6/12

Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards

arXiv preprint arXiv:2402.18571

2024/2/28

Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment

arXiv preprint arXiv:2402.10207

2024/2/15

Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

Advances in Neural Information Processing Systems

2024/2/13

Gradient-Variation Bound for Online Convex Optimization with Constraints

Proceedings of the AAAI Conference on Artificial Intelligence

2023/6/26

On the Value of Myopic Behavior in Policy Reuse

arXiv preprint arXiv:2305.17623

2023/5/28

Optimistic Exploration with Learned Features Provably Solves Markov Decision Processes with Neural Dynamics

2022/9/29

Castle in the sky: Dynamic sky replacement and harmonization in videos

IEEE Transactions on Image Processing

2022/7/26

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

2022

In-Database Machine Learning with CorgiPile: Stochastic Gradient Descent without Full Data Shuffle

2022/6/10

Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach

arXiv preprint arXiv:2202.12797

2022/2/25

Provably efficient fictitious play policy optimization for zero-sum Markov games with structured transitions

2021/7/1

On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game

2021/7/1

On Finite-Time Convergence of Actor-Critic Algorithm

IEEE Journal on Selected Areas in Information Theory

2021/5/19

Pine: Universal deep embedding for graph nodes via partial permutation invariant set functions

IEEE Transactions on Pattern Analysis and Machine Intelligence

2021/2/23

Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning

arXiv preprint arXiv:2008.10103

2020/8/23

Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss

2020

Robust One-Bit Recovery via ReLU Generative Networks: Near-Optimal Statistical Rate and Global Landscape Analysis

2020/11/21

See List of Professors in Shuang Qiu University(University of Michigan)