Shuang Qiu at University of Michigan

University	University of Michigan
Citations (all)	555
Citations (since 2020)	543
Cited by	112
h-index (all)	13
h-index (since 2020)	13
i10-index (all)	17
i10-index (since 2020)	16

Stochastic gradient descent without full data shuffle: with applications to in-database machine learning and deep learning systems

arXiv preprint arXiv:2206.05830

2022/6/12

Shuang Qiu

H-Index: 3

Jiawei Jiang

H-Index: 1

Guoliang Li

H-Index: 15

Ji Liu

H-Index: 4

Wentao Wu

H-Index: 13

Ce Zhang

H-Index: 3

Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards

arXiv preprint arXiv:2402.18571

2024/2/28

Haoxiang Wang

H-Index: 2

Yong Lin

H-Index: 12

Wei Xiong

H-Index: 8

Rui Yang

H-Index: 4

Shuang Qiu

H-Index: 3

Han Zhao

H-Index: 15

Tong Zhang

H-Index: 20

Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment

arXiv preprint arXiv:2402.10207

2024/2/15

Rui Yang

H-Index: 4

Feng Luo

H-Index: 24

Shuang Qiu

H-Index: 3

Dong Yu

H-Index: 22

Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

Advances in Neural Information Processing Systems

2024/2/13

Shuang Qiu

H-Index: 3

Zhaoran Wang

H-Index: 25

Zhuoran Yang

H-Index: 1

Tong Zhang

H-Index: 20

Gradient-Variation Bound for Online Convex Optimization with Constraints

Proceedings of the AAAI Conference on Artificial Intelligence

2023/6/26

Shuang Qiu

H-Index: 3

Mladen Kolar

H-Index: 20

On the Value of Myopic Behavior in Policy Reuse

arXiv preprint arXiv:2305.17623

2023/5/28

Kang Xu

H-Index: 6

Chenjia Bai

H-Index: 2

Shuang Qiu

H-Index: 3

Bin Zhao

H-Index: 18

Zhen Wang

H-Index: 42

Wei Li

H-Index: 8

Optimistic Exploration with Learned Features Provably Solves Markov Decision Processes with Neural Dynamics

2022/9/29

Lingxiao Wang

H-Index: 1

Shuang Qiu

H-Index: 3

Zuyue Fu

H-Index: 3

Zhuoran Yang

H-Index: 1

Csaba Szepesvári

H-Index: 51

Zhaoran Wang

H-Index: 25

Castle in the sky: Dynamic sky replacement and harmonization in videos

IEEE Transactions on Image Processing

2022/7/26

Zhengxia Zou

H-Index: 15

Rui Zhao

H-Index: 3

Shuang Qiu

H-Index: 3

Zhenwei Shi

H-Index: 31

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

2022

Shuang Qiu

H-Index: 3

Lingxiao Wang

H-Index: 1

Chenjia Bai

H-Index: 2

Zhuoran Yang

H-Index: 1

Zhaoran Wang

H-Index: 25

In-Database Machine Learning with CorgiPile: Stochastic Gradient Descent without Full Data Shuffle

2022/6/10

Shuang Qiu

H-Index: 3

Jiawei Jiang

H-Index: 1

Guoliang Li

H-Index: 15

Ji Liu

H-Index: 4

Wentao Wu

H-Index: 13

Ce Zhang

H-Index: 3

Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach

arXiv preprint arXiv:2202.12797

2022/2/25

Zhaoran Wang

H-Index: 25

Zhuoran Yang

H-Index: 1

Provably efficient fictitious play policy optimization for zero-sum Markov games with structured transitions

2021/7/1

Shuang Qiu

H-Index: 3

Zhaoran Wang

H-Index: 25

Zhuoran Yang

H-Index: 1

On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game

2021/7/1

Shuang Qiu

H-Index: 3

Zhaoran Wang

H-Index: 25

Zhuoran Yang

H-Index: 1

On Finite-Time Convergence of Actor-Critic Algorithm

IEEE Journal on Selected Areas in Information Theory

2021/5/19

Shuang Qiu

H-Index: 3

Zhuoran Yang

H-Index: 1

Zhaoran Wang

H-Index: 25

Pine: Universal deep embedding for graph nodes via partial permutation invariant set functions

IEEE Transactions on Pattern Analysis and Machine Intelligence

2021/2/23

Shupeng Gui

H-Index: 4

Xiangliang Zhang

H-Index: 30

Shuang Qiu

H-Index: 3

Zhengdao Wang

H-Index: 19

Ji Liu

H-Index: 4

Stylized neural painting

2021

Zhengxia Zou

H-Index: 15

Shuang Qiu

H-Index: 3

Yi Yuan

H-Index: 2

Zhenwei Shi

H-Index: 31

Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning

arXiv preprint arXiv:2008.10103

2020/8/23

Shuang Qiu

H-Index: 3

Zhuoran Yang

H-Index: 1

Zhaoran Wang

H-Index: 25

Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss

2020

Shuang Qiu

H-Index: 3

Zhuoran Yang

H-Index: 1

Zhaoran Wang

H-Index: 25

Robust One-Bit Recovery via ReLU Generative Networks: Near-Optimal Statistical Rate and Global Landscape Analysis

2020/11/21

Shuang Qiu

H-Index: 3

Zhuoran Yang

H-Index: 1
