Shuang Qiu
University of Michigan
H-index: 13
North America, United States
Top articles of Shuang Qiu
Stochastic gradient descent without full data shuffle: with applications to in-database machine learning and deep learning systems
arXiv preprint arXiv:2206.05830
2022/6/12
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
arXiv preprint arXiv:2402.18571
2024/2/28
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment
arXiv preprint arXiv:2402.10207
2024/2/15
Posterior Sampling for Competitive RL: Function Approximation and Partial Observation
Advances in Neural Information Processing Systems
2024/2/13
Gradient-Variation Bound for Online Convex Optimization with Constraints
Proceedings of the AAAI Conference on Artificial Intelligence
2023/6/26
On the Value of Myopic Behavior in Policy Reuse
arXiv preprint arXiv:2305.17623
2023/5/28
Optimistic Exploration with Learned Features Provably Solves Markov Decision Processes with Neural Dynamics
2022/9/29
Castle in the sky: Dynamic sky replacement and harmonization in videos
IEEE Transactions on Image Processing
2022/7/26
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
2022
In-Database Machine Learning with CorgiPile: Stochastic Gradient Descent without Full Data Shuffle
2022/6/10
Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach
arXiv preprint arXiv:2202.12797
2022/2/25
Provably efficient fictitious play policy optimization for zero-sum Markov games with structured transitions
2021/7/1
On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game
2021/7/1
On Finite-Time Convergence of Actor-Critic Algorithm
IEEE Journal on Selected Areas in Information Theory
2021/5/19
Pine: Universal deep embedding for graph nodes via partial permutation invariant set functions
IEEE Transactions on Pattern Analysis and Machine Intelligence
2021/2/23
Stylized neural painting
2021
Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning
arXiv preprint arXiv:2008.10103
2020/8/23
Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss
2020
Robust One-Bit Recovery via ReLU Generative Networks: Near-Optimal Statistical Rate and Global Landscape Analysis
2020/11/21