Journals (and Preprints in Revision)

*Zhaoran Wang*

Is Pessimism Provably Efficient for Offline RL? Ying Jin, Zhuoran Yang, Zhaoran Wang (equal contribution) International Conference on Machine Learning (ICML), 2021 (short version) Mathematics of Operations Research (MOR), 2024 (long version)

Distributional Policy Evaluation in Reinforcement Learning Zhengling Qi, Chenjia Bai, Zhaoran Wang, Lan Wang Journal of the American Statistical Association (JASA), 2024

A Primal-Dual Approach to Constrained Markov Decision Processes Yi Chen, Jing Dong, Zhaoran Wang Management Science (MS), 2024

False Correlation Reduction for Offline Reinforcement Learning Zhihong Deng, Zuyue Fu, Lingxiao Wang, Zhuoran Yang, Chenjia Bai, Tianyi Zhou, Zhaoran Wang, Jing Jiang IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2023

Federated Offline Reinforcement Learning Doudou Zhou, Yufeng Zhang, Zhaoran Wang, Junwei Lu, Tianxi Cai Journal of the American Statistical Association (JASA), 2023

Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers? Han Zhong, Zhuoran Yang, Zhaoran Wang, Michael I Jordan Journal of Machine Learning Research (JMLR), 2023

Online Bootstrap Inference for Policy Evaluation in Reinforcement Learning Pratik Ramprasad, Yuantong Li, Zhuoran Yang, Zhaoran Wang, Will W Sun, Guang Cheng Journal of the American Statistical Association (JASA), 2022

Bridging Exploration and General Function Approximation in Reinforcement Learning: Provably Efficient Kernel and Neural Value Iteration Zhuoran Yang, Chi Jin, Zhaoran Wang, Mengdi Wang, Michael I Jordan Advances in Neural Information Processing Systems (NeurIPS), 2020 (short version) Under Major Revision at Operations Research (OR), 2022 (long version)

Near-Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection Yi Chen, Yining Wang, Ethan X Fang, Zhaoran Wang, Runze Li Journal of the American Statistical Association (JASA), 2022

A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang (alphabetical order) SIAM Journal on Optimization (SIOPT), 2022

Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium Qiaomin Xie, Yudong Chen†, Zhaoran Wang†, Zhuoran Yang† (†: alphabetical order) Annual Conference on Learning Theory (COLT), 2020 (short version) Mathematics of Operations Research (MOR), 2022 (long version)

Natural Actor-Critic Converges Globally for Hierarchical Linear-Quadratic Regulators Yuwei Luo, Zhuoran Yang, Zhaoran Wang, Mladen Kolar Accepted (Upon Minor Revision) at Journal of Machine Learning Research (JMLR), 2022

Fairness-Oriented Learning for Optimal Individualized Treatment Rules Ethan X Fang, Zhaoran Wang, Lan Wang Journal of the American Statistical Association (JASA), 2021

On the Finite-Time Convergence of the Actor-Critic Algorithm Shuang Qiu, Zhuoran Yang, Jieping Ye, Zhaoran Wang IEEE Journal on Selected Areas in Information Theory (JSAIT), 2021

Provably Efficient Reinforcement Learning with Linear Function Approximation Chi Jin, Zhuoran Yang, Zhaoran Wang, Michael I Jordan Annual Conference on Learning Theory (COLT), 2020 (short version) Mathematics of Operations Research (MOR), 2022 (long version)

Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima Qi Cai, Zhuoran Yang, Jason D Lee, Zhaoran Wang Advances in Neural Information Processing Systems (NeurIPS), 2019 (short version) Mathematics of Operations Research (MOR), 2022 (long version)

A Theoretical Analysis of Deep Q-Learning Jianqing Fan, Zhaoran Wang, Yuchen Xie, Zhuoran Yang (alphabetical order) Learning for Dynamics and Control (L4DC), 2020 (short version) Revision and Resubmission at Annals of Statistics (AOS), 2022 (long version)

Curses of Heterogeneity: Computational Barriers in Sparse Mixture Models and Phase Retrieval Jianqing Fan, Han Liu, Zhaoran Wang, Zhuoran Yang (alphabetical order) Under Major Revision at Annals of Statistics (AOS), 2020

High-Dimensional Varying Index Coefficient Models via Stein’s Identity Sen Na, Zhuoran Yang, Zhaoran Wang, Mladen Kolar Journal of Machine Learning Research (JMLR), 2019

A Convex Formulation for High-Dimensional Sparse Sliced Inverse Regression Kean Ming Tan, Zhaoran Wang, Tong Zhang, Han Liu, Dennis Cook Biometrika (BMK), 2019

Misspecified Nonconvex Statistical Optimization for Phase Retrieval Zhuoran Yang, Lin F Yang, Ethan X Fang, Tuo Zhao, Zhaoran Wang, Matey Neykov Mathematical Programming (MP), 2019

Symmetry, Saddle Points, and Global Optimization Landscape in Nonconvex Matrix Factorization Xingguo Li, Junwei Lu, Raman Arora, Jarvis Haupt, Han Liu, Zhaoran Wang, Tuo Zhao IEEE Transactions on Information Theory (TIT), 2019

Agnostic Estimation for Misspecified Phase Retrieval Models Matey Neykov, Zhaoran Wang, Han Liu Advances in Neural Information Processing Systems (NeurIPS), 2016 (short version) Journal of Machine Learning Research (JMLR), 2020 (long version)

Nonconvex Statistical Optimization for Sparse Tensor Graphical Models Will W Sun, Zhaoran Wang, Han Liu, Guang Cheng Advances in Neural Information Processing Systems (NeurIPS), 2015 (short version) IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2019 (long version)

Sparse Generalized Eigenvalue Problems: Optimal Statistical Rates via Truncated Rayleigh Flow Kean Ming Tan, Zhaoran Wang, Han Liu, Tong Zhang Journal of the Royal Statistical Society (JRSS), 2018

Optimal Computational and Statistical Rates of Convergence for Sparse Nonconvex Learning Problems Zhaoran Wang, Han Liu, Tong Zhang Annals of Statistics (AOS), 2014 (INFORMS Best Student Paper Finalist in Data Mining)

A Strictly Contractive Peaceman-Rachford Splitting Method for Convex Programming Bingsheng He, Han Liu, Zhaoran Wang, Xiaoming Yuan (alphabetical order) SIAM Journal on Optimization (SIOPT), 2013 (ASA Best Student Paper in Statistical Learning and Data Mining)