Qiwen Cui

alt text 

Qiwen Cui
Ph.D. student
Paul G. Allen School of Computer Science & Engineering
University of Washington
Email: qwcui at cs (dot) washington (dot) edu
Google Scholar

About me

I am currently a first year Ph.D. student in the Paul G. Allen School of Computer Science & Engineering at University of Washington. I am very fortunate to be advised by Professor Simon Shaolei Du. My research interests are boardly in machine learning theory. I have been working on reinforcement learning theory and I am also exploring other areas like optimization and game theory.

Prior to starting my Ph.D. study, I did my undergrad in the School of Mathematical Sciences at Peking University advised by Professor Zaiwen Wen. I had a great summer working with Professor Lin F. Yang in 2020, who led me into the world of reinforcement learning theory.

Research

My research interests include

  • Reinforcement Learning

  • Optimization

  • Game Theory

Publications

  1. Randomized Exploration for Reinforcement Learning with General Value Function Approximation
    Haque Ishfaq*, Qiwen Cui*, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup, Lin F Yang
    International Conference on Machine Learning (ICML) 2021

  2. Minimax sample complexity for turn-based stochastic game
    Qiwen Cui, Lin F. Yang
    Uncertainty in Artificial Intelligence (UAI) 2021

  3. Is Plug-in Solver Sample-Efficient for Feature-based Reinforcement Learning?
    Qiwen Cui, Lin F. Yang
    Conference on Neural Information Processing Systems (NeurIPS) 2020

Preprints

  1. Learning in Congestion Games with Bandit Feedback
    Qiwen Cui*, Zhihan Xiong*, Maryam Fazel, Simon S. Du

  2. Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus
    Qiwen Cui, Simon S. Du

  3. On Gap-dependent Bounds for Offline Reinforcement Learning
    Xinqi Wang, Qiwen Cui, Simon S. Du

  4. When is Offline Two-Player Zero-Sum Markov Game Solvable?
    Qiwen Cui, Simon S. Du

  5. Near-Optimal Randomized Exploration for Tabular MDP
    Zhihan Xiong*, Ruoqi Shen*, Qiwen Cui*, Simon S. Du

  6. NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning
    Minghan Yang, Dong Xu, Qiwen Cui, Zaiwen Wen, Pengxiang Xu