Publications

* indicates equal contribution or alphabetical ordering.

Preprints

  1. BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
    Xuwu Wang, Qiwen Cui, Yunzhe Tao, Yiran Wang, Ziwei Chai, Xiaotian Han, Boyi Liu, Jianbo Yuan, Jing Su, Guoyin Wang, Tingkai Liu, Liyu Chen, Tianyi Liu, Tao Sun, Yufeng Zhang, Sirui Zheng, Quanzeng You, Yang Yang, Hongxia Yang

  2. Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques
    Natalia Zhang*, Xinqi Wang*, Qiwen Cui*, Runlong Zhou, Sham M. Kakade, Simon S. Du

  3. (N,K)-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
    Yufeng Zhang, Liyu Chen, Boyi Liu, Yingxiang Yang, Qiwen Cui, Yunzhe Tao, Hongxia Yang

2024

  1. Learning Optimal Tax Design in Nonatomic Congestion Games
    Qiwen Cui, Maryam Fazel, Simon S. Du
    Conference on Neural Information Processing Systems (NeurIPS) 2024

  2. Refined Sample Complexity for Markov Games with Independent Linear Function Approximation
    Yan Dai, Qiwen Cui, Simon S. Du
    The 37th Annual Conference on Learning Theory (COLT) 2024

  3. A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning
    Haozhe Jiang, Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon S. Du
    International Conference on Learning Representations (ICLR) 2024

  4. Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning
    Zhaoyi Zhou, Chuning Zhu, Runlong Zhou, Qiwen Cui, Abhishek Gupta, Simon S. Du
    International Conference on Learning Representations (ICLR) 2024

2023

  1. Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation
    Qiwen Cui, Kaiqing Zhang, Simon S. Du
    The 36th Annual Conference on Learning Theory (COLT) 2023

  2. Offline Congestion Games: How Feedback Type Affects Data Coverage Requirement
    Haozhe Jiang*, Qiwen Cui*, Zhihan Xiong, Maryam Fazel, Simon S. Du
    International Conference on Learning Representations (ICLR) 2023

2022

  1. Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus
    Qiwen Cui, Simon S. Du
    Conference on Neural Information Processing Systems (NeurIPS) 2022

  2. When is Offline Two-Player Zero-Sum Markov Game Solvable?
    Qiwen Cui, Simon S. Du
    Conference on Neural Information Processing Systems (NeurIPS) 2022

  3. Learning in Congestion Games with Bandit Feedback
    Qiwen Cui*, Zhihan Xiong*, Maryam Fazel, Simon S. Du
    Conference on Neural Information Processing Systems (NeurIPS) 2022

  4. Near-Optimal Randomized Exploration for Tabular MDP
    Zhihan Xiong*, Ruoqi Shen*, Qiwen Cui*, Maryam Fazel, Simon S. Du
    Conference on Neural Information Processing Systems (NeurIPS) 2022

  5. On Gap-dependent Bounds for Offline Reinforcement Learning
    Xinqi Wang, Qiwen Cui, Simon S. Du
    Conference on Neural Information Processing Systems (NeurIPS) 2022

  6. NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning
    Minghan Yang, Dong Xu, Qiwen Cui, Zaiwen Wen, Pengxiang Xu
    IEEE Transactions on Pattern Analysis and Machine Intelligence 2022

2021

  1. Randomized Exploration for Reinforcement Learning with General Value Function Approximation
    Haque Ishfaq*, Qiwen Cui*, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup, Lin F. Yang
    International Conference on Machine Learning (ICML) 2021

  2. Minimax Sample Complexity for Turn-based Stochastic Game
    Qiwen Cui, Lin F. Yang
    Uncertainty in Artificial Intelligence (UAI) 2021

  3. Clinical Decision Support Model for Tooth Extraction Therapy Derived from Electronic Dental Records
    Qiwen Cui, Qingxiao Chen, Pufan Liu, Debin Liu, Zaiwen Wen
    The Journal of Prosthetic Dentistry 2021

2020

  1. Is Plug-in Solver Sample-Efficient for Feature-based Reinforcement Learning?
    Qiwen Cui, Lin F. Yang
    Conference on Neural Information Processing Systems (NeurIPS) 2020