Jack (Hao) Bai

haob2 AT illinois DOT edu


Hi there! I’m Jack. I’m currently in the last year of my MS program and will resume my studies in the PhD program at UIUC CS, advised by Prof. Tong Zhang. I work closely with Sergey Levine @ BAIR, Aviral Kumar @ CMU MLD, and Nan Jiang @ UIUC.

Recently, my research has focused on enhancing the reasoning and planning capabilities of intelligent agents with foundation models and reinforcement learning (RL). I identify as an empirical RL person but still try to keep my methods principled.

I received my dual undergrad degree from UIUC and Zhejiang University. During those wonderful years, I was lucky enough to work with Yi Ma @ BAIR, Chengxiang Zhai @ UIUC, and Shilin He @ MSR.

A public up-to-date resume can be found here.

News

Mar 06, 2025 I am thrilled to resume my PhD studies at UIUC CS, advised by Prof. Tong Zhang! Stay tuned for our work on RL+VLMs; we’re cooking something big.
Jan 23, 2025 My second paper on building device-control agents with RL, Digi-Q, has been accepted to ICLR 2025! Check out the preprint! This work was done while I was visiting BAIR, advised by Sergey Levine and Aviral Kumar.
Jan 23, 2025 My first representation learning paper, CRATE-LM, has been selected as an Oral at CPAL 2025! This work was done while I was visiting BAIR, advised by Prof. Yi Ma.

Latest Posts

Selected Publications

  1. ICLR 2025
    Digi-Q: Transforming VLMs to Device-Control Agents via Value-Based Offline RL
    Hao Bai, Yifei Zhou, Erran Li, Sergey Levine, and Aviral Kumar
    Jan 2025
  2. Oral @ CPAL 2025
    Improving Neuron-level Interpretability with White-box Language Models
    Hao Bai and Yi Ma
    Oct 2024
  3. NeurIPS 2024, Oral @ ICML WS
    DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning
    Hao Bai, Yifei Zhou, Jiayi Pan, Mert Cemri, Alane Suhr, Sergey Levine, and Aviral Kumar
    Jun 2024
  4. NeurIPS 2024
    Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
    Yuexiang Zhai, Hao Bai, Zipeng Lin, Jiayi Pan, Shengbang Tong, Yifei Zhou, Alane Suhr, Saining Xie, Yann LeCun, Yi Ma, and Sergey Levine
    May 2024
  5. JMLR
    White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?
    Yaodong Yu, Sam Buchanan, Druv Pai, Tianzhe Chu, Ziyang Wu, Shengbang Tong, Hao Bai, Yuexiang Zhai, Benjamin D Haeffele, and Yi Ma
    Apr 2024
  6. WSDM’24
    CharmBana: Progressive Responses with Real-Time Internet Search for Knowledge-Powered Conversations
    Revanth Gangi Reddy, Sharath Chandra, Hao Bai, Wentao Yao, ..., and Chengxiang Zhai
    Feb 2024
  7. EMNLP’23
    Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations
    Revanth Reddy, Hao Bai, Wentao Yao, Sharath Chandra Etagi Suresh, Heng Ji, and ChengXiang Zhai
    Oct 2023