Qingquan Bao

Qingquan Bao /tɕʰɪŋ/ /tɕʰɥœn/ /bɑʊ/

Software Engineer, ML Motion Planning @ Zoox

Zoox

Biography

Hello! I’m Qingquan Bao, a Software Engineer on the ML Motion Planning team at Zoox, working on Pretraining and Reinforcement Learning for L4 autonomous driving.

My aspiration is to develop a versatile agent capable of assisting with both physical and intellectual tasks in our daily lives. I firmly believe that embodied AGI is crucial for advancing humanity’s liberty and democracy.

I completed my M.S. in Robotics (GPA 4.0/4.0) at the GRASP Lab, University of Pennsylvania in May 2025. Prior to Penn, I graduated cum laude (Zhiyuan Honor) in 2023 from Shanghai Jiao Tong University. During my undergraduate years, I collaborated with Professor Junchi Yan on Vision Graph Matching and Trustworthy AI (CVPR 2022), and had the privilege of working with Professors Joshua B. Tenenbaum, Chuang Gan, Leslie Pack Kaelbling, and Tomás Lozano-Pérez on Embodied AI at MIT.

Interests
  • Artificial General Intelligence
  • Robotics and Embodied AI
  • Humanoid Robots
  • Mobile Manipulation
Education
  • M.S. in Robotics, GPA 4.0/4.0, 2025

    University of Pennsylvania

  • B.E. in Artificial Intelligence (Honors), GPA 4.0/4.3, 2023

    Shanghai Jiao Tong University

Skills

Technical
Python
Deep Learning (Pytorch)
JAX
C++
SQL
Hobbies
tennis Tennis
Photography
sword Kendo

Experience

 
 
 
 
 
Zoox (L4 Robotaxi)
Software Engineer, ML Motion Planning
June 2025 – Present Foster City, CA

Develop Pretraining and Reinforcement Learning to build long-horizon trajectory models for production.

  • Designed autoregressive policy head for BC/RL planner; improved candidate selection by 2% and safety score by 0.8%
  • Evaluated representation-learning (MAE-style pretraining) alignment with downstream RL objectives
  • Traced JAX runtime regression (8× slowdown) and restored throughput via vectorized workaround
  • Shipped virtual stopline for pickup/drop-off zones, eliminating looping incidents
 
 
 
 
 
Zoox
ML Engineer Intern, ML Motion Planning
May 2024 – August 2024 Foster City, CA

Collaborated with Prof. Nick Roy (MIT) on representation learning for autonomous driving.

  • Built OOD detector using intermediate RL planner embeddings
  • Introduced Masked Autoencoder (MAE) for scenario representations; boosted OOD AUC by +22.7%
 
 
 
 
 
Research Assistant
July 2022 – January 2023 Cambridge, MA, US
Led the research project “Embodied Depth Prediction”
 
 
 
 
 
Shanghai Jiao Tong University
Research Assitant
Shanghai Jiao Tong University
December 2020 – June 2023 Shanghai, China

Projects include:

  • Deep Graph Matching library with PaddlePaddle
  • Appearance and Structure Aware Robust Deep Visual Graph Matching: Attack, Defense and Beyond
  • Visual Model Search and Optimization for Software and Hardware Co-design (My Bachelor’s thesis)

Projects

*
Embodied Depth Prediction
We study the problem of embodied depth prediction, where an embodied agent in an environment must learn to accurately estimate the depth of its surroundings.
Embodied Depth Prediction
Heuristic Reward Driven Athlete Trainer
Let’s train an excellent Olymic runner with partial observations and use curiosity as dense rewards!
Heuristic Reward Driven Athlete Trainer
Deep Graph Matching library with PaddlePaddle
Contribute to an over 700⭐️ open-source graph matching model library!
Deep Graph Matching library with PaddlePaddle

Contact

Feel free to contact me!