Jinning Li

SWE @ Google

University of California, Berkeley

About Me

I am currently a software engineer at Google, working on Generative AI evaluation. I received in Mechanical Engineering (Robotics & Control) with Prof. Masayoshi Tomizuka at University of California, Berkeley in 2024. My research was focused on trustworthy planning algorithms for autonomous agents, such as vehicles and robots. I got my Master of Science in 2022, in the middle of my PhD study at UC Berkeley. Previously, I received my Bachelor of Engineering from Harbin Institute of Tehcnology, working with Prof. Huijun Gao and Prof. Weichao Sun on systems control and fault diagnosis.

Interests

Machine Learning
Reinforcment Learning
Control
Optimization

Education

Ph.D. in Robotics & Control, 2024

University of California, Berkeley
M.S. in Robotics, 2022

University of California, Berkeley
B.Eng. in Automation, 2019

Harbin Institute of Technology, China

Work Experience

Software Engineer

Gen AI Evaulation, Google LLC

Aug 2024 – Present Mountain View, CA

Design and create highly configurable prompt-based multimodal auto-rating product based on LLM with traffic scheduling services
Built production-gating, multimodal, nightly evaluations for Gemini models
Improve the latency of model (LLM) inference on TPU and evaluation pipeline

Research Scientist

Machine Learning Research, Nuro

Feb 2024 – Aug 2024 Mountain View, CA

Led the design, building, and deploying of the epistemic uncertainty estimation algorithm for the autonomy system of the in-house self-driving vehicle

Student Associate / Research Intern

Interactive Behavior Modeling, Honda Research Institute

Aug 2023 – Dec 2023 San Jose, CA

Designed algorithms to improve safe generalization of prediction models, incorporating behavior planning modules of the vehicles
Captured invariant information between different traffic scenes across partitions of training datasets by unsupervised learning
Evaluated the proposed algorithm on large-scale datasets, e.g., Waymo Open Motion Dataset & Argoverse 2 Dataset, which are preprocessed and unified to the same format

Graduate Student Researcher / Instructor

University of California, Berkeley

Aug 2019 – Feb 2024 Berkeley, CA

Conduct research on trustworthy and safe machine learning, especially reinforcement learning, and their application on real-world systems
Provide lectures and guide on discussion sessions in graduate-level courses

Software Engineer Intern

Machine Learning Infrastructure Team, Google LLC

May 2023 – Aug 2023 Sunnyvale, CA

Designed and built a JAX-ONNX backend library: Jaxonnxruntime. Github: https://github.com/google/jaxonnxruntime
Passed more than 700 unit tests in both ONNX backend test suites and customized scenarios including Large Language Models
Transformed the original Pytorch LLaMA model to JAX
Exported and served the transformed models by the JAX ecosystem on Google Cloud internal server platforms
Benchmarked the inference of JAX Transformers on model servers with different parallel partition rules on GPUs and TPUs
Customized the library based on the needs of users at Google

Software Engineer Intern

Discover Ads Auction Team, Google LLC

May 2022 – Aug 2022 Mountain View, CA

Designed and built an offline reinforcement learning infrastructure under Tensorflow for discover ads auction
Trained deep NNs to optimize auction long term values from real-world data to achieve better advertiser/user value trade-off
Conducted A/B testing of the trained algorithm on production traffic and polished the models accordingly

Research Projects

Trustworthy Reinforcement Learning Algorithms for Real-World Application

UC Berkeley Jan 2023 – Jun 2024

Designed a guided online distillation algorithm (website) for safe reinforcement learning (RL): extracted skills from human demonstrations by Decision Transformer, and distilled them into a lightweight network in online interactive funetuning for safety enhancement
Proposed a metric to quantify the interaction intensity for multi agent RL, which guides resource allocation for training diverse policies under a constraint budget
Develop a generative model (Diffusion) based simulator producing human like interactions, which can be trained concurrently and accept feedback from planning modules for better sample efficiency and final performance on safety

Resulting Publications:

J. Li et al., ‘’Adaptive Prediction Ensemble: Improving Out-of-Distribution Generalization of Motion Forecasting,’’ in IEEE Robotics and Automation Letters, 2025.
J. Li et al., ‘’Guided Online Distillation: Promoting Safe Reinforcement Learning by Offline Demonstration,’’ in 2024 IEEE Conference on Robotics and Automation (ICRA), 2024.
Y. Chen, C. Tang, R. Tian, C. Li, J.Li et al., ‘’Quantifying Agent Interaction in Multi-agent Reinforcement Learning for Cost-efficient Generalization,’’ in arXiv preprint arXiv:2310.07218, 2023.

Machine Learning Framework and Algorithms Design for Decision Making

UC Berkeley Sep 2020 – Apr 2023

Designed a spatio-temporal graph dual-attention network for multi-agent prediction, considering context information, trajectories of interactive agents, and physical feasibility constraints
Proposed a Pessimistic Offline Reinforcement Learning algorithm, which palliates the distributional shift problem by explicitly handling out-of-distribution states
Built a hierarchical planning framework especially for long horizon tasks, with a high-level module reasons about long-term strategies and plan sub-goals, and low-level goal-conditioned offline reinforcement learning algorithms to achieve sub-tasks

Resulting Publications:

J. Li et al., ‘’Hierarchical Planning Through Goal-Conditioned Offline Reinforcement Learning,’’ in IEEE Robotics and Automation Letters (RA-L), 2022.
J. Li et al., ‘’Spatio-Temporal Graph Dual-Attention Network for Multi-Agent Prediction and Tracking,’’ in IEEE Transactions on Intelligent Transportation Systems, 2021.
J. Li et al., ‘’Dealing with the Unknown: Pessimistic Offline Reinforcement Learning,`` in 2021 Conference on Robot Learning (CoRL), 2021.

Interaction-Aware Behavior Planning for Autonomous Vehicles

UC Berkeley Aug 2019 – Jan 2021

Built an interaction-aware behavior planning algorithm, which predicts the cooperativeness of the surrounding vehicles and solves a POMDP problem by MCTS
Proposed a general hierarchical planning framework, which safely handles various complex urban traffic conditions
Built a simulator that reproduces real traffic scenarios, and the proposed algorithms achieved both high completion rate of around and low collision rate

Resulting Publications:

J. Li et al., ‘’A Safe Hierarchical Planning Framework for Complex Driving Scenarios based on Reinforcement Learning,’’ in 2021 IEEE Conference on Robotics and Automation (ICRA), 2021.
J. Li et al., ‘’Interaction-aware behavior planning for autonomous vehicles validated with real traffic data,’’ in Dynamic Systems and Control Conference (DSCC). American Society of Mechanical Engineers, 2020.

Machine Learning Based Fault Diagnosis for Industrial Processes

Harbin Institute of Technology Mar 2018 – Oct 2018

Built an integrated SVM model with KPCA to extract and compress information, and GA to optimize the model parameters
Evaluated the algorithm on Tennessee Eastman process benchmark. Ablation studies showed that KPCA and GA both boost the performance of the SVM

Resulting Publications:

Jinning Li. ‘’A novel integrated SVM for fault diagnosis using KPCA and GA,’’ in Journal of Physics: Conference Series. IOP Publishing, 2019.

Publications

Jinning Li, Jiachen Li, Sangjae Bae, Isele, David (2025). Adaptive prediction ensemble: Improving out-of-distribution generalization of motion forecasting. In IEEE Robotics and Automation Letters, 2025.

PDF Cite Project

Jinning Li, Xinyi Liu, Banghua Zhu, Jiantao Jiao, Masayoshi Tomizuka, Chen Tang, Wei Zhan (2024). Guided Online Distillation: Promoting Safe Reinforcement Learning by Offline Demonstration. in 2024 IEEE Conference on Robotics and Automation (ICRA).

PDF Cite Video

Yuxin Chen, Chen Tang, Ran Tian, Chenran Li, Jinning Li, Masayoshi Tomizuka, Wei Zhan (2023). Quantifying Agent Interaction in Multi-agent Reinforcement Learning for Cost-efficient Generalization. In arXiv preprint arXiv:2310.07218.

PDF Cite

Jinning Li, Chen Tang, Masayoshi Tomizuka, Wei Zhan (2022). Hierarchical Planning Through Goal-Conditioned Offline Reinforcement Learning. In IEEE Robotics and Automation Letters (RA-L).

PDF Cite

Jinning Li, Chen Tang, Masayoshi Tomizuka, Wei Zhan (2022). Dealing with the Unknown: Pessimistic Offline Reinforcement Learning. In Conference on Robot Learning (CoRL).

PDF Cite

Jiachen Li, Hengbo Ma, Zhihao Zhang, Jinning Li, Masayoshi Tomizuka (2021). Spatio-temporal graph dual-attention network for multi-agent prediction and tracking. In IEEE Transactions on Intelligent Transportation Systems.

PDF Cite

Jinning Li, Liting Sun, Jianyu Chen, Masayoshi Tomizuka, Wei Zhan (2021). A safe hierarchical planning framework for complex driving scenarios based on reinforcement learning. In IEEE Conference on Robotics and Automation (ICRA).

PDF Cite

Jinning Li, Liting Sun, Wei Zhan, Masayoshi Tomizuka (2020). Interaction-aware behavior planning for autonomous vehicles validated with real traffic data. In Dynamic Systems and Control Conference (DSCC).

PDF Cite

Jinning Li (2019). A Novel Integrated SVM for Fault Diagnosis Using KPCA and GA. In Journal of Physics: Conference Series.

PDF Cite