Manager, RL Algorithms & Decoder

Zoox
Zoox

Software Engineering

Foster City, CA, USA

Posted on Jul 1, 2026
The Onboard Behavior Model Architecture team is responsible for developing deep learning models that leverage data and compute at large scale to train driving models. We learn and predict behaviors from large scale expert data and large scale reinforcement learning to produce a ML driver that is safe, comfortable, and completes the mission. In this role, you will collaborate closely with the Onboard Perception, Cost Planner, Simulation, Validation, Data Science, Systems Engineering, QA, and ML Infra teams.

In this role, you will:

  • Lead a Team: Manage, mentor, and grow a team of individual contributors, fostering a culture of innovation and continuous improvement.

  • Develop Strategy: Develop and organize our overall strategy for Onboard Behavior ML Models for generating driving plans for our autonomous vehicle. You will interface with multiple partner teams to identify opportunities for model improvements within their problem area. You’ll be setting the short and long term technical direction for the team and collaborate on broader company-wide directions.

  • Provide technical guidance and leadership in the design and development of training models at large scale and work with partner teams on ensuring their efficient inference.

  • Monitor Performance: Establish and monitor key performance indicators (KPIs) to measure the effectiveness of work packages and drive continuous improvement.

  • Manage Resources: Manage the allocation of resources within the team, ensuring that projects are staffed appropriately and that team members have the necessary tools and support to succeed.

Qualifications

  • Expertise with Reinforcement Learning and Machine Learning for at least one of these areas: Planning, LLMs, VLAs/VLMs, recommendation systems.

  • Extensive experience with programming and algorithm design, strong mathematics skills.

  • MS or PhD degree in computer science or related field.

  • 5+ years of experience with production Machine Learning pipelines, with at least 3 years in a leadership or management role.

Bonus Qualifications

  • Conference or Journal publications in Machine Learning or Robotics related venues.

  • Prior experience working with autonomous vehicles or robotics, diffusion models, large scale training.

277000 - 349000 USD a year

Base Salary Range
There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign-on bonus may be offered as part of the compensation package. The listed range applies only to the base salary. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.
Zoox also offers a comprehensive package of benefits, including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.