Role overview

Machine Learning Engineer II

Requirements and responsibilities

Readable role content extracted into sections for faster review.

Essential Responsibilities

  • Architect and operate data and training pipelines across cloud and cluster environments.
  • Build and maintain distributed training and orchestration tooling.
  • Design and maintain the data and metadata stores that back our training and evaluation workflows

Skills and Abilities

  • Architect data and model parallelism training infrastructure for large data (>100TB) or large model (>100GB) applications
  • Architecting and operating containerized/pipelined ML Training workloads, including GPU scheduling/autoscaling, dataloader design and experiment tracking.
  • Building and maintaining CI/CD pipelines and infrastructure-as-code (e.g. Terraform).
  • Working with relational and object stores, and high-throughput data formats for ML workloads.

Required

  • Bachelor’s or Master’s degree in Robotics, Computer Science or a related field with strong mathematical and engineering foundations.
  • A minimum of 2 years building ML-oriented infrastructure, platforms, or distributed systems in production.
  • Proficiency in C++, Python and PyTorch with experience in Linux environments.
  • Familiarity with basic concepts in Machine Learning (training loops, basic operators and architectures)

Desirable

  • Proficiency in Go or Rust.
  • Familiarity with ML orchestration and experiment tooling such as Ray, Kubeflow, Airflow, MLflow, or Weights & Biases.
  • Familiarity with distributed training frameworks (PyTorch DDP/FSDP, DeepSpeed).
  • Familiarity with data pipeline and storage technologies (Spark, Parquet, object storage, feature/metadata stores).
  • Familiarity with basic Perception and Planning concepts in Autonomous Driving.

Physical Requirements

  • Standard office working conditions which includes but is not limited to:
  • Prolonged sitting
  • Prolonged standing
  • Prolonged computer use

Details

  • Prolonged sitting
  • Prolonged standing
  • Prolonged computer use

Benefits and Perks

  • Comprehensive healthcare suite including medical, dental, vision, life, and disability plans. Domestic partners who have been residing together at least one year are also eligible to participate.
  • Health Savings and Flexible Spending Healthcare and Dependent Care Accounts available.
  • Rich retirement benefits, including an immediately vested employer safe harbor match.
  • Generous paid parental leave as well as a phased return to work.
  • Flexible vacation policy in addition to paid company holidays.
  • Total Wellness Program providing numerous resources for overall wellbeing
Similar roles

Keep a backup shortlist.

Browse stack
FocusAutonomy EngineeringRole area
Seniority signalMiddleCandidate level
StackCI/CD, Python, SparkPrimary skills
Location1 accepted countryEligibility

Stack

Use these tags to compare similar remote roles.

Location eligibility

Candidates should apply only when their profile country is listed here.

Your profileCountry not setSign in to check your country against this role.

Hiring flow

WithMira shows the role, then sends candidates to the company application.

1Check role fit, stack, and location eligibility in WithMira.
2Open the company application page from the tracked apply link.
3Save the role or subscribe for similar opportunities before leaving.
Apply on company siteCompany siteOpen link