Role overview

Product Manager, Inference Platform

Requirements and responsibilities

Readable role content extracted into sections for faster review.

Impact and outcomes you'll drive

  • You will own how workloads scale and where they land — autoscaling to demand (up under load, down to zero when idle) and a single placement policy expressing region, compliance regime, and capacity preference, with compliance-bound workloads given right-of-way on sensitive capacity.
  • You will make production inference reliable by default — every request reaches a healthy replica, rolling deploys never drop traffic, region-aware routing with multi-region / active-active and fallback as first-class policy, and health-aware recovery from stuck or bad replicas.
  • You will build the release engine beneath safe rollouts — the traffic-shifting that powers canary/shadow/A/B, warm-ups, drain, and probes.
  • You will push the cost/performance frontier for serving AI at scale — latency, throughput, uptime, and cost-efficiency, plus a measurable decline in MTTR through self-serve incident management.

What we're looking for

  • 8+ years in product management, including deep experience with infrastructure, distributed systems, or ML serving.
  • You reason fluently about scaling, routing, failover, and the cost/performance frontier — and you earn the respect of staff engineers doing it.
  • You’ve owned capabilities end to end, backend through UX, rather than a single slice.
  • You drive cross-team roadmaps and the dependencies beneath them, and you're at your best defining a category that doesn't fully exist yet.

Not for You if

  • You don't like getting technical.
  • You prefer only strategy, UX, or writing great docs over doing whatever it takes to ship great products for customers.
  • You lean toward applied AI over building platform, systems, GPUs, models, and scaling platforms and infrastructure.
  • You're not interested in the foundational, sometimes unglamorous work of making AI systems reliable and scalable at the infrastructure level.

Not for You if

  • Competitive compensation, including meaningful equity.
  • 100% coverage of medical, dental, and vision insurance for employee and dependents
  • Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
  • Paid parental leave
  • Fertility and family-building stipend through Carrot
  • Company-facilitated 401(k)
  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
Similar roles

Keep a backup shortlist.

Browse stack
FocusProductRole area
Seniority signalLeadCandidate level
StackKubernetes, SparkPrimary skills
Location1 accepted countryEligibility

Stack

Use these tags to compare similar remote roles.

Location eligibility

Candidates should apply only when their profile country is listed here.

Your profileCountry not setSign in to check your country against this role.

Hiring flow

WithMira shows the role, then sends candidates to the company application.

1Check role fit, stack, and location eligibility in WithMira.
2Open the company application page from the tracked apply link.
3Save the role or subscribe for similar opportunities before leaving.
Apply on company siteCompany siteOpen link