Baseten

Product Manager, Inference Platform

Vaga remota de Product com fit claro de localização do candidato.

Publicada2 de abr. de 2026

Países elegíveis1 país aceito

Sinal de senioridadeLead

Modelo de trabalhoRemoto

Locais aceitos para candidatos

Estados Unidos

Posso mesmo aplicar?Confira a lista de países

Países aceitos para candidatos estão listados (1).

Atualidade da fonte2 de abr. de 2026

Fit de localização1 país aceito

Match de stackKubernetes, Spark

Caminho de aplicaçãoSite da empresa

Resumo de fit da MiraPor que vale revisar esta vaga

Fit de localização1 país aceitoAdicione seu país

Match de stackAdicione skills ao perfil para compararKubernetes, Spark

Sinal de senioridadeLeadDefina seu nível para uma análise mais precisa.

Prontidão para aplicarSite da empresaA aplicação continua no site da empresa.

Aplicação

Aplicar no site da empresa

Aplicação externa

Aplicando paraProduct Manager, Inference PlatformBaseten

Fit de país1 país aceito

Caminho de aplicaçãoSite da empresa

WithMiraSalve ou assine antes de sair

Aplicação da empresa

O WithMira mantém esta vaga para descoberta. A aplicação continua no site da empresa.

Resumo da vaga

Conteúdo da vaga extraído em seções para revisão mais rápida.

You will own how workloads scale and where they land — autoscaling to demand (up under load, down to zero when idle) and a single placement policy expressing region, compliance regime, and capacity preference, with compliance-bound workloads given right-of-way on sensitive capacity.
You will make production inference reliable by default — every request reaches a healthy replica, rolling deploys never drop traffic, region-aware routing with multi-region / active-active and fallback as first-class policy, and health-aware recovery from stuck or bad replicas.
You will build the release engine beneath safe rollouts — the traffic-shifting that powers canary/shadow/A/B, warm-ups, drain, and probes.
You will push the cost/performance frontier for serving AI at scale — latency, throughput, uptime, and cost-efficiency, plus a measurable decline in MTTR through self-serve incident management.

8+ years in product management, including deep experience with infrastructure, distributed systems, or ML serving.
You reason fluently about scaling, routing, failover, and the cost/performance frontier — and you earn the respect of staff engineers doing it.
You’ve owned capabilities end to end, backend through UX, rather than a single slice.
You drive cross-team roadmaps and the dependencies beneath them, and you're at your best defining a category that doesn't fully exist yet.

You don't like getting technical.
You prefer only strategy, UX, or writing great docs over doing whatever it takes to ship great products for customers.
You lean toward applied AI over building platform, systems, GPUs, models, and scaling platforms and infrastructure.
You're not interested in the foundational, sometimes unglamorous work of making AI systems reliable and scalable at the infrastructure level.

Competitive compensation, including meaningful equity.
100% coverage of medical, dental, and vision insurance for employee and dependents
Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
Paid parental leave
Fertility and family-building stipend through Carrot
Company-facilitated 401(k)
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Vagas similares