Baseten
Product Manager, Inference Platform
Remote Product role with a clear candidate location fit.
Published: Apr 2, 2026
Eligible countries: 1 country accepted
Seniority signal: Lead
Work model: Remote
Accepted candidate locations
United States
Job summary
Product Manager, Inference Platform
Requirements and responsibilities
Impact and outcomes you'll drive
- You will own how workloads scale and where they land — autoscaling to demand (up under load, down to zero when idle) and a single placement policy expressing region, compliance regime, and capacity preference, with compliance-bound workloads given right-of-way on sensitive capacity.
- You will make production inference reliable by default: every request reaches a healthy replica, rolling deploys never drop traffic, region-aware routing treats multi-region / active-active and fallback as first-class policy, and stuck or bad replicas are handled by health-aware recovery.
- You will build the release engine beneath safe rollouts — the traffic-shifting that powers canary/shadow/A/B, warm-ups, drain, and probes.
- You will push the cost/performance frontier for serving AI at scale — latency, throughput, uptime, and cost-efficiency, plus a measurable decline in MTTR through self-serve incident management.
What we're looking for
- 8+ years in product management, including deep experience with infrastructure, distributed systems, or ML serving.
- You reason fluently about scaling, routing, failover, and the cost/performance frontier — and you earn the respect of staff engineers doing it.
- You’ve owned capabilities end to end, backend through UX, rather than a single slice.
- You drive cross-team roadmaps and the dependencies beneath them, and you're at your best defining a category that doesn't fully exist yet.
Not for You if
- You don't like getting technical.
- You prefer only strategy, UX, or writing great docs over doing whatever it takes to ship great products for customers.
- You lean toward applied AI over building platforms and systems, working with GPUs and models, and scaling infrastructure.
- You're not interested in the foundational, sometimes unglamorous work of making AI systems reliable and scalable at the infrastructure level.
Benefits
- Competitive compensation, including meaningful equity.
- 100% coverage of medical, dental, and vision insurance for employees and dependents
- Flexible PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
- Paid parental leave
- Fertility and family-building stipend through Carrot
- Company-facilitated 401(k)
- Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.