Baseten
Product Manager, Inference Platform
Rol remoto de Product con fit claro de ubicación del candidato.
Publicado2 abr 2026
Países elegibles1 país aceptado
Señal de seniorityLead
Modelo de trabajoRemoto
Ubicaciones aceptadas para candidatos
Estados Unidos
Resumen del rol
Product Manager, Inference Platform
Requisitos y responsabilidades
Contenido del rol extraído en secciones para revisar más rápido.
Impact and outcomes you'll drive
- You will own how workloads scale and where they land — autoscaling to demand (up under load, down to zero when idle) and a single placement policy expressing region, compliance regime, and capacity preference, with compliance-bound workloads given right-of-way on sensitive capacity.
- You will make production inference reliable by default — every request reaches a healthy replica, rolling deploys never drop traffic, region-aware routing with multi-region / active-active and fallback as first-class policy, and health-aware recovery from stuck or bad replicas.
- You will build the release engine beneath safe rollouts — the traffic-shifting that powers canary/shadow/A/B, warm-ups, drain, and probes.
- You will push the cost/performance frontier for serving AI at scale — latency, throughput, uptime, and cost-efficiency, plus a measurable decline in MTTR through self-serve incident management.
What we're looking for
- 8+ years in product management, including deep experience with infrastructure, distributed systems, or ML serving.
- You reason fluently about scaling, routing, failover, and the cost/performance frontier — and you earn the respect of staff engineers doing it.
- You’ve owned capabilities end to end, backend through UX, rather than a single slice.
- You drive cross-team roadmaps and the dependencies beneath them, and you're at your best defining a category that doesn't fully exist yet.
Not for You if
- You don't like getting technical.
- You prefer only strategy, UX, or writing great docs over doing whatever it takes to ship great products for customers.
- You lean toward applied AI over building platform, systems, GPUs, models, and scaling platforms and infrastructure.
- You're not interested in the foundational, sometimes unglamorous work of making AI systems reliable and scalable at the infrastructure level.
Not for You if
- Competitive compensation, including meaningful equity.
- 100% coverage of medical, dental, and vision insurance for employee and dependents
- Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
- Paid parental leave
- Fertility and family-building stipend through Carrot
- Company-facilitated 401(k)
- Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
Roles similares
Mantén una lista de respaldo.
Kubernetes USA
Staff Backend Engineer- Grafana Enterprise| US| RemoteGrafana LabsVer rol Kubernetes USA
Staff Backend Engineer- Grafana Enterprise| Canada| RemoteGrafana LabsVer rol Kubernetes USA
Staff Backend Engineer- Databases Tempo| US| RemoteGrafana LabsVer rol Kubernetes USA
Staff Backend Engineer- Databases Tempo| Canada| RemoteGrafana LabsVer rol Stack
Usa estas tags para comparar roles remotos similares.
Elegibilidad de ubicación
Candidatos deberían aplicar solo cuando el país del perfil aparece aquí.
Tu perfilPaís no definidoInicia sesión para comparar tu país con este rol.
Flujo de contratación
WithMira muestra el rol y luego envía candidatos a la aplicación de la empresa.
1Revisa fit del rol, stack y elegibilidad de ubicación en WithMira.
2Abre la página de aplicación de la empresa desde el link rastreado.
3Guarda el rol o suscríbete a oportunidades similares antes de salir.