Baseten

Product Manager, Inference Platform

Rol remoto de Product con fit claro de ubicación del candidato.

Publicado2 abr 2026

Países elegibles1 país aceptado

Señal de seniorityLead

Modelo de trabajoRemoto

Ubicaciones aceptadas para candidatos

Estados Unidos

Puedo aplicar realmente?Revisa la lista de países

Las ubicaciones aceptadas para candidatos están listadas (1).

Actualidad de la fuente2 abr 2026

Fit de ubicación1 país aceptado

Match de stackKubernetes, Spark

Camino de aplicaciónSitio de la empresa

Resumen de fit de MiraPor qué vale revisar este rol

Fit de ubicación1 país aceptadoAgrega tu país

Match de stackAgrega skills al perfil para compararKubernetes, Spark

Señal de seniorityLeadDefine tu nivel para una revisión más precisa.

Preparación para aplicarSitio de la empresaLa aplicación continúa en el sitio de la empresa.

Aplicación

Aplicar en el sitio de la empresa

Aplicación externa

Aplicando aProduct Manager, Inference PlatformBaseten

Fit de país1 país aceptado

Camino de aplicaciónSitio de la empresa

WithMiraGuarda o suscríbete antes de salir

Aplicación de la empresa

WithMira mantiene este rol para descubrimiento. La aplicación continúa en el sitio de la empresa.

Resumen del rol

Contenido del rol extraído en secciones para revisar más rápido.

You will own how workloads scale and where they land — autoscaling to demand (up under load, down to zero when idle) and a single placement policy expressing region, compliance regime, and capacity preference, with compliance-bound workloads given right-of-way on sensitive capacity.
You will make production inference reliable by default — every request reaches a healthy replica, rolling deploys never drop traffic, region-aware routing with multi-region / active-active and fallback as first-class policy, and health-aware recovery from stuck or bad replicas.
You will build the release engine beneath safe rollouts — the traffic-shifting that powers canary/shadow/A/B, warm-ups, drain, and probes.
You will push the cost/performance frontier for serving AI at scale — latency, throughput, uptime, and cost-efficiency, plus a measurable decline in MTTR through self-serve incident management.

8+ years in product management, including deep experience with infrastructure, distributed systems, or ML serving.
You reason fluently about scaling, routing, failover, and the cost/performance frontier — and you earn the respect of staff engineers doing it.
You’ve owned capabilities end to end, backend through UX, rather than a single slice.
You drive cross-team roadmaps and the dependencies beneath them, and you're at your best defining a category that doesn't fully exist yet.

You don't like getting technical.
You prefer only strategy, UX, or writing great docs over doing whatever it takes to ship great products for customers.
You lean toward applied AI over building platform, systems, GPUs, models, and scaling platforms and infrastructure.
You're not interested in the foundational, sometimes unglamorous work of making AI systems reliable and scalable at the infrastructure level.

Competitive compensation, including meaningful equity.
100% coverage of medical, dental, and vision insurance for employee and dependents
Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
Paid parental leave
Fertility and family-building stipend through Carrot
Company-facilitated 401(k)
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Roles similares