Resumen del rol

Senior Site Reliability Engineer

Requisitos y responsabilidades

Contenido del rol extraído en secciones para revisar más rápido.

Details

  • Build and extend platforms to improve system reliability
  • Work on team goals that encompass reliability for the entire company
  • Standardize reliability tools across multiple platforms and organizations
  • Triage, coordinate, and lead stabilization of sev 0–1 incidents
  • Serve as primary oncall, maintaining structured escalation paths and exercising leadership escalation
  • Drive platform-wide reliability improvements, shared operational tooling, and deploy-safety patterns
  • Use AI-driven systems to improve signal detection, reduce noise, and accelerate root cause analysis
  • Design and implement safe deployment patterns (progressive delivery, automated rollback, guardrails)
  • Drive to root cause systems with many moving parts and take the necessary steps to fix them
  • Demonstrated technical initiative and leadership on previous projects, especially those with a backend/platform focus
  • Familiarity with AI-driven tooling for observability, incident analysis, or automation
  • A mindset that naturally reaches for AI to accelerate problem-solving and reduce toil
  • Experience running production oncall for high-availability systems
  • Strong incident management skills — structured triage, mitigation under pressure, blameless postmortems
  • Fluency with CI/CD pipelines, progressive rollout strategies, and rollback automation
  • Monitoring & observability expertise — building/tuning alerts for uptime, error rates, latency regression, and resource exhaustion
  • Ability to create and maintain evidence-based maturity assessments using trailing 90-day data windows.
  • Comfort with vendor/dependency management — maintaining validated escalation contacts reachable within ≤ 5 minutes.
  • Boundless curiosity, autonomy, and a strong sense of accountability
  • A strong desire to perform and grow as an engineer
  • 5+ years of software development experience
  • Kotlin, Modern Java (11+)
  • HTTP, JSON, gRPC, and Protocol Buffers
  • MySQL / Vitess / DynamoDB
  • Event driven architectures
  • DataDog
  • LaunchDarkly
  • Terraform, Kubernetes, Istio/Envoy
  • Amazon Web Services
  • Healthcare coverage (Medical, Vision and Dental insurance)
  • Health Savings Account and Flexible Spending Account
  • Retirement Plans including company match
  • Employee Stock Purchase Program
  • Wellness programs, including access to mental health, 1:1 financial planners, and a monthly wellness allowance
  • Paid parental and caregiving leave
  • Paid time off (including 12 paid holidays)
  • Paid sick leave (1 hour per 26 hours worked (max 80 hours per calendar year to the extent legally permissible) for non-exempt employees and covered by our Flexible Time Off policy for exempt employees)
  • Learning and Development resources
  • Paid Life insurance, AD&D, and disability benefits
Roles similares

Mantén una lista de respaldo.

Ver stack
Foco10402 Engineering - Product Platform EngineeringÁrea del rol
Señal de senioritySeniorNivel del candidato
StackCI/CD, Java, KubernetesSkills principales
Ubicación27 países aceptadosElegibilidad

Stack

Usa estas tags para comparar roles remotos similares.

Elegibilidad de ubicación

Candidatos deberían aplicar solo cuando el país del perfil aparece aquí.

Flujo de contratación

WithMira muestra el rol y luego envía candidatos a la aplicación de la empresa.

1Revisa fit del rol, stack y elegibilidad de ubicación en WithMira.
2Abre la página de aplicación de la empresa desde el link rastreado.
3Guarda el rol o suscríbete a oportunidades similares antes de salir.
Aplicar en el sitio de la empresaSitio de la empresaAbrir link