Resumo da vaga

Senior Site Reliability Engineer II- Infrastructure (AI Native)

Requisitos e responsabilidades

Conteúdo da vaga extraído em seções para revisão mais rápida.

What You’ll Do

  • Scaling and maintaining our infrastructure and services using AI (Claude Code) as a first-class collaborator in your daily development workflow.
  • Being opinionated on technical direction and strategy (and documenting those opinions for others to be able to follow).
  • Leading and mentoring other engineers on the team
  • Owning and resolving the most complex infrastructure failures — Kubernetes scheduling edge cases, networking degradation, cross-service cascading failures, and AWS platform issues that other engineers escalate
  • Participating in a shared on-call rotation (roughly one week every six to eight weeks on call)
  • Estimating schedules, breaking tasks down to reasonable 1-3 day tasks.
  • Driving cloud cost efficiency by identifying over-provisioned resources, rightsizing EC2 and container workloads, and building tooling to surface cost anomalies before they compound

What We’re Looking For

  • Bachelor's in Computer Science, Engineering, related field, or equivalent practical experience
  • Expert-level experience (5+ years) managing medium to large-scale deployments on AWS (~5000 instances, 50+ accounts), or equivalent.
  • 3+ years of experience programming in Java, Python, or other formal programming languages
  • Strong Kubernetes experience (3+ years) deploying and managing at scale (100s of Deployments,10k+ containers, 20k+ Cores).Understanding of container orchestration and microservicesExperience with service discovery/service mesh
  • Understanding of container orchestration and microservices
  • Experience with service discovery/service mesh
  • Strong Linux administration experience, shell/bash scripting.
  • Expert-level experience with Infrastructure as code tools: Terraform, CloudFormation; config management/provisioning tools: Ansible, Chef, etc.
  • Strong Build / Automation / CI/CD experience.
  • Strong Knowledge/experience with networking and load-balancer technologies.
  • Experience with existing open-source projects such as Consul, Docker, ArgoCD, Nexus, Jenkins
  • Experience with large-scale Kafka deployments
  • Database knowledge is a plus.
  • Excellent troubleshooting skills, expertise with any monitoring tools, and attention to detail
  • Excellent interpersonal skills and highly collaborative working style
  • Hands-on experience with AI coding tools (Claude Code, Cursor, or equivalent) used for infrastructure scripting, incident response automation, or tooling development

Details

  • Understanding of container orchestration and microservices
  • Experience with service discovery/service mesh

AI-Native Expectations

  • Daily use: You use AI coding assistants (we support Claude Code, Cursor, and GitHub Copilot) for real, substantive tasks: analysis, coding, refactoring, testing, navigating codebases, and documentation. Not just research or autocomplete.
  • Judgment and ownership: AI-generated code gets the same review you'd give any PR. You are accountable for everything you ship.
  • Agentic thinking: We are actively building multi-agent systems that automate operational work — from incident response pipelines to infrastructure remediation. You don't need to have built a production agentic system, but you should be curious about this space, have opinions on where it works and where it breaks, and be ready to contribute to it.
  • Velocity: We expect senior engineers who use AI well to operate with meaningful leverage — doing more with the same time, taking on problems that would otherwise require a larger team.
  • Team leadership: You share what works. You automate prompting strategies that others can build on.
  • Continuous learning: The tooling is changing fast. You stay current and bring recommendations to the team.

Our Benefits

  • Competitive pay and benefits
  • Medical, dental, vision, life and disability insurance plans
  • RRSP plan with DPSP company matching program
  • Employee Assistance Program (EAP) for mental well-being
  • Flexible PTO, several company-wide days off throughout the year
  • Winter and Summer Week-long Synchronized Company Shutdowns
  • Learning & Development programs
  • Equipment, tools, and reimbursement support for a productive remote environment
  • Free Life360 Platinum Membership for your preferred circle
  • Free Tile Products

Life360 Values

  • Be a Good Person - We have a team of high-integrity people you can trust.
  • Be Direct With Respect - We communicate directly, even when it’s hard.
  • Members Before Metrics - We focus on building an exceptional experience for families.
  • High Intensity High Impact - We do whatever it takes to get the job done.
Vagas similares

Mantenha uma lista reserva.

Ver stack
FocoSite Reliability EngineeringÁrea da vaga
Sinal de senioridadeSeniorNível do candidato
StackAWS, CI/CD, DockerSkills principais
Localização1 país aceitoElegibilidade

Stack

Use estas tags para comparar vagas remotas similares.

Elegibilidade de localização

Candidatos devem aplicar apenas quando o país do perfil estiver listado aqui.

Seu perfilPaís não definidoEntre para comparar seu país com esta vaga.

Fluxo de contratação

O WithMira mostra a vaga e depois envia candidatos para a aplicação da empresa.

1Confira fit da vaga, stack e elegibilidade de localização no WithMira.
2Abra a página de aplicação da empresa pelo link rastreado.
3Salve a vaga ou assine oportunidades similares antes de sair.
Aplicar no site da empresaSite da empresaAbrir link