Berkeley Research Group
Site Reliability Engineer
Remote Site Reliability Engineering role with a clear candidate location fit.
Published: 2 Jul 2026
Eligible countries: 1 country accepted
Seniority signal: Senior
Work model: Remote
Accepted candidate locations
United States
Role summary
Site Reliability Engineer
Requirements and responsibilities
Responsibilities:
- Design, implement, and maintain scalable and reliable systems in cloud environments such as Azure Cloud Services.
- Work with CI/CD platforms (GitHub Actions, GitLab CI).
- Provide operational support for full-stack software applications.
- Increase system resilience through expert-level coding and rigorous release and change management.
- Develop service-level indicators and objectives to automate release validation.
- Improve automation and increase the system’s self-healing capability.
- Collect operating system data and report performance metrics to stakeholders.
- Ensure security best practices are followed in cloud infrastructure and application deployments.
- Manage cloud and database system maintenance, debugging production issues as they arise.
- Improve reliability, quality, and time-to-market of our suite of software solutions.
- Partner with security and product teams to define and publish policies, processes, and playbooks to facilitate rapid and effective handling of alerts and incidents.
- Lead incident management processes; respond to outages and service disruptions promptly.
Qualifications:
- Bachelor’s degree in computer science or similar field.
- Five years of experience as a site reliability engineer or in a similar role.
- Strong programming skills (Golang, Ruby, Python, or similar).
- Proven ability to diagnose and monitor performance and reliability issues across the stack.
- Expertise in Kubernetes.
- Relevant industry certifications, such as the Site Reliability Engineering (SRE) Foundation certification.
- Proven experience working with cloud-native infrastructure (Azure Cloud Services, AWS, or GCP).
- Experience working with observability and incident management tools (Datadog, OpsGenie, PagerDuty).
- Experience scripting operating system tasks with Infrastructure as Code.
- Impeccable communication skills.
- Ability to problem-solve in a fast-paced, high-stakes environment.