Resumen del rol

Site Reliability Engineer

Requisitos y responsabilidades

Contenido del rol extraído en secciones para revisar más rápido.

Details

  • Ensure the reliability of our critical products and services by meeting or exceeding SRE objectives.
  • Instantiate and maintain production infrastructure using Infrastructure as Code and Configuration Management tools.
  • Build and maintain proper monitoring of our services by utilizing centralized logging and time series databases.
  • Automate deployments, administration, and monitoring of our services by following CI/CD practices.
  • Work with engineering and information security teams to enhance, document, establish processes and generally improve the operability and security of our services.
  • Participation in team on-call rotation is required.
  • Additional tasks associated with this position may be assigned in response to company initiatives and business needs.

More information about this role:

  • This is a full-time and remote position based in the UK with the ideal candidate located within a 1-hour driving radius of greater Manchester area.
  • This is an individual contributor role, reporting to our Manager, Site Reliability Engineering.
  • The targeted compensation package for this role is between GBP 40,000 and 45,000, subject upon internal equity and years of experience. We may make further adjustments through an approval process if the targeted compensation range needs to be modified based on business needs and market trends.

Education:

  • Bachelor's degree in information systems, computer science, technology, or a related field is strongly preferred. In lieu of degree, 2+ years of relevant and/or equivalent experience is acceptable.

Experience:

  • Minimum of 3+ years of software and/or operational experience in building and maintaining internet-facing production environments is required.
  • Strong experience with Linux/Unix systems administration.
  • Knowledge of source control tools (Git preferred).

Experience:

  • Strong scripting abilities in Bash and Python.
  • Experience with incident management, troubleshooting, and root cause analysis.
  • Experience in handling postmortems, building incident response plans, and improving incident resolution procedures.
  • Experience running and maintaining real-world build systems (Jenkins, DroneCI, or similar tools)
  • Demonstrable experience with the entire life cycle of software, starting with Systems Architecture, Systems Design, Implementation, Maintenance, and Operation.
  • Programming experience using HTTP Service APIs.
  • Ability to mount and install network devices such as routers, switches, and servers in data centers. This includes rack mounting, cable management, and hardware and peripheral configuration.
  • Virtualization experience (VMWare, Proxmox, Oracle Linux Virtualization Manager).
  • Network administration experience is a plus.
  • Exposure to Security and Testing frameworks is a plus.
  • Exposure to compliant regulated industries such as Finance, Healthcare, or Government is a plus.
  • Experience with distributed data processing, databases, and large-scale file systems is a plus.
Roles similares

Mantén una lista de respaldo.

Ver stack
FocoSite Reliability EngineeringÁrea del rol
Señal de senioritySeniorNivel del candidato
StackCI/CD, PythonSkills principales
Ubicación1 país aceptadoElegibilidad

Stack

Usa estas tags para comparar roles remotos similares.

Elegibilidad de ubicación

Candidatos deberían aplicar solo cuando el país del perfil aparece aquí.

Tu perfilPaís no definidoInicia sesión para comparar tu país con este rol.

Flujo de contratación

WithMira muestra el rol y luego envía candidatos a la aplicación de la empresa.

1Revisa fit del rol, stack y elegibilidad de ubicación en WithMira.
2Abre la página de aplicación de la empresa desde el link rastreado.
3Guarda el rol o suscríbete a oportunidades similares antes de salir.
Aplicar en el sitio de la empresaSitio de la empresaAbrir link