Red Hat

Customer Site Reliability Engineer- OpenShift Managed Cloud Services (Kubernete

Vaga remota de Site Reliability Engineering com fit claro de localização do candidato.

Publicada15 de jun. de 2026

Países elegíveis1 país aceito

Sinal de senioridadeSenior

Modelo de trabalhoRemoto

Locais aceitos para candidatos

Austrália

AWS Azure GCP Golang Kubernetes

Posso mesmo aplicar?Confira a lista de países

Países aceitos para candidatos estão listados (1).

Atualidade da fonte15 de jun. de 2026

Fit de localização1 país aceito

Match de stackAWS, Azure

Caminho de aplicaçãoSite da empresa

Resumo de fit da MiraPor que vale revisar esta vaga

Fit de localização1 país aceitoAdicione seu país

Match de stackAdicione skills ao perfil para compararAWS, Azure

Sinal de senioridadeSeniorDefina seu nível para uma análise mais precisa.

Prontidão para aplicarSite da empresaA aplicação continua no site da empresa.

Aplicação

Aplicar no site da empresa

Aplicação externa

Aplicando paraCustomer Site Reliability Engineer- OpenShift Managed Cloud Services (KuberneteRed Hat

Fit de país1 país aceito

Caminho de aplicaçãoSite da empresa

WithMiraSalve ou assine antes de sair

Aplicação da empresa

O WithMira mantém esta vaga para descoberta. A aplicação continua no site da empresa.

Aplicar no site da empresa

Salvar vaga

Resumo da vaga

Customer Site Reliability Engineer- OpenShift Managed Cloud Services (Kubernete

Requisitos e responsabilidades

Conteúdo da vaga extraído em seções para revisão mais rápida.

What you will do

Manage large-scale, distributed systems, focusing on minimizing downtime and improving system resilience.
Maintain customer trust and confidence by ensuring stability and functionality of services.
Drive continuous enhancement of processes, tools, and methodologies to support the evolving needs of the service.
Lead the development of code and automation scripts to optimize the scalability, reliability, and performance of services.
Lead and participate in high-priority customer escalations, adopting a customer-first mindset.
Coordinate and execute complex incident response procedures, ensuring timely resolution and thorough postmortems.
Collaborate with cross-functional teams to enhance system robustness.
Demonstrate a proactive mindset to help preempt escalations and ensure reliable operations.
Document resolutions, root causes, and best practices to enrich the knowledge base and promote self-service solutions.
Mentor and coach team members, fostering a culture of continuous learning, knowledge sharing and collaboration.
Participate in on-call rotation and provide leadership during critical incidents.
Collaborate on strategic AI and automation projects designed to increase the efficiency of fleet operations and troubleshooting, ultimately delivering a better product experience for customers.
Given the customer-facing nature of this SRE role, exceptional communication skills are essential. You must demonstrate the ability to articulate complex technical solutions and lead critical incident calls with confidence, even in high-pressure environments."

What you will bring

Advanced Experience with OpenShift/Kubernetes container platform support or administration.
Proficient with container-based technologies on Linux.
Proficient in managing Linux-based systems in a public cloud such as AWS, Azure, or GCP.
Advanced experience with enterprise systems monitoring; knowledge of Prometheus is preferred.
Advanced with enterprise configuration management such as Ansible, Terraform.
Software engineering experience using object-oriented languages; golang is preferred.
Superior communications skills and experience working directly with and presenting to customers.
Ability to quickly learn new technologies and follow industry trends.
Demonstrated ability to quickly and accurately troubleshoot systems issues.
Solid understanding of standard TCP/IP networking and common protocols.
Fluent in English and any additional language like Japanese, Chinese, Korean, Spanish is an advantage.

Vagas similares