Role overview

Principal Site Reliability Engineer

Requirements and responsibilities

Readable role content extracted into sections for faster review.

What you will be doing

  • Design and build advanced cloud-native infrastructure
  • Guide technical discussions with clients and build technical roadmaps
  • Collaborate with the Engineering Director(s) to (re)design architecture
  • Assist the Site Reliability Manager with resource planning
  • Assist engineering managers with building career paths for individuals wishing to be promoted to Principal Engineers
  • Teach, mentor, grow, and provide advice to other domain experts, individual contributors, and across several teams.
  • Document processes and monitor performance metrics
  • Guide conversations to remove blockers and encourage collaboration across teams.
  • Constantly improve the stability, scalability, security, cost-effectiveness, and operational excellence of our clients' systems.
  • Continuously discover, evaluate, and implement new technologies to maximize development efficiency and security.
  • Conduct infrastructure planning, testing, and development
  • Provide technical leadership on multiple projects.

What you must have

  • At least 7 or more years experience working in a DevOps/SRE team
  • Extensive experience in DevOps/SRE, team management and collaboration
  • Advanced knowledge of best practices related to data encryption and cybersecurity
  • Advanced knowledge of the general DevOps/SRE landscape, architectures, and emerging technologies
  • Cloud experience, preferably GCP, Azure and AWS
  • Experience in Observability Practices and Incident Management
  • Extensive experience with Prometheus, Grafana, the Elastic Stack and all versions of Beats, especially within Kubernetes
  • Experience with Infrastructure as Code, preferably Terraform
  • Experience with general automation and config management, preferably Ansible
  • Extensive experience building and maintaining Kubernetes clusters and workloads
  • Strong foundation of basic network and security concepts
  • Ability to build robust CICD pipelines
  • Familiarity with relational and non-relational databases
  • Solid understanding of Linux operating systems

Qualities & Behaviours

  • Exceptional interpersonal and communication skills
  • A zest for automation
  • Comfortable working as a remote team member and leader
  • Ability to keep up to date with DevOps/SRE best practices, trends and innovation
  • Passionate about mentoring and growing technical skills within the team

Becoming a Martian means:

  • Comfortably working and learning from a fully remote, culturally diverse team based predominantly in South Africa, Kenya, Nigeria and Ghana.
  • Being an open, honest and respectful communicator.
  • You enjoy asking questions, identifying areas of improvement and proposing solutions, no matter your job title or whether you have been with us for a day, a month or years!
  • You are comfortable taking initiative and operating independently.
  • You thrive in a fast paced environment, where change is constant.
  • You find it exciting to work with various clients, from different industries, each with a different problem for you and your team to solve.
  • Intentionally sharing tech and industry trends that excite you with your peers.
  • Seeking continuous feedback and actively taking steps to continuously grow personally and professionally.
Similar roles

Keep a backup shortlist.

Browse stack
FocusSite Reliability EngineeringRole area
Seniority signalSeniorCandidate level
StackAWS, Azure, GCPPrimary skills
Location2 accepted countriesEligibility

Stack

Use these tags to compare similar remote roles.

Location eligibility

Candidates should apply only when their profile country is listed here.

Your profileCountry not setSign in to check your country against this role.

Hiring flow

WithMira shows the role, then sends candidates to the company application.

1Check role fit, stack, and location eligibility in WithMira.
2Open the company application page from the tracked apply link.
3Save the role or subscribe for similar opportunities before leaving.
Apply on company siteCompany siteOpen link