Okta
Senior Site Reliability Engineer (Auth0)
Remote Tech Ops-610 role with clear candidate location fit.
Posted: Recently added
Eligible countries: 37 accepted countries
Seniority signal: Senior
Work setting: Remote
Requirements and responsibilities
- Design and build custom software in Go to enhance the platform's reliability, resiliency, and redundancy.
- Partner with engineering teams to embed reliability principles, improving the availability, performance, and observability of our services.
- Use your deep understanding of infrastructure and observability principles to identify opportunities for improvement within the product and implement solutions.
- Contribute to our follow-the-sun on-call rotation, providing rapid, effective response to critical incidents and using your expertise to troubleshoot, mitigate or accurately escalate production issues. Because our team is globally distributed, your on-call shifts will only occur during your standard local working hours.
- Develop and refine our SRE tooling and processes, focusing on automation and operational efficiency.
- Define, document, and champion reliability best practices across the organisation.
- A proactive and systematic approach to problem-solving, with a high degree of ownership.
- Proven experience in a production environment supporting large-scale, mission-critical applications with a high degree of autonomy.
- Proficiency in at least one programming language, with a preference for Go. You should be comfortable writing custom applications, not just scripts.
- Experience with infrastructure as code (Terraform), container orchestration (Kubernetes, Docker) and GitOps (ArgoCD).
- Demonstrable expertise in a major cloud provider (Azure, AWS, or GCP).
- A strong grasp of microservices architecture, databases (SQL, NoSQL), and networking fundamentals, so you can understand how custom code can solve platform-level issues.
- An understanding of core SRE principles, including SLIs, SLOs, and error budgets.
- Experience in an on-call rotation for a 24/7 cloud-based environment.
- Exceptional communication and collaboration skills, with a proven ability to work effectively in a remote, distributed team, where tasks may be self-driven.
- Supporting Your Well-Being
- Driving Social Impact
- Developing Talent and Fostering Connection + Community
Location eligibility
Candidates should apply only when their profile country is listed here.
Hiring flow
WithMira shows the role, then sends candidates to the company application.
1. Check role fit, stack, and location eligibility in WithMira.
2. Open the company application page from the tracked apply link.
3. Save the role or subscribe for similar opportunities before leaving.