Berkeley Research Group
Site Reliability Engineer
Remote Site Reliability Engineering role with clear candidate location fit.
Posted: Jul 2, 2026
Eligible countries: 1 accepted country
Seniority signal: Senior
Work setting: Remote
Accepted candidate locations
USA
Requirements and responsibilities
Responsibilities
- Design, implement, and maintain scalable and reliable systems in cloud environments such as Azure Cloud Services.
- Work with CI/CD platforms (GitHub Actions, GitLab CI).
- Provide operational support for full-stack software applications.
- Increase system resilience with expert-level coding, bulletproof release, and change management skills.
- Develop service-level indicators and objectives to automate release validation.
- Improve automation and increase the system’s self-healing capability.
- Collect operating system data and report performance metrics to stakeholders.
- Ensure security best practices are followed in cloud infrastructure and application deployments.
- Manage cloud and database system maintenance, debugging production issues as they arise.
- Improve reliability, quality, and time-to-market of our suite of software solutions.
- Partner with security and product teams to define and publish policies, processes, and playbooks to facilitate rapid and effective handling of alerts and incidents.
- Lead incident management processes; respond to outages and service disruptions promptly.
Qualifications
- Bachelor’s degree in computer science or a similar field.
- Five years’ experience as a site reliability engineer or in a similar role.
- Strong programming skills (Golang, Ruby, Python, or similar).
- Proven ability to diagnose and monitor performance and reliability issues across the stack.
- Expertise in Kubernetes.
- Relevant industry certifications, such as the Site Reliability Engineering (SRE) Foundation certification.
- Proven experience working with cloud-native infrastructure (Azure Cloud Services, AWS, or GCP).
- Experience working with observability and incident management tools (Datadog, OpsGenie, PagerDuty).
- Experience scripting operating system tasks with Infrastructure as Code.
- Impeccable communication skills.
- Ability to problem-solve in a fast-paced, high-stakes environment.