Role overview

[Remote] Principal Site Reliability Developer- USC Required

Requirements and responsibilities

Readable role content extracted into sections for faster review.

Details

  • Lead reliability engineering efforts for large-scale cloud-native healthcare platforms
  • Design and operate highly available distributed systems supporting AI-driven services
  • Build automation, self-healing systems, and intelligent operational tooling
  • Drive improvements across scalability, observability, deployment safety, and incident response
  • Lead complex production investigations and engineer durable long-term fixes
  • Develop AIOps capabilities including anomaly detection, predictive scaling, and automated remediation
  • Partner with software and platform teams to improve architecture, resiliency, and operational readiness
  • Influence engineering standards across Kubernetes, CI/CD, infrastructure as code, and cloud operations
  • Mentor engineers and help raise operational engineering maturity across the organization
Similar roles

Keep a backup shortlist.

Browse stack
FocusSite Reliability EngineeringRole area
Seniority signalSeniorCandidate level
StackCI/CD, KubernetesPrimary skills
Location1 accepted countryEligibility

Stack

Use these tags to compare similar remote roles.

Location eligibility

Candidates should apply only when their profile country is listed here.

Your profileCountry not setSign in to check your country against this role.

Hiring flow

WithMira shows the role, then sends candidates to the company application.

1Check role fit, stack, and location eligibility in WithMira.
2Open the company application page from the tracked apply link.
3Save the role or subscribe for similar opportunities before leaving.
Apply on company siteCompany siteOpen link