Role overview

Principal Systems Engineer

Requirements and responsibilities

Key Responsibilities

  • Maintain and improve the reliability, availability, and performance of our production environments.
  • Lead the investigation and resolution of complex application, database, and infrastructure issues.
  • Participate in incident management, conduct root cause analysis (RCA), and contribute to post-incident reviews to prevent future occurrences.
  • Define and measure Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to meet our service commitments.
  • Develop proactive monitoring and alerting strategies to identify and resolve issues before they impact customers.
  • Automate operational tasks using scripting and Infrastructure-as-Code (IaC) to improve efficiency.
  • Partner with engineering and cloud teams to refine deployment, monitoring, and support processes.
  • Provide technical leadership during major incidents and act as a key escalation point for critical issues.

Experience:

  • 7+ years of experience supporting enterprise applications, infrastructure, or cloud environments.
  • Monitoring & Observability: Strong experience with monitoring and APM tools such as LogicMonitor, AppDynamics, Azure Monitor, SentryOne, Dynatrace, Datadog, or New Relic.
  • Microsoft Stack: Deep knowledge of Windows Server administration, IIS, .NET applications, Windows Clustering, MSMQ, Event Logs, and PerfMon.
  • Database Skills: Strong SQL Server experience, including performance tuning, query optimization, blocking analysis, and Always On Availability Groups.
  • Cloud & Networking: Experience with Azure cloud environments and a solid understanding of networking fundamentals (DNS, TCP/IP, load balancing, firewalls).
  • ITSM & ITIL: Familiarity with ServiceNow (or other ITSM platforms) and ITIL principles.

Preferred Skills:

  • Scripting with PowerShell, Python, or similar languages.
  • Infrastructure as Code (Terraform, ARM Templates, Bicep).
  • CI/CD pipelines and deployment automation (Azure DevOps, GitHub Actions).
  • Experience with Kubernetes and containerized workloads.
  • Experience implementing SLOs, SLIs, and Error Budgets.
  • Experience in a healthcare technology or patient care environment.

Education:

  • Bachelor's Degree in Computer Science, Information Technology, or Engineering is preferred; equivalent professional experience will be considered.

Working Arrangements

  • This is a remote position open to candidates within the United States.
  • You will participate in an on-call rotation to support our 24x7 healthcare environment.
  • Occasional after-hours work is required for activations, upgrades, and major incidents.

Travel

  • Travel is not a requirement for this role.