Role overview

Senior Site Reliability Engineer

Requirements and responsibilities

Core Responsibilities

  • Support production systems on platforms such as ESXi, Azure, AWS, and GCP
  • Use configuration management tools, including Ansible and Puppet, for scalable and repeatable systems management
  • Design, develop, and maintain automation frameworks, scripts, and operational tooling to improve scalability, reliability, and operational efficiency across infrastructure and platform services.
  • Configure, maintain, patch, and troubleshoot Linux operating systems, drawing on basic knowledge of Windows operating systems where needed
  • Ensure compliance with security and data handling policies to meet PCI, HIPAA, and other standards
  • Develop and maintain Infrastructure-as-Code (IaC) solutions using tools such as Terraform, Ansible, and Puppet to support repeatable and standardized deployments.
  • Collaborate with peers as an accountable and supportive member of Amwell technology teams
  • Participate in a 24/7 on-call rotation and scheduled maintenance tasks

Qualifications

  • 5 or more years of experience managing Linux-based systems; certifications are a plus
  • Strong experience with Infrastructure-as-Code and configuration management technologies including Terraform, Ansible, Puppet, or similar automation frameworks.
  • Experience building automation workflows for system provisioning, patch management, monitoring, configuration management, incident response, and operational remediation.
  • Experience with compute and storage on on-premises and cloud-based virtualization platforms, such as ESXi, Azure, AWS, and GCP
  • Experience with the Elasticsearch/Logstash/Kibana (ELK Stack) analytics engine
  • Experience managing Identity and Authentication solutions including LDAP, Active Directory, and Multi-Factor Authentication
  • Strong scripting and software development skills using languages such as Python, Bash, or PowerShell, with experience building reusable automation tooling and operational integrations in hybrid cloud and on-premises environments.
  • Experience developing monitoring, alerting, and self-healing automation solutions.
  • Solid foundation in TCP/IP networking concepts
  • Experience supporting large-scale production environments with an automation-first operational mindset.

Benefits

  • Flexible Personal Time Off (Vacation time)
  • 401(k) match
  • Competitive healthcare, dental and vision insurance plans
  • Paid Parental Leave (Maternity and Paternity leave)
  • Employee Stock Purchase Program
  • Free access to Amwell’s Telehealth Services, SilverCloud and The Clinic by Cleveland Clinic’s second opinion program
  • Free Subscription to the Calm App
  • Tuition Assistance Program
  • Pet Insurance

Role area: Site Reliability Engineering
Candidate level: Senior
Primary skills: AWS, Azure, GCP
Location eligibility: 1 accepted country
