EverCommerce
EverHealth - Lead DevOps Engineer
Remote Lead DevOps Engineer position with a clear candidate location fit.
Published: June 19, 2026
Eligible countries: 1 country accepted
Seniority signal: Senior
Work model: Remote
Accepted candidate locations
United States
Job summary
EverHealth - Lead DevOps Engineer
Requirements and responsibilities
Job content extracted into sections for faster review.
Key Responsibilities
Cloud Infrastructure & Automation
- Design, deploy, and manage AWS ECS-based containerized workloads using Terraform and Spacelift.
- Build and optimize self-service infrastructure platforms with Backstage, enabling development teams to deploy services autonomously.
- Implement best practices for observability, security, and reliability across cloud environments.
Continuous Integration & Deployment (CI/CD)
- Develop and manage GitHub Actions workflows for automated testing, security scanning, and deployments.
- Standardize CI/CD pipelines and release automation processes across teams.
- Improve deployment strategies to ensure zero-downtime deployments and infrastructure immutability.
Configuration Management & Orchestration
- Automate server and container configurations using Ansible.
- Develop repeatable, scalable, and version-controlled infrastructure patterns.
- Support developers with automated service provisioning and self-service tools.
Security, Compliance, & Governance
- Embed security and compliance controls into infrastructure and workflows.
- Implement role-based access control (RBAC), policy enforcement, and infrastructure security best practices.
- Ensure auditability and traceability in infrastructure changes using GitOps methodologies.
Monitoring, Logging & Incident Response
- Implement observability solutions, including logging, monitoring, and alerting for platform services.
- Define SLAs, SLOs, and on-call runbooks to ensure high availability and reliability.
- Support production and non-production environments through proactive incident resolution and root cause analysis.
Required Skills & Experience
- Proven experience in designing, migrating, and managing AWS ECS-based containerized environments.
- Deep expertise in Terraform for IaC, with experience in Spacelift.io or similar policy-as-code automation tools.
- Hands-on experience with GitHub Actions for CI/CD automation.
- Strong knowledge of Backstage.io for developer portal and self-service infrastructure.
- Experience with Ansible for configuration management and automation.
- Self-service and everything-as-code mindset – experience designing repeatable, fully automated infrastructure patterns.
- Strong understanding of networking, IAM policies, secrets management, and cloud security best practices.
- Experience with monitoring and logging solutions (e.g., CloudWatch, New Relic).
- Ability to troubleshoot performance, availability, and scaling issues in containerized and cloud-native environments.
Nice to haves
- Experience with service mesh technologies (e.g., Istio, Linkerd, or AWS App Mesh).
- Familiarity with FinOps and cost optimization in AWS environments.
- Knowledge of SRE principles, SLAs, and error budgets.
- Experience with policy-as-code tools like Open Policy Agent (OPA) or HashiCorp Sentinel.
Benefits and Perks:
- Flexibility to work where/how you want within your country of employment – remote
- Robust health and wellness benefits, including an annual wellness stipend
- 401(k) with up to a 4% match and immediate vesting
- Flexible and generous time off (FTO)
- Employee Stock Purchase Program
Location eligibility
Candidates should apply only when their profile's country is listed here.
Hiring flow
WithMira shows the job and then sends candidates to the company's application page.
1. Check the job fit, stack, and location eligibility on WithMira.
2. Open the company's application page through the tracked link.
3. Save the job or subscribe to similar opportunities before leaving.