EverCommerce
EverHealth - Lead DevOps Engineer
Remote Lead DevOps Engineer role with a clear candidate location fit.
Published: 19 Jun 2026
Eligible countries: 1 country accepted
Seniority signal: Senior
Work model: Remote
Accepted candidate locations
United States
Role summary
EverHealth - Lead DevOps Engineer
Requirements and responsibilities
Role content extracted into sections for faster review.
Key Responsibilities
Cloud Infrastructure & Automation
- Design, deploy, and manage AWS ECS-based containerized workloads using Terraform and Spacelift.
- Build and optimize self-service infrastructure platforms with Backstage, enabling development teams to deploy services autonomously.
- Implement best practices for observability, security, and reliability across cloud environments.
Continuous Integration & Deployment (CI/CD)
- Develop and manage GitHub Actions workflows for automated testing, security scanning, and deployments.
- Standardize CI/CD pipelines and release automation processes across teams.
- Improve deployment strategies to ensure zero-downtime deployments and infrastructure immutability.
Configuration Management & Orchestration
- Automate server and container configurations using Ansible.
- Develop repeatable, scalable, and version-controlled infrastructure patterns.
- Support developers with automated service provisioning and self-service tools.
Security, Compliance, & Governance
- Embed security and compliance controls into infrastructure and workflows.
- Implement role-based access control (RBAC), policy enforcement, and infrastructure security best practices.
- Ensure auditability and traceability in infrastructure changes using GitOps methodologies.
Monitoring, Logging & Incident Response
- Implement observability solutions, including logging, monitoring, and alerting for platform services.
- Define SLAs, SLOs, and on-call runbooks to ensure high availability and reliability.
- Support production and non-production environments through proactive incident resolution and root cause analysis.
Required Skills & Experience
- Proven experience in designing, migrating, and managing AWS ECS-based containerized environments.
- Deep expertise in Terraform for IaC, with experience in Spacelift.io or similar policy-as-code automation tools.
- Hands-on experience with GitHub Actions for CI/CD automation.
- Strong knowledge of Backstage.io for developer portal and self-service infrastructure.
- Experience with Ansible for configuration management and automation.
- Self-service and everything-as-code mindset, with experience designing repeatable, fully automated infrastructure patterns.
- Strong understanding of networking, IAM policies, secrets management, and cloud security best practices.
- Experience with monitoring and logging solutions (e.g., CloudWatch, New Relic).
- Ability to troubleshoot performance, availability, and scaling issues in containerized and cloud-native environments.
Nice to haves
- Experience with service mesh technologies (e.g., Istio, Linkerd, or AWS App Mesh).
- Familiarity with FinOps and cost optimization in AWS environments.
- Knowledge of SRE principles, SLAs, and error budgets.
- Experience with policy-as-code tools like Open Policy Agent (OPA) or HashiCorp Sentinel.
Benefits and Perks:
- Flexibility to work remotely, where and how you want, within your country of employment
- Robust health and wellness benefits, including an annual wellness stipend
- 401(k) with up to a 4% match and immediate vesting
- Flexible and generous time off (FTO)
- Employee Stock Purchase Program