Role overview

SRE Lead

Requirements and responsibilities

Requirements

  • 5+ years of experience in DevOps/SRE roles;
  • Strong experience with AWS cloud services;
  • Advanced knowledge of Kubernetes and container orchestration;
  • Solid understanding of DevOps principles and practices;
  • Experience with Helm chart development and maintenance;
  • Strong proficiency with monitoring, logging, alerting, cloud, platform, OS, CI/CD, repository storage, and management tools;
  • Strong scripting skills (Bash, Python, or similar);
  • Excellent problem-solving and communication skills.

Responsibilities

  • Manage SRE teams;
  • Ensure the technical excellence of teammates;
  • Implement and maintain monitoring solutions using Prometheus, VictoriaMetrics, and Grafana to identify and address performance issues proactively;
  • Manage logging infrastructure using Fluentd, Fluent Bit, Elasticsearch, and Kibana, ensuring efficient log collection, analysis, and visualization;
  • Configure and manage alerting systems such as Alertmanager and Opsgenie to respond promptly to critical incidents and minimize downtime;
  • Control utilization of AWS cloud services, and design, deploy, and manage scalable, highly available infrastructure;
  • Apply expertise in AWS services such as EC2, VPC, CloudWatch, and IAM to ensure optimal performance and security of our cloud-based applications;
  • Deploy and manage containerized applications using Kubernetes, Docker, and Helm, ensuring smooth orchestration and scalability;
  • Administer the Debian operating system proficiently, troubleshooting and optimizing system performance;
  • Implement and manage CI/CD pipelines using Jenkins and ArgoCD for seamless software delivery and infrastructure automation;
  • Manage code repositories using GitLab and Git, ensuring version control and collaboration among team members;
  • Collaborate with cross-functional teams using Jira and Confluence for effective project management and knowledge sharing.

Nice to Have

  • Experience with other cloud providers (GCP, Azure);
  • Security certifications (AWS, CKS, etc.);
  • Experience with service mesh technologies.

Focus (role area): Site Reliability Engineering
Seniority signal (candidate level): Senior
Stack (primary skills): AWS, Azure, CI/CD
Location (eligibility): 37 accepted countries

Stack

Use these tags to compare similar remote roles.

Location eligibility

Candidates should apply only when their profile country is listed here.

Hiring flow

WithMira shows the role, then sends candidates to the company application.

1. Check role fit, stack, and location eligibility in WithMira.
2. Open the company application page from the tracked apply link.
3. Save the role or subscribe for similar opportunities before leaving.