Role overview

Senior Site Reliability Engineer, CCIP

Requirements and responsibilities

Readable role content extracted into sections for faster review.

About the Role

  • Improve deployment safety and increase delivery velocity by advancing production engineering practices.
  • Establish distributed tracing across the platform to improve observability and accelerate incident investigation.
  • Eliminate operational toil through automation that increases engineering efficiency and platform reliability.
  • Drive adoption of meaningful SLOs, SLIs, and error budgets that guide engineering decisions and improve service health.
  • Increase platform scalability and operational readiness as CCIP continues to grow.
  • Strengthen Chainlink's reputation through highly available production systems while reducing operational overhead.

Requirements

  • Demonstrated experience in Site Reliability Engineering, Production Engineering, or a similar role operating large-scale distributed systems.
  • Deep expertise defining, implementing, and driving adoption of SLOs, SLIs, and error budgets across engineering organizations.
  • Built and operated production Kubernetes environments supporting critical services.
  • Applied OpenTelemetry to improve observability across distributed systems.
  • Experience improving the reliability, scalability, and operability of production infrastructure.

Preferred Requirements

  • Demonstrated technical leadership influencing reliability practices across engineering teams.
  • Experience performing capacity planning and performance tuning for high-throughput distributed services.
  • Previous experience working on Web3 infrastructure or within a crypto-native engineering organization.
  • Applied chaos engineering or fault-injection techniques to improve production resilience.
  • Partnered with software engineering teams to conduct production-readiness reviews before service launches.
  • Experience leading on-call operations, including defining rotations, escalation policies, and improving alert quality.
Similar roles

Keep a backup shortlist.

Browse stack
FocusSenior Site Reliability EngineerRole area
Seniority signalSeniorCandidate level
StackKubernetesPrimary skills
Location8 accepted countriesEligibility

Stack

Use these tags to compare similar remote roles.

Location eligibility

Candidates should apply only when their profile country is listed here.

Hiring flow

WithMira shows the role, then sends candidates to the company application.

1Check role fit, stack, and location eligibility in WithMira.
2Open the company application page from the tracked apply link.
3Save the role or subscribe for similar opportunities before leaving.
Apply on company siteCompany siteOpen link