Dropbox
Staff Site Reliability Engineer, Production Engineering
Remote Site Reliability Engineering role with clear candidate location fit.
Posted: Jun 5, 2026
Eligible countries: 2 accepted countries
Seniority signal: Senior
Work setting: Remote
Accepted candidate locations
Canada, USA
Requirements and responsibilities
Responsibilities
- Define and evolve Dropbox’s company-wide technical reliability strategy to support the changing engineering environment created by AI-assisted and agentic software development.
- Set multi-year reliability goals, standards, and roadmaps across observability, debugging, incident management, service health, and operational readiness.
- Lead cross-team initiatives that reduce reliability risk as software delivery velocity, pull request volume, service complexity, and incident volume increase.
- Partner with engineering leaders and platform teams to improve monitoring, alerting, debugging, SLOs, SLAs, and incident response systems at company scale.
- Identify emerging reliability risks introduced by AI-enabled development workflows and design scalable systems, processes, and guardrails to mitigate them.
- Provide technical leadership and mentorship to engineers across teams, raising engineering quality, reliability judgment, and operational excellence.
- Drive clear communication and alignment with senior stakeholders on reliability priorities, tradeoffs, risks, and execution progress.
Requirements
- BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent technical experience.
- 12+ years of experience in software engineering, site reliability engineering, infrastructure engineering, or related technical roles.
- Proven ability to define and deliver multi-year, multi-team reliability, infrastructure, or platform strategies with measurable business and customer impact.
- Deep experience with distributed systems, production operations, observability, incident response, SLOs/SLAs, debugging, and reliability risk management.
- Demonstrated ability to diagnose complex technical problems, debug production systems, automate operational workflows, and design resilient software components.
- Experience influencing engineering roadmaps across multiple teams and making technical decisions that optimize for the broader engineering organization.
- Strong communication and collaboration skills, with the ability to align cross-functional stakeholders through ambiguity and drive execution across teams.
Preferred Qualifications
- Experience adapting reliability strategies, developer tooling, or operational processes for AI-assisted software development workflows.
- Experience building or scaling observability, debugging, incident management, or developer productivity platforms for large engineering organizations.
- Experience leading reliability improvements in environments with high deployment velocity, complex service dependencies, and large-scale production systems.
- Track record of mentoring senior engineers, setting technical standards, and spreading reliability best practices through documentation, reviews, talks, or architecture guidance.
- Familiarity with AI-enabled tooling, agentic development workflows, or operational risks introduced by rapid automation in the software development lifecycle.
Hiring flow
WithMira shows the role, then sends candidates to the company application.
1. Check role fit, stack, and location eligibility in WithMira.
2. Open the company application page from the tracked apply link.
3. Save the role or subscribe for similar opportunities before leaving.