Role overview

Lead Data Engineer (Databricks)

Requirements and responsibilities

Readable role content extracted into sections for faster review.

Details

  • 8+ years of hands-on data engineering experience, designing and delivering production-grade data platforms
  • Expert-level in Apache Spark, including runtime internals, performance tuning, and optimisation, you understand what's happening under the hood and use that knowledge to build pipelines that perform at scale.
  • You write clean, production-quality code in Python, with Scala experience a strong plus for deeper Spark and performance-critical work.
  • You've built and productionalized solutions on the platform, including Delta Lake architectures, Unity Catalog governance, and Databricks Workflows. Databricks certification is a strong plus.
  • You have real, working experience across at least two major cloud platforms (AWS, Azure, GCP) with genuine depth in at least one, including cloud-native services such as AWS Redshift/Glue/S3, Azure Synapse/Data Factory/ADLS, or Google BigQuery/Dataflow/GCS.
  • You've led data engineering projects end-to-end in a client-facing or consulting context, managing technical scope, navigating stakeholder expectations, and delivering against timelines without cutting corners.
  • You bring a DataOps mindset: CI/CD for data pipelines, automated testing, observability, and infrastructure-as-code are standard practice for you, not afterthoughts.
  • Your experience spans ETL/ELT design, data warehousing, lakehouse architecture, and data modelling, and you know when to apply each approach.
  • Your communication skills allow you to engage technical and non-technical stakeholders equally well, from a client's CTO to a junior engineer on your team.
  • Lead Client Data Engagements: Serve as the senior technical lead on client projects. Own the architecture, guide the build, manage delivery risk, and ensure the solution shipped matches what was promised.
  • Build and Productionize Data Solutions: Design and implement scalable, reliable data pipelines and lakehouse architectures on Databricks and cloud platforms. You're hands-on keyboard, you write code, review code, and set the engineering standard for the engagement.
  • Architect for Scale and Reliability: Translate complex client requirements into robust technical designs, reference architectures, and data models built to last in production.
  • Drive Technical Delivery: Manage technical scope and timelines, identify blockers early, and partner with project managers and client stakeholders to keep engagements on track.
  • Mentor Data Engineers: Coach junior and mid-level engineers through hands-on pairing, code review, and direct feedback, raising the floor for everyone around you.
  • Promote Knowledge Sharing: Contribute technical blogs, reference architectures, and internal guides that reflect hard-won lessons from real client work.
  • Champion DataOps Practices: Establish and enforce modern data engineering standards across engagements, automated testing, pipeline observability, version control, CI/CD, and documentation.
Similar roles

Keep a backup shortlist.

Browse stack
FocusData EngineeringRole area
Seniority signalSeniorCandidate level
StackAWS, Azure, CI/CDPrimary skills
Location1 accepted countryEligibility

Stack

Use these tags to compare similar remote roles.

Location eligibility

Candidates should apply only when their profile country is listed here.

Your profileCountry not setSign in to check your country against this role.

Hiring flow

WithMira shows the role, then sends candidates to the company application.

1Check role fit, stack, and location eligibility in WithMira.
2Open the company application page from the tracked apply link.
3Save the role or subscribe for similar opportunities before leaving.
Apply on company siteCompany siteOpen link