Role overview

Lead Data Engineer

Requirements and responsibilities

Readable role content extracted into sections for faster review.

Details

  • Lead the technical design and implementation of CMG's Medallion 2.0 lakehouse architecture — bronze ingestion, silver transformation, and gold domain layers — built on GCS and Databricks (Delta Lake), with clear data contracts at each boundary
  • Design and manage data pipelines using Astro (Airflow), PySpark, and Delta Live Tables, ensuring reliability and scalability across ingestion and transformation layers
  • Govern the lakehouse using Databricks Unity Catalog — managing access controls, data lineage, and schema enforcement across domains
  • Apply domain-driven design principles to partition and model data domains (e.g., royalty, asset, artist, distribution)
  • Collaborate with the analytics team to ensure the gold layer reflects real business needs — reducing workarounds
  • Coordinate with external vendors (e.g., DataArt) and internal stakeholders across DevOps, product, and analytics
  • Proactively identify architectural risks, data quality issues, and dependency blockers with proposed resolutions
  • Maintain clear, impact-first documentation and status updates for both technical and non-technical stakeholders
  • Other duties as assigned
  • 4+ years of data engineering experience, with at least 1–2 years focused on data platform or lakehouse architecture
  • Hands-on experience with Databricks — including Delta Lake, PySpark, and ideally Unity Catalog
  • Experience with GCS or equivalent cloud object storage as a lakehouse foundation layer
  • Hands-on experience with domain-driven design applied to data modeling
  • Strong command of SQL and at least one transformation framework (dbt preferred)
  • Experience with medallion or lakehouse architectures (bronze/silver/gold or equivalent)
  • Familiarity with GCP-native tooling — Pub/Sub, Dataflow, or Dataplex a plus
  • Excellent written communication — able to write design docs non-engineers can understand and status updates executives can act on
  • Demonstrated ability to work independently in ambiguous environments
  • Track record of flagging risks early with proposed solutions
  • $120,000 - $150,000 CAD per year
  • The final compensation within this range will be determined based on the candidate’s experience, skills, and overall fit for the role.
Similar roles

Keep a backup shortlist.

Browse stack
FocusLead Data EngineerRole area
Seniority signalSeniorCandidate level
StackGCP, SQLPrimary skills
Location1 accepted countryEligibility

Stack

Use these tags to compare similar remote roles.

Location eligibility

Candidates should apply only when their profile country is listed here.

Your profileCountry not setSign in to check your country against this role.

Hiring flow

WithMira shows the role, then sends candidates to the company application.

1Check role fit, stack, and location eligibility in WithMira.
2Open the company application page from the tracked apply link.
3Save the role or subscribe for similar opportunities before leaving.
Apply on company siteCompany siteOpen link