Job summary

Data Engineer

Requirements and responsibilities

What You'll Work On

Analytics Engineering & Reporting

  • Build and maintain BigQuery data models using Dataform, following medallion architecture patterns (Bronze/Silver/Gold)
  • Contribute to Looker dashboards and LookML models, working alongside senior engineers and analysts
  • Write performant, well-structured SQL for large-scale transformations in BigQuery
  • Implement data quality checks using Dataform assertions and automated alerting
  • Support data observability across the warehouse — monitoring pipeline health, data freshness, and anomaly detection

Data Pipelines & Ingestion

  • Build and maintain robust Python data pipelines with testing, linting, and CI/CD integration
  • Work with orchestration tooling (Cloud Composer / Airflow) to schedule and monitor workflows
  • Develop familiarity with CDC concepts and event-driven ingestion patterns (Datastream, Pub/Sub)
  • Containerise workloads with Docker for deployment on Cloud Run or similar GCP services

Data Science Collaboration

  • Support Data Scientists in moving work from notebook to production pipeline
  • Contribute to feature pipelines and data preparation for ML workloads
  • Help bridge the gap between research prototypes and scalable, maintainable code

What We're Looking For

  • SQL proficiency — comfortable writing complex, performant queries against large datasets in BigQuery
  • Dataform experience — or strong dbt experience with willingness to work in Dataform; understanding of modular, version-controlled data transformation
  • Python with an engineering mindset — clean, tested, linted code; comfortable with Git and CI/CD workflows
  • GCP familiarity — hands-on experience with BigQuery is essential; broader GCP exposure (Cloud Storage, Cloud Run, Pub/Sub, Datastream) is a strong advantage
  • Orchestration experience — hands-on with Cloud Composer, Airflow, or a comparable tool
  • Data modelling fundamentals — dimensional modelling, Kimball principles, or medallion architecture patterns
  • Docker basics — able to containerise and deploy data workloads
  • Collaborative and communicative — able to translate business requirements into data models and work effectively with Analytics, Product, and Data Science stakeholders
  • Pragmatic approach to AI tooling — comfortable using AI-assisted development to improve productivity and code quality

Nice to Haves

  • Looker / LookML experience
  • Familiarity with CDC concepts and tools (Datastream, Debezium)
  • Exposure to ML frameworks or MLOps tooling (scikit-learn, MLflow, Vertex AI)
  • AWS experience as a complement (Redshift, Glue, RDS); we value engineers who can draw on a cross-cloud perspective
  • Curiosity about sports performance data

Focus (job area): Data Engineering
Seniority signal (candidate level): Senior
Stack (main skills): AWS, CI/CD, Docker
Location (eligibility): 2 countries accepted
