Kitman Labs
Data Engineer
Remote Data Engineering role with a clear candidate location fit.
Published: 2 Jul 2026
Eligible countries: 2 countries accepted
Seniority signal: Senior
Work model: Remote
Accepted candidate locations
Ireland, United Kingdom
Job summary
Data Engineer
Requirements and responsibilities
What You'll Work On
Analytics Engineering & Reporting
- Build and maintain BigQuery data models using Dataform, following medallion architecture patterns (Bronze/Silver/Gold)
- Contribute to Looker dashboards and LookML models, working alongside senior engineers and analysts
- Write performant, well-structured SQL for large-scale transformations in BigQuery
- Implement data quality checks using Dataform assertions and automated alerting
- Support data observability across the warehouse — monitoring pipeline health, data freshness, and anomaly detection
Data Pipelines & Ingestion
- Build and maintain robust Python data pipelines with testing, linting, and CI/CD integration
- Work with orchestration tooling (Cloud Composer / Airflow) to schedule and monitor workflows
- Develop familiarity with CDC concepts and event-driven ingestion patterns (Datastream, Pub/Sub)
- Containerise workloads with Docker for deployment on Cloud Run or similar GCP services
Data Science Collaboration
- Support Data Scientists in moving work from notebook to production pipeline
- Contribute to feature pipelines and data preparation for ML workloads
- Help bridge the gap between research prototypes and scalable, maintainable code
What We're Looking For
- SQL proficiency — comfortable writing complex, performant queries against large datasets in BigQuery
- Dataform experience — or strong dbt experience with willingness to work in Dataform; understanding of modular, version-controlled data transformation
- Python with an engineering mindset — clean, tested, linted code; comfortable with Git and CI/CD workflows
- GCP familiarity — hands-on experience with BigQuery is essential; broader GCP exposure (Cloud Storage, Cloud Run, Pub/Sub, Datastream) is a strong advantage
- Orchestration experience — hands-on with Cloud Composer, Airflow, or a comparable tool
- Data modelling fundamentals — dimensional modelling, Kimball principles, or medallion architecture patterns
- Docker basics — able to containerise and deploy data workloads
- Collaborative and communicative — able to translate business requirements into data models and work effectively with Analytics, Product, and Data Science stakeholders
- Pragmatic approach to AI tooling — comfortable using AI-assisted development to improve productivity and code quality
Nice to haves
- Looker / LookML experience
- Familiarity with CDC concepts and tools (Datastream, Debezium)
- Exposure to ML frameworks or MLOps tooling (scikit-learn, MLflow, Vertex AI)
- AWS experience as a complement (Redshift, Glue, RDS) — we value engineers who can draw on cross-cloud perspective
- Curiosity about sports performance data