Kitman Labs

Data Engineer

Remote Data Engineering role with a clear candidate location fit.

Published: 2 Jul 2026

Eligible countries: 2 countries accepted

Seniority signal: Senior

Work model: Remote

Accepted candidate locations

Ireland, United Kingdom

AWS, CI/CD, Docker, GCP, Python, SQL

Source freshness: 2 Jul 2026

Location fit: 2 countries accepted

Stack match: AWS, CI/CD

Application path: Company site

Role summary

Requirements and responsibilities

What You'll Work On

Analytics Engineering & Reporting

Build and maintain BigQuery data models using Dataform, following medallion architecture patterns (Bronze/Silver/Gold)
Contribute to Looker dashboards and LookML models, working alongside senior engineers and analysts
Write performant, well-structured SQL for large-scale transformations in BigQuery
Implement data quality checks using Dataform assertions and automated alerting
Support data observability across the warehouse — monitoring pipeline health, data freshness, and anomaly detection

Data Pipelines & Ingestion

Build and maintain robust Python data pipelines with testing, linting, and CI/CD integration
Work with orchestration tooling (Cloud Composer / Airflow) to schedule and monitor workflows
Develop familiarity with CDC concepts and event-driven ingestion patterns (Datastream, Pub/Sub)
Containerise workloads with Docker for deployment on Cloud Run or similar GCP services

Data Science Collaboration

Support Data Scientists in moving work from notebook to production pipeline
Contribute to feature pipelines and data preparation for ML workloads
Help bridge the gap between research prototypes and scalable, maintainable code

What We're Looking For

SQL proficiency — comfortable writing complex, performant queries against large datasets in BigQuery
Dataform experience — or strong dbt experience with willingness to work in Dataform; understanding of modular, version-controlled data transformation
Python with an engineering mindset — clean, tested, linted code; comfortable with Git and CI/CD workflows
GCP familiarity — hands-on experience with BigQuery is essential; broader GCP exposure (Cloud Storage, Cloud Run, Pub/Sub, Datastream) is a strong advantage
Orchestration experience — hands-on with Cloud Composer, Airflow, or a comparable tool
Data modelling fundamentals — dimensional modelling, Kimball principles, or medallion architecture patterns
Docker basics — able to containerise and deploy data workloads
Collaborative and communicative — able to translate business requirements into data models and work effectively with Analytics, Product, and Data Science stakeholders
Pragmatic approach to AI tooling — comfortable using AI-assisted development to improve productivity and code quality

Nice to haves

Looker / LookML experience
Familiarity with CDC concepts and tools (Datastream, Debezium)
Exposure to ML frameworks or MLOps tooling (scikit-learn, MLflow, Vertex AI)
AWS experience as a complement (Redshift, Glue, RDS); we value engineers who can draw on a cross-cloud perspective
Curiosity about sports performance data
