Aroundhome

Senior Data Engineer (Modern Data Platform & AI) (all genders) | Berlin, hybrid

Remote Data Engineering role with a clear candidate location fit.

Published: June 7, 2026

Eligible countries: 37 countries accepted

Seniority signal: Senior

Work model: Remote

Accepted candidate locations

Albania, Austria, Belgium, Bulgaria, Croatia, Cyprus, +31 more

AWS, CI/CD, Python, Snowflake, Spark, SQL

Source freshness: June 7, 2026

Stack match: AWS, CI/CD

Application path: Company website

WithMira keeps this listing for discovery. The application continues on the company's website.

Job summary

Senior Data Engineer (Modern Data Platform & AI) (all genders) | Berlin, hybrid

Requirements and responsibilities

Data Modeling & Transformation: Design the data models and transformations needed to curate raw data. Develop, optimize, and maintain existing data models, pipelines, and transformations to support analytics, reporting, and AI use cases, including but not limited to curating, transforming, annotating, and modeling data.
Data Platform Architecture: Architect and contribute to implementing a scalable, modern data platform, including a data lakehouse or warehouse, to support real-time/near-real-time data flows from Kafka to downstream consumers. Optimize ETL/ELT pipelines using tools like dbt, Spark, or Airflow, bridging upstream (e.g., Debezium, MSK) and downstream processes. Evaluate and integrate new technologies to support a hybrid monolith-microservices architecture and ML and AI enablement. Ensure seamless migrations and minimal disruption during platform evolution.
Real-Time Data Integration: Build and optimize real-time data pipelines using Kafka, Spark, and Delta Live Tables.
Data Governance & Quality: Support the team lead in establishing and enforcing data governance frameworks, including data lineage, quality standards, catalogue, metadata management, SSOT for business glossaries/CBC terms, and policies to ensure reliable reporting. Ensure the existence of, or adaptation to, full Data Life Cycle Management (DLCM) and end-to-end testing.
AI/ML Enablement: Collaborate with the team to integrate AI/ML capabilities, such as feature engineering and model serving, to accelerate data products for market penetration and operational efficiency, operationalize ML models, and integrate AI into business processes.
Knowledge Sharing: Mentor the team on best practices, modern tools (e.g., Databricks, Snowflake, AI adaptation and integrations like Cursor/CodeRabbit), and cloud-native scalability. Last but not least, foster a culture of innovation and continuous improvement.
Stakeholder Collaboration: Collaborate with Product Analytics, domain teams, and the business to deliver data solutions that drive value and are aligned with business needs.

Skills

Master's degree in Computer Science, Data Engineering, or a related field (or equivalent experience).
10+ years of experience in data engineering, with 5+ years in senior roles focused on modern architectures.
Excellent communication and collaboration skills, the ability to drive change and influence stakeholders, and a passion for mentoring, coaching, and sharing knowledge.
Proven expertise in designing, developing, and maintaining data lakehouses/DWH (e.g., Databricks Delta Lake, Snowflake) and transformations (e.g., dbt, SQL/Python/Spark).
Strong experience with cloud platforms and tooling such as AWS (S3, Athena, MSK/Kafka) and Terraform, and with real-time streaming (e.g., Kafka, Spark Structured Streaming, Flink).
Hands-on knowledge of data governance tools (e.g., Unity Catalog, Collibra) for lineage, quality, catalogs, and SSOT.
Familiarity with AI/ML pipelines and MLOps (e.g., MLflow, feature stores) and complex system integration within modern data technologies.
Proficiency in CI/CD for data, and tools like Git, Airflow, or dbt Cloud.
Experience with large-scale data modeling (Data Vault, dimensional, schema-on-read) and optimizing for self-service analytics.
