Resumo da vaga

Senior Data Engineer

Requisitos e responsabilidades

Conteúdo da vaga extraído em seções para revisão mais rápida.

Requirements

  • Core Technical Competencies Experience: 5+ years of hands-on data engineering experience in a production environment. Languages: Strong proficiency in Python, SQL (complex queries, performance tuning), and PySpark/Apache Spark. Data Modeling: Expert knowledge of data modeling (3NF, Star, Snowflake Schema) and Lakehouse/Warehouse architectures. ETL/ELT & Orchestration: Proven experience building pipelines using tools like dbt, Airflow, Dagster, or native cloud orchestrators (Glue, Data Factory, Composer). Integrations: Experienced in integrating data from diverse sources: APIs, RDBMS/NoSQL databases, flat files, and streaming platforms (Kafka, Kinesis, Pub/Sub). Cloud Platform Expertise (Specialization-Specific) Candidates should demonstrate deep expertise in anyone of the following: Snowflake: SnowSQL, Streams, Tasks, Snowpark, and cost optimization. Databricks: Delta Lake, Unity Catalog, Delta Live Tables (DLT), and Spark optimization. GCP: BigQuery, Dataflow, Dataproc, Pub/Sub, and Cloud Functions. Azure: Synapse Analytics, Data Factory, Azure Databricks, and Stream Analytics. AWS: Redshift, S3, Lake Formation, Glue, and Lambda. Professional Practices SDLC & DevOps: Proficient in Git workflows, CI/CD pipelines (GitHub Actions, Azure DevOps, AWS CodePipeline), and IaC (Terraform/CloudFormation). Data Governance: Strong understanding of data quality, lineage, observability, security (RBAC, encryption), and compliance frameworks. Agile: Active experience in Agile/Scrum environments using Jira or Azure Boards. Mentorship: Ability to lead projects and provide technical guidance to junior/mid-level engineers. Responsibilities Architecture: Architect, design, and implement scalable, reliable data solutions and pipelines aligned with business analytics needs. Optimization: Manage and fine-tune cloud resources and workloads for maximum performance, reliability, and cost-efficiency. Data Transformation: Lead the development of ETL/ELT processes for both batch and real-time data processing. Collaboration: Partner with Product, Engineering, and Data Science teams to deliver effective, data-driven solutions. Governance & Quality: Promote and enforce best practices in data governance, security, and data quality frameworks. Mentorship: Provide technical leadership and mentorship to the team, ensuring architecture quality and best practices. Documentation: Maintain comprehensive documentation of data architectures, configurations, and workflows. Fusemachines is an Equal Opportunities Employer, committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or any other characteristic protected by applicable federal, state, or local laws. Originally posted on Himalayas
Vagas similares

Mantenha uma lista reserva.

Ver stack
FocoData EngineeringÁrea da vaga
Sinal de senioridadeSeniorNível do candidato
StackAWS, Azure, CI/CDSkills principais
Localização26 países aceitosElegibilidade

Stack

Use estas tags para comparar vagas remotas similares.

Elegibilidade de localização

Candidatos devem aplicar apenas quando o país do perfil estiver listado aqui.

Fluxo de contratação

O WithMira mostra a vaga e depois envia candidatos para a aplicação da empresa.

1Confira fit da vaga, stack e elegibilidade de localização no WithMira.
2Abra a página de aplicação da empresa pelo link rastreado.
3Salve a vaga ou assine oportunidades similares antes de sair.
Aplicar no site da empresaSite da empresaAbrir link