Jobiglo

No results.

Lead Data Engineer

Harvey Nash · Dublin

New
Hybrid Senior 🇬🇧 English
SQL Python PySpark Spark Databricks Snowflake Delta Lake Photon AWS Apache NiFi Airflow Databricks Workflows Unity Catalog OpenLineage Terraform CloudFormation Git CI/CD

Job description

About the role

We are looking for an experienced Lead Data Engineer to join a high‑impact team supporting financial institutions in Dublin. The role will drive the design, build and operation of a Databricks‑based lakehouse on AWS, enabling data scientists, investigators and product teams to detect criminal behaviour with confidence.

Key responsibilities

  • Own end‑to‑end design, development and optimisation of scalable Spark/PySpark pipelines on Databricks (batch and streaming).
  • Define and enforce Lakehouse/Medallion architecture standards (Bronze/Silver/Gold) including governance, lineage, quality SLAs and cost controls.
  • Architect and maintain secure AWS data infrastructure (S3, IAM, Glue, Lake Formation, KMS, Lambda, Step Functions, EKS/EC2).
  • Lead data ingestion using Apache NiFi, APIs, SFTP/FTPS and onboard diverse internal and external datasets.
  • Implement robust orchestration with Airflow, Databricks Workflows and Step Functions, ensuring observability and reliability.
  • Champion data quality, reliability and observability through SLIs/SLOs, alerting and runbooks.
  • Embed metadata and lineage (Unity Catalog, Glue, OpenLineage) for auditability and regulatory transparency.
  • Drive CI/CD and Infrastructure‑as‑Code practices (Terraform/CloudFormation) for data assets across environments.
  • Mentor engineers on Spark performance, Delta Lake optimisation and cost‑performance trade‑offs.
  • Collaborate with data science, product, security and compliance teams to deliver production‑grade data solutions.

Required profile

  • Expert‑level SQL with hands‑on experience in Databricks, Snowflake, Python and PySpark.
  • Proven production experience building and optimising large‑scale Spark pipelines (Delta Lake, Photon, cluster tuning).
  • Strong expertise in the AWS data ecosystem, including security, networking, encryption and cost optimisation.
  • Hands‑on experience with orchestration tools such as Airflow, Databricks Workflows and Step Functions.
  • Solid background in CI/CD, Git workflows and IaC (Terraform or CloudFormation).
  • Deep understanding of data governance, lineage and compliance (PII/PCI, retention, access controls).
  • Demonstrated ability to lead, mentor and influence technical and non‑technical stakeholders.
  • Pragmatic, delivery‑focused mindset with experience in incident management and on‑call readiness.

Required skills

  • SQL
  • Python
  • PySpark / Spark
  • Databricks
  • Snowflake
  • Delta Lake, Photon
  • AWS (S3, IAM, Glue, Lake Formation, KMS, Lambda, Step Functions, EKS, EC2)
  • Apache NiFi
  • Airflow
  • Databricks Workflows
  • Unity Catalog, OpenLineage
  • Terraform / CloudFormation
  • Git, CI/CD

Questions fréquentes

Le salaire n'est pas communiqué publiquement par le recruteur. Vous pouvez postuler et négocier directement avec Harvey Nash.
Cliquez sur "Postuler maintenant" en haut de la page. Vous pouvez importer votre CV en 1 clic — Jobiglo extrait automatiquement vos informations et postule pour vous.

Why are you reporting this job?

Thank you for your report. We will review this job.

Apply in 30 seconds

Enter your email to apply. An account will be created automatically.

By continuing, you accept our terms of use.

Already have an account? Login

ui.whatsapp_discuss_job

Published 2 weeks ago

Expires 1 month from now

11 views · 0 interested

Boost your chances

Upload your CV — we will match you with relevant openings.

Analyzing your CV...

Harvey Nash

Dublin