Senior Site Reliability Engineer – Global API & Data Infrastructure

Jobgether · Ireland

Senior · 🇬🇧 English
Terraform · Ansible · Python · Go · GitOps · Cloud infrastructure · Observability · Incident response · Chaos engineering · FinOps

Job description

About the role

We are looking for a Senior Site Reliability Engineer to own the reliability, scalability and performance of the large‑scale API and data infrastructure that powers worldwide reuse of Wikimedia content. You will work in a fully distributed, globally collaborative environment alongside experienced SREs, software engineers and platform teams.

Key responsibilities

  • Define, track and improve SLOs, SLIs and error budgets for critical services.
  • Design and enhance observability systems, including metrics, logging and distributed tracing.
  • Participate in incident response, on‑call rotations and post‑incident reviews.
  • Build and maintain CI/CD and GitOps pipelines for secure, automated deployments.
  • Implement infrastructure‑as‑code using Terraform, Ansible and other automation‑first practices.
  • Design, operate and optimise scalable cloud infrastructure across production environments.
  • Drive capacity planning, performance optimisation and resilience testing, including chaos engineering.
  • Improve developer experience through self‑service infrastructure and streamlined workflows.
  • Collaborate with security, software and release engineering teams to embed reliability and security best practices.
  • Apply FinOps principles to optimise cost without compromising availability.
  • Mentor peers and promote best practices in SRE, automation and systems reliability.

Required profile

  • 5+ years of experience in SRE, DevOps or infrastructure engineering.
  • Strong experience with infrastructure‑as‑code tools such as Terraform and/or Ansible.
  • Proficiency in at least one programming language (Python, Go or similar).
  • Hands‑on experience with large‑scale distributed systems and cloud environments.
  • Demonstrated ability to design observability, incident response and reliability strategies.
  • Collaborative mindset with a focus on automation and continuous improvement.

Required skills

  • Terraform
  • Ansible
  • Python
  • Go
  • CI/CD pipelines
  • GitOps
  • Cloud infrastructure (AWS/GCP/Azure)
  • Observability (metrics, logging, tracing)
  • Incident response and on‑call duties
  • Chaos engineering
  • FinOps cost optimisation

Frequently asked questions

The salary is not publicly disclosed by the recruiter. You can apply and negotiate directly with Jobgether.
Click "Apply now" at the top of the page. You can import your CV in one click: Jobiglo automatically extracts your information and applies for you.

Published 3 days ago

Expires 1 month from now
