Senior Cloud Platform Engineer (Azure/Kubernetes) - Remote Portugal

ABOUT THE OPPORTUNITY

Join a passionate technology consultancy where crafting exceptional software means building with people we love. We're seeking a Senior Azure Platform Engineer to architect and operate a modern, self-service Kubernetes platform that empowers hundreds of developers. This role offers you the opportunity to build a compelling internal Platform-as-a-Service (PaaS) product that enables application teams to ship software at enterprise scale quickly, safely, and efficiently, using cutting-edge technologies chosen for real value, not trends.

PROJECT & CONTEXT

You'll design and operate a large-scale PaaS for containerized workloads on Azure, serving as the Azure Subject Matter Expert within the cloud platform team. The project involves building reliability and fault tolerance into systems using AKS, Istio for service mesh, Flux for GitOps, and Crossplane for infrastructure management. You'll streamline developer workflows by maintaining CI/CD pipelines and automating infrastructure through Terraform, while implementing robust security standards with HashiCorp Vault and OPA Gatekeeper. Working closely with Product Managers and Engineers, you'll plan and execute the technical roadmap for this strategic platform. The role includes building monitoring and observability with ELK/EFK, Prometheus, Thanos, Grafana, and New Relic, so that systems are simple to diagnose and fix. Participation in out-of-hours support may be required to maintain platform reliability.

WHAT WE'RE LOOKING FOR (Required)

  • Extensive Azure expertise: Hands-on experience with Microsoft Azure in commercial, large-scale enterprise environments
  • AKS and cloud-native tooling: Strong experience with Azure Kubernetes Service (AKS), Helm, Istio service mesh, Flux for GitOps, and Crossplane for infrastructure orchestration
  • Infrastructure as Code mastery: Proven track record managing and automating enterprise infrastructure estates using Terraform
  • Programming proficiency: Strong skills in one or more languages such as Go, TypeScript, Python, or Bash
  • Observability and monitoring: Experience with monitoring and logging stacks including ELK/EFK, Prometheus, Thanos, Grafana, and New Relic
  • SRE principles: Solid understanding of Site Reliability Engineering and ITIL principles for operational excellence
  • Security implementation: Experience with security tools like HashiCorp Vault and policy enforcement with OPA Gatekeeper
  • Agile methodology: Comfortable working in fast-paced Agile environments using tools like Jira for tracking and collaboration
  • Stakeholder engagement: Pragmatic, empathetic approach when interacting with stakeholders and developer customers
  • Language requirement: Fluent English (mandatory for team collaboration and stakeholder communication)

NICE TO HAVE (Preferred)

  • Experience building and operating internal developer platforms or PaaS products
  • Background in mentoring and knowledge sharing within technical teams
  • Contributions to open-source cloud-native projects
  • Experience with additional Azure services beyond core PaaS offerings