Job Openings
Senior Cloud Engineer (AWS)
About the job Senior Cloud Engineer (AWS)
Description
We are looking for a Senior Cloud Engineer (AWS) to join our team and help ensure the reliability, availability, and performance of our critical systems. You will work closely with development, operations, and security teams to design, build, and maintain scalable infrastructure, improve system resilience, and automate operational tasks.
Key Responsibilities
- Design, implement, and manage AWS infrastructure with Terraform/OpenTofu.
- Facilitate on-premise and Azure migrations to AWS.
- Implement and refine CI/CD pipelines to enhance deployment speed and reliability.
- Develop and automate operational processes to improve efficiency and reduce manual effort.
- Collaborate with software engineers to optimise application performance and reliability.
- Assist with incident response, root cause analysis, and post-mortem reviews to drive continuous improvement.
- Ensure security and compliance best practices are followed in system design and operations.
- Participate in on-call rotations to support critical infrastructure and services.
- Optimise cloud infrastructure for cost efficiency and performance.
- Maintain and improve the availability, scalability, and performance of production systems.
- Mentor teammates and developers
Qualifications
- 3+ years of experience in a Cloud Engineering, Site Reliability Engineering,
- DevOps, or related role with a specific focus on AWS.
- Expert knowledge of AWS.
- Strong proficiency with Terraform / OpenTofu, Terragrunt, and Terraform Pull Request automation (Atlantis, GitLab Terraform, etc).
- Expert knowledge of containerization and orchestration technologies (OCI runtimes and Kubernetes).
- Strong knowledge of ArgoCD or FluxCD.
- Experience with version control systems and CI/CD pipelines (Git, GitLab, GitHub, Tekton).
- Strong experience with observability concepts and associated tooling (eg Datadog)
- Strong knowledge of Linux system administration, networking, and troubleshooting.
- Good knowledge of database engines, specifically Aurora MySQL and Postgres
- Good programming or scripting skills in Python, Bash, Golang, or similar languages.
- Hands-on experience with the implementation of security best practices for infrastructure and applications in regulated environments (PCI DSS 3.2.1 and 4.0.0, SOX ITGC 404, NIST)
- Ability to complete SOC and risk assessment reports along with Business Continuity Planning