Job Openings Cloud/Site Reliability Engineer

About the job Cloud/Site Reliability Engineer

We are seeking an experienced Terraform Infrastructure Engineer to join our team. In this role, you will be responsible for designing, developing, and maintaining reusable Terraform modules for provisioning Google Kubernetes Engine (GKE) resources. Your expertise will drive the automation of deployment, scaling, and management of GKE clusters, node pools, and associated networking and storage components.

Key Responsibilities:

  • Design, develop, and maintain reusable Terraform modules for provisioning GKE resources.

  • Automate the deployment, scaling, and management of GKE clusters, node pools, and related components such as networking and storage.

  • Implement and enforce Infrastructure as Code (IaC) best practices, including version control, code reviews, and CI/CD pipeline integration.

  • Collaborate with cross-functional teams to optimize infrastructure performance and cost.

  • Create and maintain comprehensive documentation for GKE cluster usage, architecture, and lifecycle management.

  • Ensure system reliability, security, and scalability in cloud infrastructure design and deployment.

  • Continuously improve the infrastructure environment to meet operational goals.

Required Qualifications:

  • 9+ years of experience in Infrastructure Engineering or related field.

  • Strong experience with Kubernetes, specifically Google Kubernetes Engine (GKE).

  • Advanced proficiency with Terraform and Infrastructure as Code (IaC) practices.

  • Proven experience with cloud infrastructure management (Google Cloud Platform preferred).

  • Expertise in automating cloud-based infrastructure processes and managing lifecycle of clusters and related components.

  • Solid understanding of networking, storage, and security principles in cloud environments.

  • Experience with CI/CD integrations for infrastructure automation.

  • Ability to collaborate effectively with cross-functional teams and external stakeholders.

  • Excellent written and verbal communication skills.

Preferred Qualifications:

  • Familiarity with Google Cloud Platform (GCP) services.

  • Experience working with monitoring tools such as Prometheus, Grafana, or similar.

  • Familiarity with containerization technologies and container orchestration (Docker, Kubernetes).