Cloud/Site Reliability Engineer

JOB DESCRIPTION

  • Design, develop, and maintain reusable Terraform modules for provisioning GKE resources.
  • Automate the deployment, scaling, and management of GKE clusters, node pools, and associated networking and storage components.
  • Implement and enforce IaC best practices, including version control, code reviews, and CI/CD integration.
  • Collaborate with cross-functional teams to optimize infrastructure performance and cost.
  • Create and maintain comprehensive documentation for GKE cluster usage, architecture, and lifecycle management.
  • Ensure system reliability, security, and scalability in cloud infrastructure design and deployment.
  • Apply strong Kubernetes experience across the responsibilities above.

Must have:

  • Terraform expertise for reusable GKE modules
  • GKE cluster management: deploy, scale, automate
  • IaC best practices: Git, code reviews, CI/CD
  • Cloud reliability & security focus
  • Strong Kubernetes experience