Job Openings
Cloud/Site Reliability Engineer
About the job Cloud/Site Reliability Engineer
JOB DESCRIPTION
- Design, develop, and maintain reusable Terraform modules for provisioning GKE resources.
- Automate the deployment, scaling, and management of GKE clusters, node pools, and associated networking and storage components.
- Implement and enforce IaC best practices, including version control, code reviews, and CI/CD integration.
- Collaborate with cross-functional teams to optimize infrastructure performance and cost.
- Create and maintain comprehensive documentation for GKE cluster usage, architecture, and lifecycle management.
- Ensure system reliability, security, and scalability in cloud infrastructure design and deployment.
- Strong experience with Kubernetes
Must have:
- Terraform expertise for reusable GKE modules
- GKE cluster management: deploy, scale, automate
- IaC best practices: Git, code reviews, CI/CD
- Cloud reliability & security focus
- Strong Kubernetes experience