Job Openings Senior Software Engineer – Cloud Platform & Operations

About the job Senior Software Engineer – Cloud Platform & Operations

Our client is a fast‑growing technology company building modern infrastructure for AI‑driven applications. To strengthen their cloud foundation, were looking for a Senior Software Engineer to join the Cloud Operations team — a group responsible for the reliability, scalability, and evolution of their cloud‑native platform.

This role is ideal for engineers who enjoy working at the intersection of software engineering, distributed systems, and cloud infrastructure. You'll design and operate core platform components, shape the future of their managed cloud offering, and help ensure the system runs smoothly at scale.

Your Role

Platform Engineering

  • Design, build, and operate foundational cloud platform components
  • Develop and maintain Kubernetes clusters, including custom operators
  • Write production‑grade Go and Python code for platform services and automation

Reliability & Scalability

  • Improve the stability, performance, and cost efficiency of cloud environments across AWS, GCP, and Azure
  • Strengthen observability through metrics, logging, alerting, and monitoring frameworks
  • Participate in incident response, root‑cause analysis, and long‑term system hardening

Automation & Operations

  • Automate operational workflows, integrations, and infrastructure processes
  • Reduce operational overhead (KTLO) through engineering‑driven improvements
  • Collaborate closely with Platform, Regions & Clusters, and Feature teams to ensure seamless delivery

What You Bring

  • 5–7+ years in platform engineering, infrastructure, or SRE‑focused roles
  • Strong proficiency in Go and Python (expertise in one with willingness to use both)
  • Hands‑on experience running Kubernetes in production
  • Solid understanding of distributed systems and cloud‑native architectures
  • Experience with major cloud providers (AWS, GCP, or Azure)
  • Familiarity with CI/CD, infrastructure‑as‑code, and automation tooling
  • Comfortable participating in on‑call rotations and managing production incidents
  • Strong ownership mindset and clear communication skills

Nice to Have

  • Experience building Kubernetes operators or control‑plane components
  • Background in SaaS, database, or systems‑level products
  • Exposure to Prometheus, Grafana, OpenTelemetry, or similar observability tools
  • Knowledge of networking, load balancing, or service meshes
  • Contributions to open‑source projects

What's on Offer

  • Competitive salary, equity, and benefits
  • Fully remote role with flexible working hours
  • High‑impact position within a core cloud engineering team
  • Opportunity to work on large‑scale Kubernetes and multi‑cloud systems
  • Clear growth path