Job Openings Site Reliability Engineer I

About the job Site Reliability Engineer I

Company Description

Aqilea is an IT and engineering consulting partner that helps companies get more out of their technology and operations. With teams in Stockholm and Bangalore, we work closely with our clients to build solutions that fit their needs - from software development, AI and infrastructure engineering to industrial automation and embedded systems.


We combine strong technical expertise with a practical, business-focused approach to help organizations modernize, improve security, and scale with confidence. Above all, we focus on long-term partnerships built on trust, quality, and real results.

With us, you have great opportunities to take real steps in your career and the opportunity to take great responsibility.

About the Role

Company: Aqilea India

Job Title: Site Reliability Engineer (SRE)

Experience: 5–10 Years
Location: Bangalore (Hybrid)

About the Role

We are seeking a highly motivated Site Reliability Engineer (SRE) to join our dynamic product team. In this role, you will be responsible for ensuring the reliability, scalability, and performance of mission-critical applications in a fast-paced eCommerce environment. You will work closely with development, DevOps, and product teams to build resilient systems and improve operational efficiency.

Key Responsibilities

  • Collaborate with cross-functional teams to ensure high system availability and reliability.
  • Apply SRE principles to improve system performance, scalability, and resilience.
  • Monitor application and infrastructure health, and proactively address potential issues.
  • Troubleshoot and resolve complex production incidents, ensuring minimal downtime.
  • Perform root cause analysis and implement preventive measures.
  • Build and enhance monitoring, alerting, and logging solutions.
  • Automate operational tasks and workflows to reduce manual effort.
  • Contribute to CI/CD pipeline improvements and release processes.
  • Ensure adherence to SRE metrics such as SLI, SLO, and Error Budgets.
  • Maintain documentation for processes, systems, and incident handling.
  • Participate in on-call rotations to support critical production systems.

Required Skills & Qualifications

  • 5+ years of experience in Site Reliability Engineering, DevOps, or Production Support.
  • Strong experience working in eCommerce platforms.
  • Hands-on experience with CI/CD pipelines and DevOps practices.
  • Solid understanding of microservices architecture and API-based systems.
  • Experience with incident management, problem management, and ITIL processes.
  • Familiarity with ITSM tools such as ServiceNow.
  • Strong debugging and performance optimization skills.

Technical Skills

  • Programming/Scripting: Proficiency in at least one – Python, Java, Go, Ruby, or C#.
  • Frontend/Backend: Exposure to ReactJS, React Native, Node.js.
  • CI/CD Tools: Experience with GitHub Actions or similar tools.
  • Cloud Platforms: Hands-on experience with Microsoft Azure and/or Google Cloud Platform (GCP).
  • Infrastructure as Code: Experience with Terraform and/or Ansible.
  • Monitoring & Observability: Experience with Grafana, Splunk, or similar tools.
  • Containerization & Orchestration: Experience with Kubernetes (AKS, GKE).

Start: Immediate to 15 Days

Location: Bangalore (Hybrid)