Job Openings Site Reliability Engineer

About the job Site Reliability Engineer

Responsibilities:

  • Ensure availability, latency and reliability of critical systems
  • Apply SRE practices such as SLOs, error budgets and incident analysis
  • Automate resolution of repetitive issues and reduce MTTR
  • Participate in on-call rotations and lead post-incident reviews

Must-have:

  • Proven experience in SRE, DevOps or Systems Engineering roles
  • Strong grasp of monitoring, alerting and incident management
  • Scripting ability in Python, Bash or Go
  • Deep understanding of Linux system internals

Nice-to-have:

  • Exposure to chaos engineering or failure injection
  • Familiarity with incident response tools (PagerDuty, Opsgenie)
  • SRE/DevOps certifications (Google SRE, AZ-400, etc.)


For more details contact us at recruitment@lynxmind.com