Job Openings
Site Reliability Engineer
About the job Site Reliability Engineer
Responsibilities:
- Ensure availability, latency and reliability of critical systems
- Apply SRE practices such as SLOs, error budgets and incident analysis
- Automate resolution of repetitive issues and reduce MTTR
- Participate in on-call rotations and lead post-incident reviews
Must-have:
- Proven experience in SRE, DevOps or Systems Engineering roles
- Strong grasp of monitoring, alerting and incident management
- Scripting ability in Python, Bash or Go
- Deep understanding of Linux system internals
Nice-to-have:
- Exposure to chaos engineering or failure injection
- Familiarity with incident response tools (PagerDuty, Opsgenie)
- SRE/DevOps certifications (Google SRE, AZ-400, etc.)
For more details contact us at recruitment@lynxmind.com