Site Reliability Engineer

About the job

Responsibilities:

Ensure the availability, low latency and reliability of critical systems
Apply SRE practices such as SLOs, error budgets and incident analysis
Automate resolution of repetitive issues and reduce MTTR
Participate in on-call rotations and lead post-incident reviews

Must-have:

Proven experience in SRE, DevOps or Systems Engineering roles
Strong grasp of monitoring, alerting and incident management
Scripting ability in Python, Bash or Go
Deep understanding of Linux system internals

Nice-to-have:

Exposure to chaos engineering or failure injection
Familiarity with incident response tools (PagerDuty, Opsgenie)
SRE/DevOps certifications (Google SRE, AZ-400, etc.)

For more details, contact us at recruitment@lynxmind.com
