DevOps Engineer

About the job

In this role, you will improve platform reliability and performance by designing and implementing robust solutions. You will collaborate with other teams to optimize system processes and infrastructure.

Responsibilities:

  • Design and implement solutions to enhance the reliability and performance of the system and services.
  • Work on automation to scale existing components of the platform effectively.
  • Participate in critical architectural decisions that shape the future of the platform.
  • Troubleshoot and resolve complex technical issues, staying involved from initial discovery through post-mortem analysis.
  • Work with the team to improve incident management and on-call processes.
  • Ensure the platform's stability by applying Site Reliability Engineering principles, such as embracing calculated risks and using Service Level Objectives (SLOs).
  • Eliminate repetitive tasks through automation, promoting efficiency and innovation.
  • Aid in the maintenance, scaling, and troubleshooting of the platform, providing support to the rest of the engineering organization.

Requirements:

  • Minimum of 7 years of relevant experience.
  • Strong familiarity with Site Reliability Engineering (SRE) practices and methodologies, including SLOs, error budgets, and incident management.
  • Background in software engineering or infrastructure, with proficiency in writing code to automate solutions.
  • Experience with large-scale distributed systems across multiple data centers.
  • Deep technical knowledge ranging from networking and infrastructure to services and databases.
  • Proficiency in programming or scripting languages such as Python, Java, or Bash.
  • Understanding of and experience with Infrastructure as Code, and with tools such as Puppet and Ansible for configuration management and automation.