Job Openings DevOps Engineer

About the job DevOps Engineer

About the Company and Role:

Our client is a product company operating in the domain and internet infrastructure space, developing and maintaining internal platforms and backend systems related to domain management and supporting services.

The company builds and operates its products in-house, with a strong focus on long-term stability, predictable system behavior, and production reliability. The entire infrastructure is fully self-hosted across multiple data centers, and the engineering team works directly with real production systems rather than managed cloud services.


We are seeking a highly skilled DevOps Engineer to join our team and help manage, optimize, and scale our infrastructure. The ideal candidate will have hands-on experience with self-hosted Kubernetes clusters, ClickHouse, Infrastructure as Code (IaC), observability tools, PostgreSQL, Kafka, and microservices architectures. 

The key objective of this role is to assess the current state of the self-hosted infrastructure, identify operational risks and technical debt, and systematically bring structure, stability, and observability in line with the long-term technical strategy.

You will also play a critical role in maintaining and improving our CI/CD pipelines, preferably using GitLab.

Location: Remote role (EU time zone), full-time.

We are offering:

  • Up to $4,500, b2b contract;
  • Full-time remote role in an international product company;
  • Work on a high-load backend system with real users and production scale;
  • Direct influence on architecture and technical decisions;
  • Modern backend stack and engineering-driven culture.

Interview Process:

  • HR Screening (30–40 minutes);
  • Technical Interview;
  • Final Interview with lead and CTO;
  • Job Offer.

Key Responsibilities:

  • Design, deploy, and manage self-hosted Kubernetes clusters for production and development environments.
  • Assess the current state of the infrastructure, identify technical debt and operational risks, and systematically improve reliability, structure, and observability in alignment with the CTOs technical strategy.
  • Implement, monitor, and optimize data storage solutions
    with ClickHouse and PostgreSQL.
  • Develop and maintain Infrastructure as Code (IaC) using tools such
    as Terraform or Ansible.
  • Build and enhance CI/CD pipelines, preferably with GitLab, ensuring seamless deployment and operations.
  • Integrate and manage observability tools (e.g., Prometheus, Grafana, ELK/EFK stack, Jaeger) for real-time system monitoring and troubleshooting.
  • Collaborate with development teams to ensure efficient integration of Kafka for event-driven architectures and microservices.
  • Automate routine operational tasks to improve efficiency and reduce errors.
  • Ensure system scalability, performance, and high availability while meeting security compliance.
  • Troubleshoot and resolve infrastructure and application issues across the stack.

Required Skills & Qualifications:

  • 3+ years of experience in DevOps, Site Reliability Engineering (SRE), or a similar role.
  • Proven expertise in managing and scaling self-hosted Kubernetes clusters.
  • Strong knowledge of ClickHouse architecture, optimization, and operations.
  • Hands-on experience with Infrastructure as Code (IaC) tools like Terraform, Ansible, or Pulumi.
  • Proficiency in working with observability tools such as Prometheus, Grafana, ELK/EFK stack, or Jaeger.
  • Solid understanding of PostgreSQL administration and performance tuning.
  • Familiarity with Kafka ecosystem for event-driven microservices.
  • Experience designing and implementing CI/CD pipelines (preferably using GitLab CI/CD).
  • Proficient in scripting languages such as Bash, Python, or Go.
  • Deep understanding of microservices architectures and best practices.
  • Strong problem-solving skills and the ability to work in a fast-paced, agile environment.

Preferred Qualifications:

  • Knowledge of service meshes such as Istio or Linkerd.
  • Experience with security best practices in a DevOps environment.
  • Familiarity with distributed systems and their challenges.