Hong Kong, Hong Kong SAR, Hong Kong

Senior SRE / Linux Engineer (100% WFH)

Job Description:


Our client, a leading global exchange firm, is looking for a Senior SRE to manage and support the infrastructure that powers their platform, with a focus on reliability, scalability, and performance.

Key Responsibilities

  • Enhance resiliency, throughput, and latency of trading systems.
  • Manage AWS cloud infrastructure, EC2 instances, and physical servers.
  • Harden OS builds and configurations for security.
  • Maintain configuration management tools.
  • Integrate the existing stack with Kubernetes.
  • Implement SRE best practices.
  • Design and test disaster recovery capabilities.
  • Participate in an on-call rota for escalations.


Qualifications

  • At least 5 years of SRE/DevOps experience with strong AWS and Linux skills.
  • Expertise in networking protocols, the Linux kernel TCP stack, congestion control (e.g., BBR), AWS VPC / TGW, and the Kubernetes VPC CNI. DPDK experience is a plus.
  • Professional experience with kernel troubleshooting: strace, bpftrace, perf profiling/tracing, navigating / reading / building the relevant kernel code.
  • Professional experience with userland monitoring (e.g., Thanos/Prometheus/Alertmanager), logging (e.g., Splunk/Loki), alerting, troubleshooting, and profiling/tracing.
  • Familiarity with Kubernetes, Ansible, and Chef, and programming in Python, Golang, C, and Node.js.
  • Degree in computer science or engineering preferred.

Required Skills:

Linux