Hong Kong, Hong Kong SAR, Hong Kong
Senior SRE / Linux Engineer (100% WFH)
Job Description:
Join our client, a Leading Global Exchange Firm looking for a Senior SRE to manage and support the infrastructure that powers their platform, focusing on reliability, scalability, and performance.
Key Responsibilities
- Enhance resiliency, throughput, and latency of trading systems.
- Manage AWS cloud infrastructure, EC2 instances, and physical servers.
- Harden OS builds and configurations for security.
- Maintain configuration management tools.
- Integrate our stack with Kubernetes.
- Implement SRE best practices.
- Design and test disaster recovery capabilities.
- Participate in an on-call rota for escalations.
Qualifications
- At least 5 years of SRE/DevOps experience with strong AWS and Linux skills.
- Expertise in networking protocols, Linux kernel TCP stack, congestion control (e.g., BBR), AWS VPC / TGW, Kubernetes VPC CNI. DPDK experience is a plus.
- Professional experience with kernel troubleshooting: strace, bpftrace, perf profiling/tracing, navigating / reading / building the relevant kernel code.
- Professional experience with userland monitoring (e.g. Thanos/Prometheus/AlertManaging), logging (e.g. Splunk/Loki), alerting, troubleshooting, profiling/tracing, etc.
- Familiarity with Kubernetes/Ansible/Chef and programming in Python, Golang, C, NodeJS.
- Degree in computer science or engineering preferred.
Required Skills:
Linux