Job Openings
Staff Engineer / Site Reliability Engineer - B2B, full remote
Job Location: 100% remote in Romania
Budget:
Recruitment process:
- HR call
- Technical screening
- Technical interview
Role description:
Join a small, fast-moving team that keeps a high-throughput Rust-based platform humming. You'll wear several hats, designing features in a compiled language one day, tuning Kubernetes the next, and hunting down performance glitches in production after that.
What You'll Do
- End-to-end ownership: design, build, deploy, and operate production services on Linux, AWS, and Kubernetes.
- Reliability first: develop monitoring, alerting, and chaos-testing strategies to keep SLOs green.
- Performance sleuthing: profile code, trace distributed systems, and remove bottlenecks across the stack.
- Automation & infra-as-code: codify everything (Terraform, Helm, CI/CD) to make repeatable releases boring.
- Systems thinking: model how code, infrastructure, and customer traffic interact; propose resilient architectures.
- Mentorship & collaboration: share hard-won lessons, review code, and guide incident post-mortems.
Must-Haves
- 7+ years building and operating production systems
- Deep Linux literacy
- Fluency with AWS core services (EC2, ALB/NLB, S3, RDS or Aurora, IAM)
- Kubernetes in anger: deployment patterns, debugging
- Comfortable coding in at least one compiled language (e.g., Rust, Go, C/C++, Java)
- Solid CS foundation (data structures, concurrency, networking)
- Proven record of diagnosing production incidents and leading post-incident improvements
Nice-to-Haves
- Hands-on Rust or Go in production
- Observability stacks (OpenTelemetry, Prometheus, Grafana)
- Multi-cloud or on-prem hybrid experience
- Massive scale, edge, or high-performance computing exposure
- Formal methods, distributed-systems research, or academic publications
- Security-first mindset (threat modeling, policy-as-code)
- Community contributions (open-source maintainer, conference speaker)
Why You'll Love It Here
Small team, huge impact; autonomy to choose the right tool; influence to improve the culture; continuous learning; and pragmatic engineering over dogma.
Ready to bring your battle scars and systems mindset to bear? Let's talk.