Job Openings Staff Engineer / Site Reliability Engineer - B2B, full remote

About the job Staff Engineer / Site Reliability Engineer - B2B, full remote

Job Location: 100% remote in Romania

Budget:

Recruitment process:

  • HR call
  • Technical screening
  • Technical interview

Role description:

Join a small, fast-moving team that keeps a high-throughput Rust-based platform humming. You'll wear several hats, designing features in a compiled language one day, tuning Kubernetes the next, and hunting down performance glitches in production after that.

What You'll Do

  • End-to-end ownership design, build, deploy, and operate production services on Linux, AWS, and Kubernetes. 
  • Reliability first develop monitoring, alerting, and chaos-testing strategies to keep SLOs green. 
  • Performance sleuthing profile code, trace distributed systems, and remove bottlenecks across the stack.
  • Automation & infra-as-code codify everything (Terraform, Helm, CI/CD) to make repeatable releases boring.
  • Systems thinking model how code, infrastructure, and customer traffic interact; propose resilient architectures.Mentorship & collaboration share hard-won lessons, review code, and guide incident post-mortems.

Must-Haves

  • 7+ years building and operating production systems
  • Deep Linux literacy
  • Fluency with AWS core services (EC2, ALB/NLB, S3, RDS or Aurora, IAM)
  • Kubernetes in anger: deployment patterns, debugging
  • Comfortable coding in at least one compiled language (e.g., Rust, Go, C/C++, Java)
  • Solid CS foundation (data structures, concurrency, networking)
  • Proven record of diagnosing production incidents and leading post-incident improvements

Nice-to-Haves

  • Hands-on Rust or Go in production
  • Observability stacks (OpenTelemetry, Prometheus, Grafana)
  • Multi-cloud or on-prem hybrid experience
  • Massive scale, edge, or high-performance computing exposure
  • Formal methods, distributed-systems research, or academic publications
  • Security-first mindset (threat modeling, policy-as-code)
  • Community contributions (open-source maintainer, conference speaker)

Why You'll Love It Here
Small team, huge impact; autonomy to choose the right tool; influence to improve the culture, continuous learning, and pragmatic engineering over dogma.

Ready to bring your battle scars and systems mindset to bear? Lets talk.