Job Openings Global Operations Center (GOC) Lead

About the job Global Operations Center (GOC) Lead

The Global Operations Center (GOC) Engineer is a key part of our 24x7 operations team, responsible for monitoring, initial incident triage, basic troubleshooting, and first-level escalation. You will be on the front lines ensuring availability and responsiveness across critical services.

This is an excellent opportunity for someone with a strong interest in operations, monitoring tools, and cloud technologies to grow in a dynamic, high-impact team.

Key Responsibilities:

  • Design and implement comprehensive monitoring solutions microservices running on AWS EKS.
  • Develop and maintain monitoring-as-code using Terraform/CloudFormation
  • Create custom dashboards, alerts, and runbooks for complex distributed systems.
  • Implement distributed tracing across microservices and serverless functions.
  • Monitor infrastructure, applications, and security alerts using tools such as Datadog, Splunk, or Prometheus.
  • Follow documented SOPs and runbooks to triage and resolve routine incidents.
  • Escalate non-resolvable or high-priority issues to the appropriate teams.
  • Perform daily health checks and support planned change validations.
  • Log and document all actions taken during shift in ticketing systems.
  • Participate in shift handovers with clear and complete status updates.
  • Maintain familiarity with system dashboards and performance metrics.
  • Contribute to documentation improvements and feedback loops for alert tuning.
  • Participate in on-call rotation and respond to critical monitoring alerts
  • Perform root cause analysis using metrics, logs, and traces
  • Reduce Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR)
  • Conduct post-mortems and implement monitoring improvements.
  • Build monitoring to detect issues before they impact customers.

Required Qualifications:

  • 5+ years of experience in infrastructure monitoring, DevOps, or SRE roles.
  • 3+ years of hands-on experience with AWS cloud services
  • 2+ years working with Kubernetes/container orchestration
  • Experience in NOC, helpdesk, or IT operations.
  • Familiarity with Linux commands, system health checks, and incident logging.
  • Willingness to work in a rotating shift environment.

Automation Experience:

o Shell scripting for automation (Bash, PowerShell)

o Infrastructure as Code using Terraform or CloudFormation

o Git version control and CI/CD pipeline experience

Preferred Qualifications:

  • Exposure to tools like ServiceNow, Datadog, or Splunk.
  • Prior experience in 24x7 operations or production support.
  • Strong written communication and documentation skills.