Job Openings Senior Cloud AI Engineer (Hybrid, Lahore, USD Salary)

About the job Senior Cloud AI Engineer (Hybrid, Lahore, USD Salary)

Requirements:

  • Bachelor's or Masters degree in Computer Science, Engineering, or a related field.
  • 7+ years of professional software development experience, with at least 3-4 years focused on cloud architecture and backend systems.
  • Proficient in Google Cloud Platform (GCP) with hands-on experience in Compute (Cloud Run, Cloud Functions, GKE), Messaging (Pub/Sub), Storage & Databases (BigQuery, Cloud Storage, Spanner, Firestore), AI/ML tools (Vertex AI, Generative AI APIs), and Networking & Security.
  • Proven experience designing and building event-driven architectures and microservices.
  • Strong programming skills, preferably in Python (especially for AI/ML and GCP), Java, or Go.
  • Experience with workflow orchestration tools (e.g., Apache Airflow, Cloud Workflows, or custom solutions).
  • Solid understanding of AI/ML concepts, including experience with LLMs, RAG, vector databases, and building agentic or autonomous systems.
  • Demonstrable experience implementing resilience patterns (retries, circuit breakers, idempotency) and ensuring system observability.
  • Experience with data modeling, SQL, and NoSQL databases.
  • Proficiency with CI/CD tools, containerization (Docker, Kubernetes), and infrastructure-as-code (Terraform).
  • Excellent problem-solving skills and the ability to architect solutions for complex, ambiguous problems.
  • Experience with building cloud-based AI, Compute, and Storage tools and concepts.
  • Familiarity with knowledge graphs (e.g., Neo4j, JanusGraph) and their application in RAG or knowledge management.
  • Experience with Document AI or similar document processing technologies.
  • Understanding of front-end interactions with backend agentic systems.

Responsibilities:

  • Design AI-driven systems on GCP using agentic pipelines, Pub/Sub workflows, Cloud Run, BigQuery, and automation.
  • Lead backend development of core services, atomic actions, orchestration, and agentic control loops, including context, session, and history management.
  • Integrate AI/ML models (LLMs, custom agents) with focus on RAG, embeddings, and prompt engineering.
  • Implement resilience practices: retries, circuit breakers, backoff, mitigation patterns, and observability (logging, monitoring, watchdogs).
  • Manage data persistence and flow, using tools like BigQuery for immutable record-keeping.
  • Build complex workflows with validation, consensus mechanisms, and automated cleanup actions.
  • Collaborate with PMs, analysts, and engineers; translate requirements; mentor and promote best practices.
  • Advocate CI/CD, infrastructure as code, and automated testing for reliable deployments.
  • Enforce security best practices and ensure compliance.