Job Title: Senior AI/ML Data Engineer
Location: California (Onsite)
Job Type: C2C (Corp-to-Corp)

Position Overview

We are seeking a highly skilled Senior AI/ML Data Engineer with expertise in multimodal AI models, vector databases, and Azure-based data engineering solutions. The candidate will lead the design and implementation of scalable AI-driven data platforms, integrating advanced ML pipelines with modern cloud architectures.

This is a client-facing, onsite role in California, requiring both technical depth and leadership ability to deliver enterprise-grade AI/data solutions.

Key Responsibilities

  • Design, build, and optimize AI/ML pipelines using multimodal models (CLIP, BLIP, Whisper, or similar).
  • Implement vector database solutions (FAISS, Milvus, Weaviate) and embedding pipelines for retrieval-augmented systems.
  • Develop and maintain data ingestion, ETL, and streaming solutions using Microsoft Fabric, Azure Data Factory, Azure Event Hubs, and Kafka.
  • Architect, implement, and optimize Azure-based solutions, including Azure Synapse, Databricks, and Azure SQL.
  • Write efficient SQL, Python, and Spark code for data transformation, ML feature engineering, and analytics.
  • Use infrastructure-as-code (Terraform, ARM templates) for automated deployment and environment consistency.
  • Collaborate with cross-functional teams (data science, cloud, and business stakeholders) to translate requirements into scalable solutions.
  • Ensure data governance, compliance, and security best practices are applied across all platforms.

Required Skills & Qualifications

  • Hands-on experience with AI/ML multimodal models (CLIP, BLIP, Whisper, or similar).
  • Strong proficiency in Python for AI/ML and automation workflows.
  • Experience with vector databases (FAISS, Milvus, Weaviate) and embedding pipelines.
  • Proficiency in Microsoft Azure services:
    • Azure Data Factory
    • Azure Synapse
    • Azure Databricks
    • Azure SQL
    • Azure Event Hubs
  • Experience with data streaming platforms (Azure Event Hubs, Kafka).
  • Strong skills in SQL and Spark for large-scale data engineering.
  • Experience with infrastructure-as-code (Terraform, ARM templates).
  • Strong problem-solving, debugging, and performance optimization skills.
  • Excellent communication skills, with the ability to mentor junior engineers and engage with stakeholders.

Nice to Have

  • Experience with Oracle PL/SQL and hybrid database ecosystems.
  • Knowledge of data warehousing, data modeling, and governance frameworks.
  • Exposure to MLOps frameworks and enterprise-grade AI deployment.