Job Openings
Senior AI/ML Data Engineer
About the job Senior AI/ML Data Engineer
Job Title: Senior AI/ML Data Engineer
Location: California (Onsite)
Job Type: C2C
Position Overview
We are seeking a highly skilled Senior AI/ML Data Engineer with expertise in multimodal AI models, vector databases, and Azure-based data engineering solutions. The candidate will lead design and implementation of scalable AI-driven data platforms, integrating advanced ML pipelines with modern cloud architectures.
This is a client-facing, onsite role in California, requiring both technical depth and leadership ability to deliver enterprise-grade AI/data solutions.
Key Responsibilities
- Design, build, and optimize AI/ML pipelines using multimodal models (CLIP, BLIP, Whisper, or similar)
- Implement vector database solutions (FAISS, Milvus, Weaviate) and embedding pipelines for retrieval-augmented systems.
- Develop and maintain data ingestion, ETL, and streaming solutions using Microsoft Fabric, Azure Data Factory, Event Hub, and Kafka.
- Architect, implement, and optimize Azure-based solutions, including Azure Synapse, Databricks, and Azure SQL.
- Write efficient SQL, Python, and Spark code for data transformation, ML feature engineering, and analytics.
- Use infrastructure-as-code (Terraform, ARM templates) for automated deployment and environment consistency.
- Collaborate with cross-functional teams (data science, cloud, and business stakeholders) to translate requirements into scalable solutions.
- Ensure data governance, compliance, and security best practices are applied across all platforms.
Required Skills & Qualifications
- Hands-on experience with AI/ML multimodal models (CLIP, BLIP, Whisper, or similar).
- Strong proficiency in Python for AI/ML and automation workflows.
- Experience with vector databases (FAISS, Milvus, Weaviate) and embedding pipelines.
-
Proficiency in Microsoft Azure services:
-
Azure Data Factory
-
Azure Synapse
-
Azure Databricks
-
Azure SQL
-
Event Hub
-
- Experience in data streaming platforms (Azure Event Hubs, Kafka).
- Strong skills in SQL and Spark for large-scale data engineering.
- Experience with infrastructure-as-code (Terraform, ARM templates).
- Strong problem-solving, debugging, and performance optimization skills.
- Excellent communication skills with ability to mentor junior engineers and engage with stakeholders.
Nice to Have
- Experience with Oracle PL/SQL and hybrid database ecosystems.
- Knowledge of data warehousing, data modeling, and governance frameworks.
- Exposure to MLOps frameworks and enterprise-grade AI deployment.