Job Openings Data Engineer (Databricks)

About the job Data Engineer (Databricks)

We are looking for a skilled and proactive Data Engineer with deep expertise in Databricks to join our data platform team. You will be responsible for designing, building, and optimizing scalable data pipelines and lakehouse architectures that power analytics, reporting, and machine learning across the organization

Responsibilities

  • Develop and maintain robust ETL/ELT pipelines using Databricks and Apache Spark
  • Design and implement Delta Lake architectures for structured and semi-structured data
  • Collaborate with data analysts, scientists, and product teams to deliver clean, reliable datasets
  • Optimize performance of Spark jobs and manage cluster resources efficiently
  • Automate workflows using Databricks Jobs, Workflows
  • Ensure data quality, lineage, and governance using Unity Catalog and monitoring tools
  • Document data models, pipeline logic, and architectural decisions
  • Participate in code reviews and contribute to engineering best practices

Required skills and experience

  • 4+ years of experience as a Data Engineer or in a similar role
  • Strong hands-on experience with Databricks, including Delta Lake, Spark SQL
  • Proficiency in Python and SQL for data manipulation and pipeline development
  • Solid understanding of Apache Spark internals and performance tuning
  • Experience with cloud platforms (Azure, AWS,GCP)
  • Knowledge of data modeling, partitioning, and lakehouse principles
  • Ability to work with large-scale datasets and optimize storage and compute costs
  • Strong communication skills and ability to collaborate across teams

Nice to have

  • Experience with Azure Data Factory or Airflow
  • Proficiency with modern data warehouses and tools including Snowflake, Synapse, Red Gate
  • Exposure to data governance frameworks and tools (e.g., Unity Catalog, Purview)
  • Understanding of machine learning workflows and integration with data pipelines
  • Experience with BI tools (Power BI, Tableau) and supporting analytics teams
  • Contributions to open-source projects or technical blogs