Ofertas de empleo Data Quality Engineer Sr

Acerca del puesto Data Quality Engineer Sr

We build AI-powered solutions, combining the power of deep learning with the accuracy of ontologies to drive natural-language understanding.

The Data Quality Engineer will be responsible for building our data quality monitoring platform, integrating with our various data sources to enable monitoring and evaluation of data quality. You will enjoy working with a highly talented and diverse team of data scientists and engineers specializing in deep learning, active learning, and classical machine learning on one of the richest data sets in Life Sciences and Healthcare.

The ideal candidate will have a strong background in Data Quality (with Python) with experience in Data Engineering, have experience working with large data sets, and deploying data-driven solutions. You are focused on results, a self-starter, able to put the team first, and have demonstrated success in using data science to develop and deploy solutions with a focus on impact.

Role and Responsibilities:

Build out Data Quality processes and services at scale in collaboration with engineering peers and leads

Work with Tech Lead, Product, and peer engineers to develop and monitor standards for data quality across many data sources of various sizes/complexity

Build on all aspects of building the Data Quality component of our Data Platform Build data quality monitoring tools to ensure that the Data Platform is providing high-quality and fit-for-purpose data

Ensure engineering designs are guided by high performance, scalability, and security, following the strict healthcare policy compliance, with low-cost

Agile through experimentation, prototyping, and solid execution

Required Qualifications:

Solid understanding of data pipelines, data structures, data modeling, and software architecture

Ability to write robust code in Python, including unit tests and documentation

5+ years of experience writing data quality tooling using Great Expectations or similar technologies

Solid experience in diverse data storage technologies such as PostgreSQL, Big Query, Redshift, Snowflake, Elasticsearch, etc.

Ability to support EST timezone

Preferred Experience:

BS or MS in Computer Science, Computer/Electrical Engineering, or a closely related Software Engineering field

An excellent track record (5+ years) of delivering Data Quality solutions, including data quality monitoring, measuring, reporting, and alerting

Experience with working with diverse data sets from relational databases to

unstructured datasets in Hadoop

Experience with Airflow

Experience with cloud platforms, such as Google Cloud Platform, Amazon Web Services, etc.

Experience with Healthcare and Life Sciences domain

A great collaborator who can work across operating styles and can bring together multiple perspectives, able to handle conflicts with the best interests of the company and customers in mind

Able to translate business and technical requirements into clean/logical design

Strong ability to perform design and code reviews