Data Engineer
DescripciĆ³n del puesto:
About Us
We are a start-up based in Munich and in Barcelona. Were founded in Spring 2020 and two years later we are now already a team of +80 super motivated ML Engineers, data scientists and developers.
Our mission is to build beautiful Artificial Intelligence products. Were curious, passionate, and relentless in our drive to develop a truly end-to-end product, pushing the boundaries of innovation as far as we can.
We work in two fronts:
- Our product: currently used by +5000 users. Through the use of AI, enables and simplifies the core corporate finance processes (budgeting, resource allocation, what-if, ).
- Our AI laboratories: we develop AI prototypes for our clients as part of their R&D initiatives. Our laboratories are diverse and aim to test state of the art approaches to currently unsolved business challenges
Our AI and ML hub is based in Barcelona and Prague, we are currently a team of 30+ DS & ML engineers and Data Engineers both working from our office or remotely. We are looking for passionate Data Engineers to join our mission working either remotely or from our awesome office in Barcelona.
In the AI team, we encourage a fun and agile environment. We contribute to the development of the brain of our products. We work in scrum, and we develop our products in our own Python packages powered by open-source ML libraries and we deploy them in AWS to power our applications.
Role
If you have a passion for data and technology and want to join a crew of sharp analytical minds, then this may be the right opportunity for you.
As a Data Engineer , you will take part of the following tasks:
- Develop high quality data models in SQL to support our applications
- Develop automated data pipelines in AWS (lambdas, step functions, Glue)
- Design, building, and launching of new data models and data pipelines
- Implement best practices in data engineering including data integrity, validation, reliability, and documentation and improving discoverability of data
- Optimize database design for performance
- Build relationships with Data Scientists, Product Managers and Software Engineers to understand data needs
Qualifications / Experience
- 5+ years of experience in data modeling, ETL/ELT development, and Data Warehousing
- 3+ years of experience in with Cloud database technology, specifically Redshift
- 2+ years experience developing and deploying high-performance solutions using Apache airflow, Lambda Functions, Glue, Python and Spark
- Solid understanding of database design principals
- Solid understanding of query execution plans
What sets us apart?
- We are an internationally diverse team that supports one another
- We develop high-quality software and thus create sustainable added value for our customers
- We live a feedback culture so that we can constantly reflect and improve
- We offer flexible remote work with free time management within the projects
- We enjoy new technologies and love to learn new things and grow with them
- We give freedom for further training because lifelong learning is important to us
Conocimientos necesarios:
Performance Environment Data Warehousing Spark Modeling Data Integrity Data Modeling Intelligence Artificial Intelligence Pipelines Database Design Corporate Finance Apache Reliability Art Validation R Developers Budgeting Scrum Time Management Python Documentation Software Finance SQL Engineering Design Business Training Management