Senior Data Engineer
Stanisława Żółkiewskiego 17a, Kraków +2 Locations
Grape Up
Core Skills:
Experience with technologies including but not limited to:
Python / SQL
AWS Data Engineering Stack (S3, Glue, Redshift, Kinesis, Athena)
Big Data & Distributed Processing (Hadoop, Spark)
Data Pipelines Orchestration (Airflow, Prefect)
At Grape Up, we transform businesses by unlocking the potential of AI and data through innovative software solutions.
We partner with industry leaders, from the automotive and finance industry, to build sophisticated Data & Analytics platforms that transform how organizations manage and leverage their data assets. Our solutions provide comprehensive capabilities spanning data storage, management, advanced analytics, machine learning, and AI, enabling enterprises to accelerate innovation and make data-driven decisions.
Responsibilities
Implement a scalable architecture capable of handling the high volume of simulation data
Build a flexible data preprocessing pipelines that are extensible and that can be integrated into customer’s existing platform
Define KPIs to measure the improved reusability and automation of the new pipelines and test their performance in an end-to-end setting with model training.
Develop and implement processes and best practices for data management and governance
Optimize and enhance system setup and improve data structures following industry best practices
Collaborate effectively with data engineering team members while partnering closely with analytics and data science teams to meet user needs
Requirements
PhD or master’s degree in computer science, Data Science, AI, or related field
5+ years of professional experience in Data Engineering and Big Data
Proven experience in implementing and deploying solutions in AWS using AWS stack (AWS Glue, Redshift, Athena)
Proven experience in designing and implementing data governance and data management frameworks
Expert level knowledge of Python
Automotive or IoT data processing experience
Strong problem-solving skills and independent work ethic
Fluency in English, both written and spoken
Nice To Have
Experience with Infrastructure as Code (Terraform, CloudFormation)
Experience with Kubernetes
Experience with alternative data platforms (Databricks, Snowflake)
Experience with Machine learning and MLOps