Cloud Data Engineer

Data

Kraków +9 locations

DCG

Full-time
B2B
Senior
Remote
5 598 - 6 936 USD net per month (B2B)

Job description

Responsibilities:

  • Implement and manage data ingestion pipelines from diverse sources, such as Kafka, RDBMS (Postgres) via CDC (Change Data Capture), and file systems (CSV), following Medallion Architecture principles
  • Develop and optimize data transformations using PySpark and SQL to handle data volumes ranging from megabytes to gigabytes, depending on the source
  • Conduct unit testing and integration testing to ensure the accuracy and reliability of data transformations and pipelines
  • Work with AWS technologies, including S3 for data storage and Docker on AWS for containerized applications
  • Implement and manage infrastructure using Terraform, such as creating S3 buckets, managing Databricks Service Principals, and deploying infrastructure as code
  • Deploy and manage solutions using CI/CD pipelines, particularly with CircleCI, to ensure seamless and automated deployment processes

 

Requirements:

  • Minimum 4-5 years of professional experience
  • Proficiency in SQL and Python
  • Strong experience with AWS cloud services
  • Hands-on experience with Databricks
  • Knowledge of ETL processing
  • Effective communication skills in English (minimum B2 level)
  • Knowledge of system design
  • Understanding of Medallion Architecture

Nice to have:

  • Familiarity with Kedro and Airbyte 
  • Knowledge of Machine Learning

 

Offer:

  • Private medical care
  • Co-financing for a sports card
  • Training & learning opportunities
  • Ongoing support from a dedicated consultant
  • Team-building events organized by DCG
  • Employee referral program

Tech stack

  • AWS: advanced
  • ETL: advanced
  • Databricks: advanced
  • SQL: advanced
  • Python: advanced
  • PySpark: regular
  • Azure: nice to have
