Senior/Lead Data Software Engineer (Python, Spark, Azure)

Data

Senior/Lead Data Software Engineer (Python, Spark, Azure)

Data
BRAIN PARK, Fabryczna 1A, 31-553 Krakow, 3rd, 4th & 5th floor, Krakow

EPAM Systems

Full-time
Any
Senior
Hybrid

Job description

We are seeking a Senior/Lead Data Software Engineer to join our team working on a scalable, ML-ready platform that enhances portfolio model development and deployment with advanced data governance and AI capabilities.

You will play a key role in migrating from an IaaS Big Data platform to Azure-native Databricks, optimizing data workflows and improving data quality. Join us to contribute to innovative solutions that boost client services and regulatory compliance.

Responsibilities

  • Migrate and optimize over 500 data jobs using Azure Databricks optimization techniques
  • Manage and process 12 TB of data efficiently across platforms
  • Tune machine learning models for Azure environments using Java Spark and Delta tables
  • Update and maintain libraries to address security vulnerabilities
  • Develop and maintain ETL/ELT pipelines using PySpark and related technologies
  • Collaborate with cross-functional teams to integrate GenAI capabilities into data workflows
  • Monitor data quality and implement improvements to ensure accuracy and reliability
  • Automate deployment and operational tasks using Terraform and GitLab CI/CD
  • Support data governance initiatives to comply with regulatory standards
  • Troubleshoot and resolve performance issues in data processing systems
  • Document system processes and provide technical guidance to junior engineers
  • Implement best practices for code quality and data security
  • Participate in code reviews and knowledge sharing sessions
  • Optimize costs associated with data storage and processing

Requirements

  • Proficiency in Python and Spark with at least 3 years in data engineering roles
  • Strong experience with Azure Databricks and PySpark
  • Proven expertise in designing and implementing ETL/ELT solutions
  • Experience migrating big data platforms to Azure-native services
  • Proficiency with Delta tables for model tuning
  • Knowledge of data governance and regulatory compliance frameworks
  • Familiarity with Docker, Kubernetes (AKS), and Terraform for infrastructure automation
  • Ability to manage large data volumes with high efficiency
  • Excellent problem-solving and analytical skills
  • Strong communication and collaboration abilities
  • English proficiency at B2 level or higher

We offer

  • We gather like-minded people:
    • Engineering community of industry professionals
    • Friendly team and enjoyable working environment
    • Flexible schedule and opportunity to work remotely within Poland
    • Chance to work abroad for up to 60 days annually
    • Business-driven relocation opportunities
  • We provide growth opportunities:
    • Outstanding career roadmap
    • Leadership development, career advising, soft skills, and well-being programs
    • Certification (GCP, Azure, AWS)
    • Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
    • English classes
  • We cover it all:
    • Stable income (Employment Contract or B2B)
    • Participation in the Employee Stock Purchase Plan
    • Benefits package (health insurance, multisport, shopping vouchers)
    • Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
    • Referral bonuses
    • Corporate, social and well-being events
  • Please, note:
    • The set of bonuses might vary based on the role you apply for – specifics will be discussed with our recruiter during the general interview.
    • We will reach out to selected candidates exclusively.

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

Tech stack

    English

    B2

    Azure Databricks

    master

    Python

    master

    PySpark

    master

    Apache Spark

    master

    Delta Lake

    advanced

    Azure

    advanced

    ETL

    advanced

    data governance

    regular

Office location

Senior/Lead Data Software Engineer (Python, Spark, Azure)

Summary of the offer

Senior/Lead Data Software Engineer (Python, Spark, Azure)

BRAIN PARK, Fabryczna 1A, 31-553 Krakow, 3rd, 4th & 5th floor, Krakow
EPAM Systems
By applying, I consent to the processing of my personal data for the purpose of conducting the recruitment process. Klikając w przycisk „Aplikuj” lub w inny sposób wysyłając zgłoszenie rekrutacyjne, zgadzasz się na przetwarzanie Twoich danych osobowy... MoreThis site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Check similar offers
ITDS

ITDS

Krakow

Hybrid

Hybrid

6 007 - 7 278USD/month
Shell Scripting
Apache Spark
Leadership
GCP
Data Warehousing
Oracle
Python
Banking Domain
ETL/ELT
Data Engineering
SeniorSeniorB2BB2B
New
ADVERTISEMENT: Recommended by Just Join IT
Applied -
11 day left (until 30.06.2026)
Applied -
Check similar offers
ITDS

ITDS

Krakow

Hybrid

Hybrid

6 007 - 7 278USD/month
Shell Scripting
Apache Spark
Leadership
GCP
Data Warehousing
Oracle
Python
Banking Domain
ETL/ELT
Data Engineering
SeniorSeniorB2BB2B
New
Addepto

Addepto

Remote

Remote

5 665 - 8 610USD/month
AWS
Airflow
Iceberg
Docker
Kubernetes
Java
SQL
Python
Big Data
Apache Spark
SeniorSeniorB2BB2B
New
H2B Group

H2B Group

Poland (Remote)

Remote

Remote

6 462 - 7 847USD/month
Azure
Databricks
Python
Apache Spark
SeniorSeniorB2BB2B
New
emagine Polska

emagine Polska

Krakow

Hybrid

Hybrid

Undisclosed Salary
Copywriting (content)
Machine Learning (ML)
training
Spark
Artificial Intelligence (AI)
Active Directory (AD)
Python
Data Engineering
Scala
Pandas (Python)
SeniorSeniorAnyAny
New
DCG

DCG

Warszawa

Remote

Remote

36 - 40USD/h
Azure Databricks
Data architecture
Microsoft Azure Cloud
SeniorSeniorB2BB2B
New
ADVERTISEMENT: Recommended by Just Join IT