#1 Job Board for tech industry in Europe

All offers Kraków DataSenior Spark Engineer

Senior Spark Engineer

Data

Senior Spark Engineer

Link Group

6 483 - 10 374 USDNet/month - B2B

Type of work

Full-time

Experience

Senior

Employment Type

B2B

Operating mode

Remote

Link Group

We're a team of tech enthusiasts who love putting together amazing IT teams for Fortune 500 companies and startups across the globe. Our goal is to help you find your ideal job and work with some of the biggest players in the tech industry.

Company profile

Tech stack

Spark

advanced

Cloud

regular

SQL

regular

Big Data

regular

Job description

Online interview

Friendly offer

Employment Type: Full-Time, Remote

Job Description: We are looking for a highly experienced Senior Spark Engineer with deep expertise in Apache Spark, particularly in performance tuning and managing cyclic Spark data flows. The candidate should be proficient in troubleshooting and optimizing real-time data processing systems, including customizing the Catalyst Optimizer. Experience working with federated data systems and distributed computing environments is essential, along with the ability to integrate external systems and APIs. The role will involve optimizing Spark pipeline performance across large-scale, multi-cloud environments.

Key Responsibilities:

Expertise in Apache Spark: Utilize in-depth knowledge of Spark, including performance tuning, query optimization, and customizing the Catalyst Optimizer for distributed systems.
Federated Data Systems: Design, implement, and manage data workflows within federated models across multi-cloud environments.
Performance Optimization: Diagnose and address bottlenecks in Spark jobs, ensuring scalable and efficient performance on large clusters.
Distributed Computing: Manage Spark clusters, oversee task scheduling, resource allocation, and ensure fault tolerance in distributed environments.
API Integration: Connect Spark applications with external systems and APIs to improve data processing workflows.
Scala and Java Development: Apply strong skills in Scala and Java to build, maintain, and optimize real-time distributed applications in Spark.
Front-End Collaboration: Work with front-end developers and data teams to create and deploy user interfaces for monitoring Spark pipeline performance.
CI/CD and Version Control: Develop and manage CI/CD pipelines to ensure reliable software development practices, version control, and automated deployment for distributed applications.

Required Skills and Experience:

Apache Spark: Advanced experience in tuning, optimizing, and customizing the Spark Catalyst Optimizer for maximum performance.
Scala and Java Proficiency: Strong hands-on experience with Scala and Java in Spark-based distributed systems.
Federated Data Models: Proven experience managing federated data systems in multi-cloud environments (e.g., AWS, GCP, Azure).
Distributed Computing: Deep understanding of distributed computing principles, including task scheduling, resource management, fault tolerance, and cluster optimization.
Performance Optimization: Demonstrated expertise in optimizing Spark pipelines for large-scale, high-volume systems.
API Integration: Experience integrating Spark with third-party systems and APIs to streamline data workflows.
Front-End Development: Basic to intermediate skills in front-end development to collaborate on building monitoring dashboards for Spark systems.
Software Development: Strong programming fundamentals, experience with version control (Git), and a solid understanding of CI/CD pipelines.

Preferred Qualifications:

Experience with Kubernetes for managing Spark clusters in containerized environments.
Familiarity with cloud platforms such as AWS, GCP, or Azure.
Knowledge of SQL and database integration with Spark.
Experience with big data tools (e.g., Hadoop, Kafka) used alongside Spark.

Why Join Us?

Be part of a forward-thinking, tech-driven team.
Work on cutting-edge distributed systems using federated models.
Collaborate with experts in cloud computing, big data, and data engineering.
Opportunities for professional growth and continuous learning.

Check similar offers

RPA/Power Apps Developer

New

BlueSoft

5.19K - 6.74K USD

Lublin

, Fully remote

Fully remote

Power Apps

Power Platform

Power Automate

Principal / Lead Cartographer- Hybrid/ Remote

New

HERE Technologies

6.74K - 9.08K USD

Szczecin

, Fully remote

Fully remote

Figma

GitLab

Agile

Staff Software Engineer (Privacy Engineering)

New

Affirm

7.74K - 9.9K USD

Warszawa

, Fully remote

Fully remote

Backend

Databases

AWS

Senior Data Engineer

New

ITMAGINATION

4.93K - 6.74K USD

Warszawa

, Fully remote

Fully remote

SQL

Google Cloud Platform

Apache Spark

Senior Cloud Data Engineer (AWS and Databricks)

New

Future Processing

5.45K - 8.56K USD

Gliwice

, Fully remote

Fully remote

Databricks

AWS

SQL