Data Engineer - Data Platform

Warszawa +1 Location

Allegro

Full-time
Permanent
Mid
Hybrid

Job description

We are looking for a skilled and proactive Data Engineer to join our Vento team.

The role is critical for the development of the Vento service, which standardizes and validates all Allegro clickstream events. The Data Engineer ensures the high quality of data consumed by AI agents and core business dashboards (Board-level metrics). 

This is the right job for you if you:

  • Have solid, production-proven experience in Big Data ecosystems and building stable data pipelines.

  • Are proficient in Python and Spark (Scala is nice to have) and are open to working with Java backend applications.

  • Understand data quality principles, event contract validation, anomaly detection, and data processing within streaming environments (Clickstream).

  • Are open to feedback, keen to participate in code reviews, and eager to develop your skills.

  • Are a proactive communicator who naturally signals technical problems and takes ownership of your tasks with a high level of autonomy.

  • Show a broad technical perspective and are interested in how Big Data, Backend, and Frontend pillars interconnect.

  • Are enthusiastic about utilizing AI coding tools to automate repetitive tasks and optimize your daily workflow.

  • Speak Polish at a minimum C1 level and English at a minimum B2 level.

In your daily work you will handle the following tasks:

  • Designing, scaling, developing, and maintaining data validation pipelines for high-volume clickstream data within the Vento service.

  • Implementing robust anomaly detection mechanisms to ensure data integrity and quality before data is ingested into the core Data Assets layer.

  • Collaborating with cross-functional stakeholders to reconcile data requirements.

  • Actively participating in engineering discussions, code reviews, and cross-team initiatives.

What's in it for you:

  • Flexible working hours in the hybrid model (4/1) - working hours start between 7:00 a.m. and 10:00 a.m. We also have 30 days of occasional remote work.

  • Well-located offices (e.g., fully equipped kitchens, bicycle parking, terraces full of greenery) and excellent work tools (e.g., raised desks, ergonomic chairs, interactive conference rooms).

  • A 16" or 14" MacBook Pro or corresponding Dell with Windows (if you don't like Macs) and all the necessary accessories.

  • A wide selection of fringe benefits in a cafeteria plan - you choose what you like (e.g., medical, sports or lunch packages, insurance, purchase vouchers).

  • English classes, paid for by us, tailored to the specific nature of your job.

  • A training budget, inter-team tourism, hackathons, and an internal learning platform with a wide range of training courses.

  • An additional day off for volunteering, which you can use alone, with a team, or with a larger group of people connected by a common goal.

  • Social events for Allegro people - Spin Kilometers, Family Day, Fat Thursday, Advent of Code, and many other occasions we enjoy.

#goodtobehere means that:

  • You will join a team you can count on - we work with top-class specialists who have knowledge- and experience-sharing in their DNA.

  • You will love our level of autonomy in team organization, the space for continuous development, and the opportunity to try new things. You get to choose which technology solves the problem and you are responsible for what you create.

  • You will value our Developer Experience and the full platform of tools and technologies that make creating software easier. We rely on an internal ecosystem based on self-service and widely used tools such as Kubernetes, Docker, Consul, GitHub, and GitHub Actions. Thanks to this, you can contribute to Allegro from your very first days on the job.

  • You will be equipped with modern AI tools to automate repetitive tasks, allowing you to focus on developing new services and refining existing ones (also leveraging AI support).

  • You will create solutions that will be used (and loved!) by your friends, family and millions of our customers.

  • You will meet the Allegro Scale, which starts with over 1000 microservices, an open-source data bus (Hermes) with 300K+ rps, a Service Mesh with 1M+ rps, tens of petabytes of data, and production-used machine learning.

  • You will become part of Allegro Tech - we speak at industry conferences, cooperate with tech communities, run our own blog (it's been over 10 years!), record podcasts, lead guilds, and organize our own internal conference - the Allegro Tech Meeting. We create solutions we love to (and can) talk about!

Send us your CV and... see you at Allegro!

Tech stack

  • English - B2

  • Big Data - regular

  • Python - regular

  • Spark - regular

  • Java - regular

  • AI - regular

  • Scala - nice to have
