#1 Job Board for tech industry in Europe

  • Job offers
  • Lead Data Engineer
    New
    Data

    Lead Data Engineer

    30 000 - 40 000 PLNNet/month - B2B
    Type of work
    Full-time
    Experience
    Senior
    Employment Type
    B2B
    Operating mode
    Remote

    Tech stack

      Python

      advanced

      R

      regular

      Redshift

      regular

      MySQL

      regular

      Kinesis Firehose

      nice to have

      Java

      nice to have

    Job description

    Online interview

    PROJECT


    We are developing a mobile application that allows users to compare prices of the same products across different stores in the United States. Our app aims to provide consumers with real-time insights into product availability, pricing, and brand choices, helping them make more informed shopping decisions. We are currently expanding our efforts to enhance the quality of product metadata and automate data processing, utilizing cutting-edge technologies like Natural Language Processing (NLP) and machine learning.

    To support our rapid growth, we are looking for a talented Data Engineer with a strong focus on Prompt Engineering to help us build and optimize automated systems that will improve data accuracy, reduce manual work, and ensure smooth integration of new data sources. Your work will be instrumental in ensuring that our users can confidently compare prices with accurate and complete product information.


    SALARY

    💵 B2B: 30 000 - 40 000 PLN net


    RESPONSIBILITIES:

    Primary Role: Develop and maintain prompt-engineering scripts and semi-automatic flagging/review systems to clean, validate, and standardize metadata for various product fields. 


    Prompt Engineering & Scripts:

    • Create and optimize prompts/scripts to automatically clean, categorize, and tag product metadata.
    • Improve existing scripts for product types, sizes, tags, brands, and other metadata fields.

    Data Cleanup & Standardization:

    • Write scripts to handle data cleanup for brands, prices, sizes, and availability.
    • Convert size values to numerical representations based on product type-specific rules.

    Manual Review System:

    • Develop systems for manual review of flagged items in the data warehouse (DW).
    • Build review mechanisms for user-flagged issues and those identified by automated systems.

    Data Integration & Deduplication:

    • Ensure proper merging of products by identifying and consolidating duplicates using UPCs and metadata.

    Product Grouping:

    • Create scripts for identifying and grouping similar products based on size, color, scent, etc.


    SKILLS & QUALIFICATIONS

    We are seeking a highly skilled professional with a proven ability to work independently and deliver innovative technological solutions within their area of specialization. The ideal candidate will possess:

    • Self-Motivation: Ability to manage tasks autonomously and meet project deadlines without direct supervision.
    • Technical Expertise: In-depth knowledge of Date Engineering, with a track record of implementing successful projects.
    • Advisory Skills: Proven experience in assessing complex problems and providing expert recommendations on optimal technological solutions that align with business objectives.
    • Communication Skills: Strong ability to articulate technical concepts to non-technical stakeholders and collaborate effectively with cross-functional teams.
    • Continuous Learning: Commitment to staying updated with industry trends and advancements to enhance the technological capabilities of the organization.


    Requirements:

    5+ years of experience as a Data Engineer or in a similar role.

    ✅ Strong proficiency in Python (Mid/Senior level), R (Mid/Senior level), and experience with Java is a plus.

    ✅ Experience with NLP techniques, especially in prompt engineering for data automation.

    ✅ Proficient in designing and optimizing ETL pipelines and working with cloud data warehouses (e.g., Snowflake, BigQuery, Redshift).

    ✅ Familiarity with product metadata structures and experience working with large datasets in a commercial environment.

    ✅ Excellent problem-solving skills and attention to detail, especially in data validation and error handling.

    ✅ Strong communication skills to work effectively with cross-functional teams, including QA, backend developers, and product managers (English and Polish B2 + is a must).

    ✅ Experience with APIs, data integration, and handling multiple data sources.


    BENEFITS

    ✅ 25 days of paid leave for B2B or contract services;

    ✅ Flexible working hours, with the option to work remotely or from the office;

    ✅ Private healthcare (Luxmed);

    ✅ Subsidy for insurance (Warta);

    ✅ MPCharity: a program where you can propose initiatives that you would like to support.


    ABOUT US

    We are a Katowice-based IT company with over 100 people on board—and one dog. We have greatly diversified our operations, and despite the pandemic, we’ve experienced the most growth in the past few years. Our work focuses on our own product, getzendo.io, collaboration with a client from the United States, and a software house division (providing IT services to external companies).


    RECRUITMENT PROCESS

    Step 1: Interview with an HR representative (1h)

    Step 2: Technical Interview (1h)

    Step 3: Interview with CTO (1h)

    Step 4: Culture check with team (1h)

    30 000 - 40 000 PLN

    Net/month - B2B

    Check similar offers

    Senior Data Engineer z j. angielskim lub niemieckim (People and Project Analytics)

    New
    dmTECH Polska
    13.4K - 18.3K PLN
    Łódź
    , Fully remote
    Fully remote
    Java
    Snowflake
    Python

    Data Engineer (Azure Cloud Data Platform)

    New
    Baloise Solution Hub
    22K - 36K PLN
    Warszawa
    , Fully remote
    Fully remote
    English
    MLOps
    MS Fabric

    Senior Machine Learning Engineer (GCP)

    New
    Holisticon Connect
    26.9K - 31.9K PLN
    Wrocław
    , Fully remote
    Fully remote
    Python
    DBT
    data catalog

    AI and Automation Team Lead

    New
    capital.com
    29K - 35K PLN
    Warszawa
    , Fully remote
    Fully remote
    Python
    Kubernetes
    REST API

    Data Engineer

    New
    Connectis
    32K - 37K PLN
    Gdańsk
    , Fully remote
    Fully remote
    REST API
    Azure Databrics
    Azure Data Factory