Data Engineer

Python

Remote, Poland (Remote) +1 Location

RemoFirst

Full-time
B2B
Mid
Remote

Job description

We are one of the fastest-growing private companies in the USA, ranked 85th on the 2025 Inc. 5000 list of fastest-growing companies. Backed by more than $40M in venture funding, we are scaling rapidly and investing heavily in AI-driven data solutions to supercharge our operations.

We’re hiring a Data Engineer to join our team at the intersection of engineering and AI innovation. This role is perfect for a passionate data practitioner who wants to see their work directly impact innovative new products and the company’s growth, while expanding their own competencies in the field.


What You’ll Do

  • Build data pipelines

    • Build ETL/ELT pipelines that extract data from sources and load it into target destinations

    • Transform data into formats usable by AI-based solutions (for RAG and fine-tuning scenarios)

  • Manage datasets for AI model training & fine-tuning

    • Work on instruction tuning datasets

    • Generate synthetic data

  • Evaluation & “Golden” datasets

    • Build “golden” datasets with domain experts

    • Build automated evaluation pipelines


What We’re Looking For

  • Technical Skills

    • Strong data engineering background: Python and SQL; Rust is a nice-to-have

    • Familiarity with AI concepts: RAG, fine-tuning, datasets

    • Experience in building ETL/ELT pipelines

  • Experience

    • 2–5 years in the data engineering space, with at least 1 year in an AI-focused environment

    • Experience working in an AWS environment

  • Traits

    • Ability to take ownership while also cooperating in small teams

    • Analytical mind

    • Attention to detail


Why Join Us

  • Growth opportunity: This is a chance to support how AI transforms a category-leading startup and to grow your career as we scale.

  • Direct impact: Your work will be at the center of the most important projects in our company.

  • Competitive compensation and benefits

Tech stack

  • English: C1

  • AWS: advanced

  • Rust: advanced

  • SQL: advanced

  • Python: advanced

  • RAG: regular

  • ETL/ELT pipelines: junior

Office location