Senior Data Engineer with AWS, Python (FastAPI)

5 611.60 - 6 874.21 USD net per month - B2B
4 769.86 - 5 611.60 USD gross per month - Permanent
Data

Prosta 20, Warszawa +4 Locations

DataArt

Full-time
B2B, Permanent
Senior
Remote

Job description

Client

Our client is a leading legal recruiting company aiming to build a data-driven platform specifically designed for lawyers and law firms. The platform brings everything together in one place — news and analytics, real-time deal and case tracking from multiple sources, firm and lawyer profiles enriched with cross-linked insights, rankings, and more.


Project overview

The platform aggregates data from hundreds of public sources, including law firm websites, deal announcements, legal databases, and media publications, to create a unified ecosystem of structured and interconnected legal data. It combines AI-driven enrichment, automated data processing, and scalable infrastructure to ensure comprehensive and reliable coverage of the legal market.


Position overview

We are seeking a Senior Data Engineer to design, build, and scale robust data pipelines for ingesting, transforming, and structuring large volumes of legal and financial data collected via scrapers. You will collaborate closely with AI/ML engineers, DevOps, and the Front-end and Back-end teams to keep the data workflows at the core of the platform running smoothly and efficiently.


Responsibilities

  • Design and implement data ingestion pipelines to collect and process structured and unstructured data from multiple online sources (web scraping, APIs, feeds, etc.).

  • Develop and optimize ETL/ELT workflows using Python and SQL.

  • Build and orchestrate scalable data workflows leveraging AWS services such as Batch and S3.

  • Develop and deploy internal data APIs and utilities supporting platform data access and manipulation.

  • Implement robust text extraction and parsing logic to handle diverse data formats.

  • Ensure data quality through validation, deduplication, normalization, and lineage tracking across Raw ➝ Curated ➝ Enriched data layers.

  • Containerize and orchestrate data workloads using Docker and native AWS solutions.

  • Collaborate closely with AI, Back-end, and Front-end teams to ensure efficient data integration and flow.

Requirements

  • Experience with AWS services (AWS Batch, Amazon S3, AWS Step Functions, Amazon SQS)

  • Data Quality experience

  • Master Data Management (MDM) experience

  • Experience with relational databases, specifically PostgreSQL

  • Proven expertise in Python programming

  • Solid understanding of the AWS ecosystem

  • Practical experience with Docker and containerized development workflows

  • Experience with web scraping, text extraction, or other data‑ingestion techniques from diverse online sources

  • Strong analytical mindset, effective communication skills, and ability to collaborate across multiple teams

Nice to have

  • Hands-on experience with Apache Spark and SQL for distributed data processing.

  • Experience with EMR, SageMaker.

Tech stack

  • English: B2

  • AWS: advanced

  • Data Quality: advanced

  • Master Data Management: advanced

  • PostgreSQL: advanced

  • Python: advanced

  • Docker: advanced

Published: 09.02.2026
