Senior Data Engineer, Clinical Data Platform

Data

Senior Data Engineer, Clinical Data Platform

Data
Tomasza Zana 39a, Lublin +4 Locations

DataArt

Full-time
Permanent, B2B
Senior
Remote
5 454 - 6 272 USDNet per month - B2B
4 363 - 5 181 USDGross per month - Permanent

Job description

We are considering only candidates who are located in Poland.

Project overview

You will work on a platform that processes clinical and real-world data (EHRs, labs, registries, trial data) and powers analytics, reporting, and data products for a healthcare / clinical research client.

Position overview

We are looking for a Senior Data Engineer to build and operate a clinical data platform on Databricks, with a strong focus on robust data pipelines, data models, and data quality.

Technology stack

The platform is built on Databricks (Spark, Delta Lake) and includes reusable pipelines, a shared data model, and automated data quality checks.

Responsibilities

  • Design, build, and maintain end-to-end Databricks data pipelines (ingestion, transformation, publishing) for production use

  • Work with data models (staging, curated, canonical, or dimensional) and help evolve them together with architects and analysts

  • Embed data quality and data governance rules into all pipelines (checks, validation, monitoring, alerting)

  • Optimize Databricks jobs for performance and cost (cluster configuration, partitioning, caching, file layout)

  • Collaborate with data architects, analysts, and domain experts to clarify requirements and refine technical solutions

Requirements

  • 5+ years of experience in data engineering, DWH, or big data, including production data pipelines

  • Strong hands-on experience with Databricks: Spark (PySpark/Scala), Delta Lake, Databricks Jobs / Workflows

  • Proven experience designing and operating end-to-end pipelines on Databricks for batch or near-real-time data

  • Experience with data pipelines and CI/CD for data

  • Practical experience with data modeling (layered models, canonical or dimensional models) for analytics and reporting

  • Experience embedding data quality and data governance rules into pipelines (schema checks, business rules, SLOs, monitoring)

  • Good communication skills, upper-intermediate or higher English proficiency, and the ability to work closely with stakeholders in distributed teams and communicate directly with clients

Nice to have

  • Experience designing and delivering PoC solutions on Databricks to quickly validate ideas using real data

  • Experience with ontologies or a semantic layer (business concepts, metrics, mappings) on top of analytical data

Tech stack

    English

    B2

    CI/CD

    advanced

    Spark

    advanced

    Data modeling

    advanced

    Databricks

    advanced

    Delta Lake

    advanced

Office location

Check similar offers
dmTECH Polska

dmTECH Polska

Remote

Remote

38 - 49USD/h
Kotlin
Docker
GCP
Terraform
BigQuery
Snowflake
Java
Python
SeniorSeniorB2BB2B
ADVERTISEMENT: Recommended by Just Join IT
Check similar offers
dmTECH Polska

dmTECH Polska

Remote

Remote

38 - 49USD/h
Kotlin
Docker
GCP
Terraform
BigQuery
Snowflake
Java
Python
SeniorSeniorB2BB2B
Link Group

Link Group

Remote

Remote

Undisclosed Salary
Git
CI/CD
ETL
Data
Apache
GCP
BigQuery
SQL
SeniorSeniorB2BB2B
New
Future Processing

Future Processing

Remote

Remote

37 - 54USD/h
AWS
PySpark
Snowflake
SQL
Python
SeniorSeniorB2BB2B
New
Sii

Sii

Remote

Remote

Undisclosed Salary
Microsoft Azure
Power BI
Snowflake
Databricks
SQL
Python
SeniorSeniorB2BB2B
New
Svitla Systems

Svitla Systems

Remote

Remote

Undisclosed Salary
Unix
Tableau
Power BI
AI
SQL
Python
SeniorSeniorB2BB2B
New
ADVERTISEMENT: Recommended by Just Join IT