Senior Data Engineer / Databricks Developer

Data

Senior Data Engineer / Databricks Developer

Data
Centrum, Warsaw

emagine Polska

Full-time
Any
Senior
Remote

Job description

Project Description – DQX-Based Data Quality Monitoring

We are looking for an experienced senior data engineer / Databricks developer to support the design and implementation of a DQX-based Data Quality Monitoring capability. The objective is to build a scalable solution that enables business users and data owners to monitor, understand, and act on data quality where it matters most. The solution should create a clear link between data input, data usage, and business-critical data quality rules, allowing data owners to define and enforce quality expectations across key data domains.

The capability should support business users in self-managing data quality oversight through dashboards, trending, and rule-based monitoring. This includes visibility into data quality performance over time, identification of recurring issues, and transparency on which data points are most critical based on documented business usage. The initial scope will focus on selected Study Management data domain, starting with Study Personnel and Milestones, with the ambition to design the solution so it can scale to additional Clinical data areas over time.

The solution is expected to be built natively on Databricks, leveraging relevant data quality frameworks such as DQX, and should support a governed operating model where data quality rules are owned and endorsed by relevant data owners. The expected outcome is a robust and scalable Data Quality Monitoring solution that improves transparency, strengthens data ownership, supports audit readiness, and enables proactive management of data quality issues before they create downstream impact.

What we offer:

  • Remote role

  • B2B Contract

  • Rate: 41 euro/h+ VAT

Main Responsibilities

The consultant is expected to contribute to the following:

  • Design and implementation of a Databricks-native Data Quality Monitoring framework (DQX).

  • Configuration and implementation of DQX-based data quality rules.

  • Development of data pipelines and data models supporting monitoring and reporting.

  • Dashboards and trending views for business users and data owners.

  • Linkage between data quality rules, data usage, and critical data points.

  • Documentation of technical design, rule logic, and operating model.

  • Support for scaling the solution to additional Clinical data domains.

Key Requirements

  • Strong hands-on experience with Databricks.

  • Experience with Spark, SQL, Delta Lake, and Python.

  • Experience designing and implementing data pipelines / ETL-ELT.

  • Experience with data quality frameworks, preferably DQX.

  • Understanding of data governance, data ownership, and rule-based data quality monitoring.

  • Ability to translate business requirements into scalable technical solutions.

  • Experience working in complex enterprise environments.

Nice to Have

  • Experience with regulated or compliance-heavy environments.

  • Experience with clinical operations, trial operations, or life sciences data.

  • Experience with dashboarding and business-facing data quality reporting.

  • Azure DevOps / CI-CD experience.

  • Experience with integrations, APIs, or downstream system connectivity.

  • Snowflake experience.

  • Understanding of Veeva or related clinical systems.

Tech stack

    English

    B1

    Documentation

    advanced

    Governance

    advanced

    Operations

    advanced

    DataStage (ETL)

    advanced

    SQL

    advanced

    Python

    advanced

    Spark

    advanced

    ETL

    advanced

    Microsoft Azure

    advanced

    DevOps

    advanced

Office location

Senior Data Engineer / Databricks Developer

Summary of the offer

Senior Data Engineer / Databricks Developer

Centrum, Warsaw
emagine Polska
By applying, I consent to the processing of my personal data for the purpose of conducting the recruitment process. Informujemy, że administratorem danych jest emagine z siedzibą w Warszawie, ul.Domaniewskiej 39A (dalej jako "administrator"). Masz pr... MoreThis site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Check similar offers
DCG

DCG

Warsaw

Remote

Remote

33 - 35USD/h
AWS
Bash
PySpark
Data architecture
MongoDB
Snowflake
SQL
Python
SeniorSeniorB2BB2B
New
ADVERTISEMENT: Recommended by Just Join IT
Applied -
29 day left (until 15.07.2026)
Applied -
Check similar offers
DCG

DCG

Warsaw

Remote

Remote

33 - 35USD/h
AWS
Bash
PySpark
Data architecture
MongoDB
Snowflake
SQL
Python
SeniorSeniorB2BB2B
New
Margo

Margo

Warsaw

Remote

Remote

55USD/h
ORM
AWS
DBT
Python 3
SQLAlchemy
Sagemaker
Amazon AWS
MLflow
Pandas
SQL
SeniorSeniorB2BB2B
New
DevsData LLC

DevsData LLC

Remote

Remote

18 - 20USD/h
Azure
Databricks
SQL
Python
SeniorSeniorB2BB2B
New
EPAM Systems

EPAM Systems

Remote

Remote

Undisclosed Salary
AWS
Spark
Azure
Databricks
Agile
Java
Python
Scala
Big Data
SeniorSeniorB2BB2B
New
Future Mind

Future Mind

Remote

Remote

Undisclosed Salary
PySpark
Azure
Databricks
SQL
Python
SeniorSeniorAnyAny
New
ADVERTISEMENT: Recommended by Just Join IT