Saventic Health is a mission-driven health-tech startup based in Warsaw, dedicated to transforming the diagnosis of rare diseases using cutting-edge Artificial
Intelligence. We are developing innovative solutions that have the potential to significantly shorten diagnostic timelines and improve patient outcomes. As a startup,
we thrive on innovation, agility, and a collaborative spirit where every team member makes a tangible impact. Due to the sensitive nature of our work and strict
regulatory requirements, maintaining trust and data security is paramount, influencing our data handling practices and infrastructure choices (currently primarily on-
premise for core operations).
The Opportunity:
We are seeking a highly motivated and experienced Senior Data Engineer to design, build, and manage the data infrastructure that fuels our AI-driven diagnostic
platform. This is a critical role where you will lay the foundation for how we collect, store, process, and utilize complex medical data, enabling breakthroughs in rare
disease diagnosis.
You will be responsible for creating robust, scalable, and reliable data pipelines and systems. While navigating the requirements of our on-premise infrastructure for
sensitive data, you will also leverage cloud environments for specific workloads and contribute to our evolving data strategy, potentially including future hybrid cloud
architectures. Your expertise will be crucial in ensuring our Data Scientists and Medical teams have access to high-quality, timely data.
Your Impact:
As a Senior Data Engineer, you are central to unlocking the potential of our data and directly contributing to our mission:
-
Enable AI Discovery: Build the data pipelines and structures that provide the high-quality fuel for our life-changing AI models.
-
Ensure Data Integrity & Trust: Implement systems and processes to guarantee data quality, security, and compliance within a regulated healthcare environment.
-
Shape Our Data Foundation: Play a key role in designing our data architecture, selecting tools, and defining best practices for data management and governance as we grow.
-
Drive Efficiency & Scalability: Optimize data processing and storage solutions to handle increasing data volumes and complexity efficiently.
-
Foster Innovation: Explore and implement cutting-edge data technologies and techniques appropriate for our unique challenges.
Key Responsibilities:
-
Data Pipeline Development: Design, build, automate, and maintain scalable and reliable ETL/ELT pipelines for diverse data sources.
-
Data Modeling & Architecture: Design and implement effective data models and database schemas; contribute to the overall data architecture design.
-
Data Storage Management: Manage and optimize our data storage solutions, including our current PostgreSQL databases and potentially other SQL/NoSQL/Data Lake solutions in on-premise and cloud environments.
-
Data Quality & Governance: Implement data quality checks, monitoring, and anomaly detection; contribute to data governance policies and ensure compliance (e.g., GDPR).
-
Performance Optimization: Monitor and optimize the performance of data pipelines, queries, and data retrieval processes.
-
Tooling & Technology: Evaluate, select, and implement appropriate data engineering tools and technologies.
-
Collaboration: Work closely with Data Scientists, MLOps/DevOps Engineers, and domain experts to understand data requirements and deliver solutions.
-
Security: Ensure data security best practices are implemented throughout the data lifecycle.
-
Documentation: Maintain clear documentation for data pipelines, schemas, and processes.
-
Mentoring: Support the development and mentor less experienced team members.
Who You Are:
-
Experienced Data Engineer: Proven track record in a Data Engineering role, with experience operating at a senior level.
-
Strong Programmer: Proficiency in Python is required for data manipulation, pipeline development, and scripting.
-
SQL & Database Expert: Deep understanding of SQL and extensive experience with relational database systems (including current hands-on experiencewith PostgreSQL). Experience with database design and data modeling is essential.
-
Pipeline Builder: Hands-on experience designing, building, and operationalizing complex data pipelines using relevant tools (e.g., Airflow, Prefect, Dagster,
- Spark, Kafka, or cloud-native equivalents).
-
Data Warehousing Knowledge: Solid understanding of data warehousing concepts, dimensional modeling, and associated technologies (experience with cloud DWs like BigQuery, Redshift, Snowflake is a plus).
-
Cloud Data Savvy: Experience with data services in at least one major cloud provider (AWS, GCP, Azure).
-
Data Quality Advocate: Understanding of data quality principles and experience implementing relevant checks and monitoring.
-
Infrastructure Aware: Familiarity with Linux environments and comfortable working with infrastructure components relevant to data engineering.
-
Proactive & Self-Motivated: Thrives in a fast-paced startup environment, takes initiative, identifies problems, and drives solutions independently.
-
Collaborative Communicator: Excellent communication skills to work effectively across technical and non-technical teams.
-
Passionate: Genuine interest in applying data engineering skills to solve real-world problems in healthcare.
Nice to Haves:
- Experience working with healthcare data (EHR, genomics, imaging) and relevant standards (HL7, FHIR, DICOM).
- Experience in regulated industries (Healthcare, Pharma, Finance) and handling sensitive data.
- Experience with big data technologies (e.g., Spark, Hadoop ecosystem).
- Experience with streaming data technologies (e.g., Kafka, Kinesis, Flink).
- Experience working with Large Language Models (LLMs), particularly regarding data preparation, optimization, Retrieval-Augmented Generation (RAG), or fine-
- tuning support.
What We Offer:
- A chance to make a significant impact on patient lives through data-driven insights.
- A pivotal role in building the data foundation of a growing HealthTech startup.
- Opportunity for professional growth in a challenging and rewarding field.
- A dynamic, innovative, and collaborative startup culture that values initiative.
- Competitive salary.
Ready to build the future of diagnostic data?
If you are excited by the challenge of architecting and managing data systems for cutting-edge AI in a meaningful domain, we encourage you to apply!