Data Enginner - AI Lab
Unleash the Power of Data — Shape the Future of Healthcare Intelligence!
Poland or Portugal-based opportunity with a fully remote work model (5 days per week).
As a Senior Data Engineer – AI Healthcare Analytics, you will be working for our client, a leader in healthcare data solutions dedicated to transforming complex health data into actionable insights. You’ll build scalable data pipelines and infrastructure that fuel AI and analytics projects, empowering smarter patient care and operational excellence.
Your main responsibilities:
Design and develop production ETL/ELT pipelines from diverse healthcare data sources, including claims, patient records, and referrals, into S3 data lakes and data warehouses.
Implement and manage batch workflows with orchestration, error handling, retries, and lineage tracking.
Write modern Python-based data processing jobs using the latest libraries to transform and merge healthcare datasets.
Build and optimize AWS data infrastructure, including S3, Athena, IAM, and Glue, focusing on cost efficiency and query performance.
Integrate healthcare-specific data such as ICD/CPT codes, NPI data, and demographic details, normalizing and processing them for analytical use.
Develop and optimize complex SQL queries for healthcare metrics, cohort analysis, patient segmentation, and performance dashboards.
Ensure data quality through validation, schema enforcement, anomaly detection, and automated validation pipelines.
Profile data for completeness, accuracy, and duplication, preparing comprehensive data quality reports.
Monitor and optimize data pipelines for throughput, memory efficiency, and cost — implementing caching, partitioning, and cost-tracking strategies.
Collaborate using Git, maintain detailed documentation, and develop clear technical specifications and data dictionaries.
You're ideal for this role if you have:
At least 5 years of professional experience building production-grade ETL/ELT pipelines processing large volumes of healthcare data.
Strong proficiency in Python (3.12+) and experience with modern data processing libraries.
Hands-on AWS experience with S3, Athena, IAM, and Glue, including cost management and schema design.
Expertise in writing and optimizing advanced SQL queries on datasets with millions or billions of rows, utilizing CTEs and window functions.
Experience working with large-scale datasets and performance trade-offs.
Good understanding of development practices such as Git workflows, code reviews, and documentation.
It is a strong plus if you have:
Healthcare data experience — claims processing, patient journey analysis, ICD/CPT coding, PHI/HIPAA compliance.
Familiarity with healthcare provider data (NPI, facility types).
Multi-cloud experience with Azure Data Factory, Synapse, or ADLS.
Knowledge of LLM data workflows, prompt engineering, and structured output formats like JSON or YAML.
Experience with data modeling for analytics (star/snowflake schemas, data warehouses).
Exposure to metadata and data catalog tools.
Skills in orchestration tools like Airflow, Dagster, or Step Functions.
Experience with PostgreSQL, Supabase, or serverless Postgres solutions.
Language Required for the role:
Fluent English
Eligibility for the role:
Only candidates with an existing legal right to work in Europe will be considered for this role.
#MAKEYourCareerBETTERInterested? Apply now and include your CV in English, along with a statement confirming your consent to the processing and storage of your personal data.
We offer you:
ITDS Business Consultants is involved in many various, innovative and professional IT projects for international companies in the financial industry in Europe. We offer an environment for professional, ambitious, and driven people. The offer includes:
Stable and long-term cooperation with very good conditions.
Enhance your skills and develop your expertise in various industries.
Work on the most strategic projects available in the market.
Define your career roadmap and develop yourself in the best and fastest possible way by delivering strategic projects for different clients of ITDS over several years.
Participate in Social Events, training, and work in an international environment.
Access to attractive Medical Package.
Access to Multisport Program.
Access to Pluralsight.
Flexible hours & remote work.
You can report violations in accordance with ITDS’s Whistleblower Procedure available here.
Ref. number
8088
Data Enginner - AI Lab
Data Enginner - AI Lab