Data Steward
Al. Jerozolimskie 158, Warszawa
Bayer Sp. z o.o.
At Bayer we’re visionaries, driven to solve the world’s toughest challenges and striving for a world where ‘Health for all, Hunger for none’ is no longer a dream, but a real possibility. We’re doing it with energy, curiosity and sheer dedication, always learning from unique perspectives of those around us, expanding our thinking, growing our capabilities and redefining ‘impossible’. There are so many reasons to join us. If you’re hungry to build a varied and meaningful career in a community of brilliant and diverse minds to make a real difference, there’s only one choice.
Data Steward
As a technical data steward, you will be part of Bayer's Pharma Data & AI Team. This role blends hands-on data quality engineering with stewardship of metadata, driving the implementation of data management standards and automated quality controls throughout the data product lifecycle. You’ll ensure our commercial data products remain accurate, compliant, and well-governed by collaborating closely with data product owners and embedding best practices in development and operations.
YOUR TASKS AND RESPONSIBILITIES
Design, develop, and maintain automated testing and data quality frameworks based on product requirements, including by:
authoring test plans and scripts
implementing checks using e.g. dbt, Great Expectations, pytest
performing regression, performance, and user acceptance testing (UAT), and documenting defects in bug tracking tools
participating in code reviews to uphold testing and data management standards and advising on best practices.
Operationalize data quality in delivery pipelines by:
embedding quality gates in CI/CD and orchestration tools
collaborating with engineering to implement schema validation, contract testing, and change management for batch and streaming interfaces
managing quality metrics and thresholds
maintaining incident runbooks
partnering with DevOps teams to quickly resolve data quality issues
Monitor and report on data product and pipeline health by:
using observability tools (e.g., Dagster) to detect and address issues
publishing quality dashboards and trend reports on data usage and data quality
Define, steward, and maintain business and technical metadata aligned with the data catalog, glossary, and ownership model; contribute to logical and physical data models and update the catalog with lineage, asset relationships, and product certification status.
Oversee data classification and PII tagging in compliance with data privacy regulations, in partnership with security and compliance teams.
Drive continuous improvement in QA methodologies, data quality tooling, and data stewardship practices.
WHO YOU ARE
Bachelor’s degree in Computer Science, Information Systems, Data Science, or a related field
3+ years of experience in Data Stewardship, Quality Engineering, Data Management, or Data Governance roles, ideally within pharmaceutical or regulated industries
Good understanding of data quality frameworks and governance models, including defining, monitoring, and enforcing standards using test frameworks like dbt, pytest, and Great Expectations
Good understanding of data classification, PII tagging, and compliance with pharma-specific data privacy regulations, leveraging e.g. Collibra, Snowflake and AWS security features for implementation.
Experience building and maintaining business glossaries, metadata documentation, and asset relationships within enterprise data catalogs (e.g., Collibra), with basic knowledge of developing and maintaining logical and physical data models using industry-standard modelling tools and updating the Data Catalog accordingly.
Good understanding of cloud data management solutions (e.g. Snowflake, Databricks, AWS), scripting languages (e.g. SQL, Python), and automation for data validation and operational support, supported by solid analytical skills for identifying patterns and actionable insights in complex data sets.
Basic experience with observability tools and orchestration platforms (e.g., Dagster, Airflow) for monitoring pipeline and product health and troubleshooting issues
Experience collaborating with DevOps and engineering teams to specify requirements for data management tooling and access controls.
Strong interpersonal skills and ability to work collaboratively in a team environment.
Good stakeholder management and communication skills, with the ability to translate business needs into technical requirements and drive process improvements, while being capable of conveying technical concepts to non-technical audiences.
Familiarity with Agile/Scrum methodologies.
Strong problem-solving skills and attention to detail.
Strong adaptability and willingness to quickly learn and adopt new technologies and approaches in a dynamic, evolving data platform and business environment.
WHAT WE OFFER
A flexible, hybrid work model
Great workplace in a new modern office in Warsaw
Career development, 360° Feedback & Mentoring programme
Wide access to professional development tools, training, and conferences
Company Bonus & Reward Structure
Increased tax-deductible costs for authors of copyrighted works
VIP Medical Care Package (including Dental & Mental health)
Life & Travel Insurance
Pension plan
Co-financed sport card - FitProfit
Meals Subsidy in Office
Budget for Home Office Setup & Maintenance
Access to Company Game Room equipped with table tennis, soccer table, Sony PlayStation 5 and Xbox Series X consoles setup with premium game passes, and massage chairs
Tailor-made support with relocation to Warsaw when needed
Please send your CV in English

Digital Hub Warsaw - here the best and most creative minds work in a diverse and inclusive environment on groundbreaking solutions that support Bayer's vision of "health for all - hunger for none." We create digital solu...