Data Engineer

Data

11-29 Fashion Street, London

Evaluate Ltd

Undetermined
B2B
Mid
Office

Job description

About Evaluate Ltd

Evaluate provides trusted commercial intelligence for the pharmaceutical and medical device industries.

Our EvaluatePharma® online subscription service provides a seamless view of the past, present and future of the global pharmaceutical market in a single, standardised platform. Vantage – our award-winning, independent editorial team – provides thought-provoking news and insights into current and future developments in the industry. Evaluate has been a trusted partner to industry-leading organisations for over 20 years. For more information on how we give our clients the time and understanding to drive better decisions, visit www.evaluate.com.

Requirements

Position Description
A Data Engineer is needed to support the product development team in data design and structuring, ETL processes, and pipeline generation.

Responsibilities

  • Support the product development team in the creation of features for predictive ML models.
  • Support the product development team in the manipulation of large data sets, and the extraction of insights.
  • Support the product development team in the design and optimisation of database solutions that sit upstream of the main production environment.
  • Develop & maintain data publication and synchronisation processes, supporting Tableau.
  • Design robust & efficient data processes, aggregations and pipelines.
  • Establish industry best practices using modern tools and processes.
  • Productionise Tableau data transforms.
  • Create attributes for data science models.
  • Manipulate large data sets to create aggregated analyses.
  • Create optimised upstream database systems, e.g. a consolidated/standardised clinical trial or pricing database (across multiple geographies).
  • Pull bespoke database extracts for hypothesis testing and ideation.
  • Perform trend analysis.
  • Pull, process and structure new data sets from public and private sources, e.g. grant information.

Working Conditions

Based in the London office with flexible working. Occasional travel to other offices, subsidiaries & partner locations. Initially 100% working from home while government travel restrictions remain in place due to Covid-19.

Required Skills & Qualifications

  • A good grasp of modern data practices and the ability to apply them to complex problems.
  • Experience building service features and libraries that contribute to library code and core services.
  • A good understanding of data architecture.
  • Experience with AWS.
  • Ability to write code in Python.
  • Experience working with large data sets in formats such as XML, JSON and CSV.
  • Data manipulation using ETL tools and databases (Alteryx, Matillion, Redshift/Snowflake, RDS, S3).
  • Knowledge of database design, including consolidating outputs and flowing them into current or new databases.
  • Experience with data visualisation tools such as Tableau, Power BI or QuickSight.
  • Experience with big data technologies.
  • Experience in data processing using traditional and distributed systems.
  • Experience designing data models.
  • Experience in SQL and NoSQL database management systems.
  • Bachelor's degree in Computer Science or a similar technical field of study, or equivalent practical experience.
  • Expertise in the design, creation and management of large datasets/data models.
  • Experience building and optimising logical data models and data pipelines while delivering high-quality data solutions that are testable and adhere to SLAs.
  • Experience in using various data design patterns and knowledge of when and when not to use them.
  • Experience of working in an Agile (Scrum) environment.

Nice to have

  • R or MATLAB.
  • Knowledge of the pharmaceutical industry, in particular the stages of pharmaceutical product development.
  • Familiarity with research, clinical trial or patent documents.

Tech stack

  • AWS: regular
  • Python: regular
  • XML: regular
  • ETL tools: regular

Published: 30.11.2020