All offersPoznańData(Senior) Data Scientist
(Senior) Data Scientist
Data
Pearson

(Senior) Data Scientist

Pearson
Poznań
Type of work
Undetermined
Experience
Mid
Employment Type
B2B, Permanent
Operating mode
Office
Pearson

Pearson

At Pearson, we create learning experiences designed for real-life impact. We build learning platforms, digital content and inventive solutions to support students all around the world. We are one of the 10 most innovative education companies of 2022.

Company profile

Tech stack

    statistical modelling
    master
    collaboration skills
    advanced
    written and spoken English
    advanced
    ML theory
    regular
    Python for ML
    regular
    Relational Databases
    regular
    Git-based code review
    regular
    cloud data processing
    nice to have
    Psychometric models
    nice to have

Job description

Online interview

About Pearson 


  • Learning is the most powerful force for change in the world. More than 20,000 Pearson employees deliver our products and services in nearly 200 countries, all working towards a common purpose – to help everyone achieve their potential through learning. We do that by providing high quality, digital content and learning experiences, as well as assessments and qualifications that help people build their skills and grow with the world around them. We are the global experts in learning. See the 5 types of products & services that we deliver to tens of millions of learners annually.
  • The global learning market is vast - at around $7 trillion today, growing to over $10 trillion by 2030. This all adds up to huge momentum in our industry, and a great opportunity for Pearson to innovate and scale to meet the growing and changing demands of consumers globally. The most interesting thing about that extraordinary $7 trillion learning market? It is currently only 3% digital! And that’s a huge opportunity for us to grow our business. See how we're going to do that.

About the Data Science team in Poznań


  • The Data Science team at Pearson Poznań office is a part of an international R&D unit that creates Intelligent Learning Capabilities (i.e. systems that use methods from the field of Data Science, incl. Artificial Intelligence, to facilitate the process of human learning). The Poznań team consists of Data Scientists, Research Engineers, and Technical Project Managers.
  • Our role is to:
    • Design, build, and continuously improve customer-facing Intelligent Learning Capabilities​ across Pearson.
    • Provide technical leadership in the application of Data Science & Machine Learning​ at Pearson.
  • We specialise in two main types of intelligent capabilities:
    • Adaptive modelling & diagnostic capabilities that fast-track learners through a course of learning and identify struggling learners.
    • Natural language processing-based assessment capabilities that evaluate a learner's soft & hard skills through text or speech.
  • In our work, we often innovate around implementing psychometric ideas using the latest machine learning methods and prototyping innovative natural language processing methods. See more of our work.
  • We proudly support the Polish AI community by sponsoring conferences (ML in PL, Why R?, GHOST Day, Cloud & Datacenter Day), hosting meetups with the Women in Machine Learning & Data Science organisation, and hosting our own series of open talks & workshops.

Role overview


  • We are looking for a (Senior) Data Scientist to join the Product Data Science team in Poznań
  • The position provides the opportunity to engage in cutting-edge research & development in a highly-collaborative environment. You will prototype and implement data-driven products for that will have an impact on millions of learners around the globe. 
  • This position requires a competent professional with a broad set of skills around statistical reasoning and modelling for educational measurement. Ideally, you should have a strong statistical background and experience with various types of ML algorithms. Most importantly, however, you need to be self-sufficient and comfortable with balancing the uncertainty, complexity and ambiguity of research with the need for predictable, incremental software development. 

Role details


High-level responsibilities
  • Conducting Research & Development activities in the field of data-driven educational products and solutions
  • Designing and implementing prototype products and solutions (esp. using machine learning algorithms), in collaboration with Engineering teams
  • Designing frameworks for monitoring products and solutions on production (in collaboration with Engineering teams); evaluating results for improvement
  • Creating and conducting internal training
  • Creating technical documentation, incl. patent documentation
  • Sharing work results internally and externally (publications, conferences, meet-ups)
  • Staying up to date with state-of-the-art methods for statistical modelling for educational measurement
  • Providing input into team vision, goals, and processes
  • Supervising or leading colleagues with less experience

Key qualifications

  • At least 3 years of experience in building production-ready machine learning models to solve a diverse set of business problems (preferably 5+ years for the senior role)
  • Advanced knowledge of statistical modelling and machine learning theory 
  • Strong mathematical background 
  • Proficiency in Python for machine learning, e.g. pandas, numpy, scikit-learn, NLTK, spaCy, Hugging Face 
  • Proficiency in SQL and relational databases 
  • Experience with Git-based code review 
  • Proficiency in Linux, Unix or macOS 
  • Experience with best practices for software engineering 
  • Experience with working in a diverse, remote, asynchronous team 
  • Collaboration skills, e.g. being able to give (and receive) feedback 
  • At least intermediate presentation and mentoring skills 
  • Proficiency in written and spoken English 

Nice to have 

  • Familiarity with psychometric models 
  • Proficiency in deep learning frameworks, e.g. TensorFlow or PyTorch 
  • Hands-on experience in using state-of-the-art Natural Language Processing models
  • Experience with:  
    • Docker 
    • cloud data processing, preferably on AWS (S3, EC2, Athena, SageMaker) and Snowflake 
    • GPGPU 
    • reinforcement learning 

Benefits 


  • Professional development
    • We understand that people learn best by doing. That's why if you work with us, you can take more time on a project to complete it using a new technology that you'd like to learn, or devote time to a side project.
    • However, we do understand the need for formal training. Your manager will regularly discuss your development goals with you, and provide you with online learning resources (e.g. O'Reilly Safari, Coursera, Linux Academy) and formal training (e.g. conferences).
    • You'll also get free access to Pearson digital products and services (over 22,500 eTextbooks and assessments), as well as a discount for buying print & digital products.
  • Work facilities & culture
    • Although we have a conveniently-located modern office, we have a culture of remote asynchronous work. We believe that creative work doesn't happen only at your office desk from 9 am to 5 pm. While we prefer to hire in Poznań, we're also open to semi/mostly-remote candidates (e.g. living in another city and visiting the office once per month).
    • If you do happen to visit the office, you'll find snacks stocked in our pantry (e.g. muesli, fruits, cookies).
  • Perks
    • Private health care
    • Accident insurance
    • MultiSport Card
    • Access to MyBenefit (redeem points for cinema tickets, shopping discounts, etc)
    • Volunteer time off