Data Scientist

Data

Data Scientist

Data
Centrum, Warsaw

emagine Polska

Full-time
Any
Senior
Remote

Job description

  • Pharma

  • Start ASAP/to determinate

  • 100% remote

  • B2B up to 50 e/h netto+VAT

The role of the Data Scientist in RAG & Document Intelligence focuses on designing and implementing AI solutions that enhance the accessibility of knowledge within an organization, transforming complex enterprise documents into actionable insights.

Main Responsibilities:

  • Optimize RAG pipelines by experimenting with various strategies for parsing, chunking, and retrieval to improve answer quality and reduce errors.

  • Extract structured information from unstructured content, ensuring high-quality input for processing.

  • Design and conduct experiments to evaluate models based on accuracy, latency, and cost, and derive insights from the data.

  • Implement NLP techniques to solve real-world problems, enhancing user experience through effective query handling.

  • Monitor and evaluate the performance of AI models and make necessary adjustments for cost-efficiency.

Key Requirements:

  • Strong Python skills with experience in machine learning and generative AI workflows.

  • Solid understanding of NLP principles like text representation and semantic search.

  • Experience in designing and optimizing RAG pipelines for unstructured documents.

  • Proficiency with document parsing and handling diverse formats.

  • Familiarity with evaluation frameworks for LLMs and defining specific quality metrics.

  • Knowledge of multi-agent AI frameworks.

  • Experience with vector databases and cloud services (Azure/AWS).

  • Strong analytical skills with an experimental approach to problem solving.

  • Fluent in English (written and spoken).

Nice to Have:

  • Hands-on experience with Databricks GenAI products.

  • API development and integration skills.

  • Familiarity with Model Context Protocol.

  • Knowledge of knowledge management and taxonomy design.

Other Details:

  • Impact Level: Greenfield AI Initiative

  • Collaboration: Work with various business units across the organization.

Tech stack

    English

    B1

    Microsoft Azure

    advanced

    Copywriting (content)

    advanced

    Machine Learning (ML)

    advanced

    Cloud

    advanced

    Microsoft Forms

    advanced

    User Experience (UX)

    advanced

    Artificial Intelligence (AI)

    advanced

    Amazon Web Services (AWS)

    advanced

    Python

    advanced

    API (Application Programming Interface)

    advanced

Office location

Data Scientist

Summary of the offer

Data Scientist

Centrum, Warsaw
emagine Polska
By applying, I consent to the processing of my personal data for the purpose of conducting the recruitment process. Informujemy, że administratorem danych jest emagine z siedzibą w Warszawie, ul.Domaniewskiej 39A (dalej jako "administrator"). Masz pr... MoreThis site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.