#1 Job Board for tech industry in Europe

  • Job offers
  • All offersBiałystokPythonSenior Data Scientist
    Senior Data Scientist
    Python
    Avenga

    Senior Data Scientist

    Avenga
    Białystok
    Type of work
    Full-time
    Experience
    Senior
    Employment Type
    B2B
    Operating mode
    Remote
    Avenga

    Avenga

    We are a team of IT and digital transformation experts. Strategic consulting, customer experience service, software development, and outsourcing of top-class IT specialists is what we do (and more!).

    Company profile

    Tech stack

      RAG pipelines

      advanced

      Python

      advanced

      NLP

      regular

      LLM

      regular

      Deep Learning Models

      regular

    Job description

    Online interview

    Senior Data Scientist, Generative AI Products, Digital Transformation


    FTE: 1

    Duration: 6 months

    Working hours: 8 hours with so-called overlap from 16:00 -20:00 CET.

    Experience: senior

    English: B2/C1/C2

    Remote work: YES

    Recruitment process: Technical verification (1h) and Meeting with future manager


    Position description

    As a Senior Data Scientist, you will collaborate with the Data Science and Machine Learning team and will create data science, machine learning, and AI solutions to better address the needs of our constituents (students, alumni, faculty, researchers, staff, and community at large). You will have the chance to guide and continuously improve the ways in which we engage, educate, and empower people around the world, combining the best of human touch and technology scale. You will experiment with everything from the latest AI algorithms and techniques to blended and immersive environments, multi-modal and varied-form content, and the most innovative research and teaching methodologies. You will be highly influential in advancing our LLM applications and guide teams towards impactful and ethical AI. We seek an expert who is eager to grow and disseminate GenAI model expertise across the organization.


    Role description

    In this role, you will translate the needs of our cross-functional stakeholders into user-facing applications that leverage NLP techniques and large language models (LLMs). As a Sr. Data Scientist on our GenAI applications team, you will work on products like conversational search interfaces, chatbots, text summarizers, recommender engines, and more based on the needs of the constituents. You will partner with Product Managers, Machine Learning Engineers, Cloud Platform Engineers, and cross-functional partners to develop production-grade algorithms. Your innovations will drive value creation through personalized engagement, expanded reach, and experimental ways of learning that will continue the client leadership in education, business, and societal impact. Architect the overall framework and infrastructure for GenAI products like search interfaces, bots, summarizers, etc. Develop and implement techniques to optimize model performance to meet specific product goals Collaborate closely with product management and engineering leads to align on technical roadmap. Guide engineering teams to effectively leverage LLM capabilities in product implementations Establish protocols and systems for building fair, accountable and transparent LLM-based applications.


    Lead efforts to proactively assess and mitigate risks due to model biases or failures Implement robust feedback pipelines, monitoring and corrections to ensure model safety Design and oversee curation of high-quality datasets tailored for LLM training for each product. Build data science pipelines from feature generation, data visualization and models evaluation; design the solution, build initial code and provide documentation with ways of working to maximize time to value and re-usability.


    Communicate clearly and effectively to technical and non-technical audiences, verbally and visually, to create understanding, engagement, and buy-in. Contribute novel research and analyses to leading academic conferences and journals. Identify trends and opportunities to drive innovation, both in what we do and how we do it; evaluate new data science, machine learning, and AI technologies and tools that can boost team performance, innovation and business value. Proactively analyze latest developments in large language models to deeply understand model capabilities, limitations, and best practices. Develop techniques to continually improve language understanding and model training Embody the values and passions that characterize Client, with empathy to engage with colleagues from a wide range of backgrounds. Mentor and develop junior data scientists in state-of-the-art GenAI methods Set technical vision and lead initiatives to accelerate product impact through cutting-edge LLM innovations


    Qualifications:

    • Minimum of seven years’ post-secondary education or relevant work experience
    • Minimum of three years’ experience in developing machine learning models with a track record of creating meaningful business impact and working with multiple stakeholders.
    • Experience with production RAG pipelines and agentic information retrieval and search systems, with the ability to write production level code.
    • Minimum of five years’ experience with Python.
    • Minimum of three years' experience building production NLP and deep learning models using PyTorch/Tensorflow, along with using large language model architectures (BERT, GPT-3 etc.)
    • Experience building advanced workflows such as retrieval augmented generation, model chaining, dynamic prompting, PEFT/SFT, etc. using Langchain and similar tools
    • Experience establishing model guardrails and developing bias detection and mitigation techniques for AI applications
    • Proficiency with various prompting techniques, with a clear understanding of tradeoffs between prompting and fine tuning