At the Conversational Language Intelligence team, our goal is to enhance both customer facing and internal solutions by leveraging cutting-edge NLP models. We use advanced transformers models to solve complex business problems, from automating customer support to providing internal search systems for the company. To do this we develop a wide range of solutions from standard classifiers to large retrieval models. In our work we utilize the capabilities of Large Language Models for synthetic data generation and Retrieval-Augmented Generation (RAG) systems, ensuring robust and scalable solutions.
Why is it worth working with us?
- Being a part of Machine Learning Research team, you will be responsible for bringing to production research solutions for Allegro
- While working on a new problem, you will explore it in depth and conduct literature review, looking for the most promising techniques for a given problem
- You will be responsible for the preparation of the production-grade machine learning models, supporting the development team for a correctly functioning production model and meeting technical and performance requirements
- You will support other teams in the implementation of tasks requiring the use of ML models. Your support will be needed both at the technical (e.g., what architecture will be appropriate for the domain) and best-practices level (e.g., building data sets, modeling, metrics, implementation of the ML-based solutions to the production)
- To apply state-of-the-art solutions, you will stay up to date with the scientific progress. You will deepen your knowledge by reading the latest papers in your domain, sharing the knowledge with other team members of the research teams operating in Allegro. You will participate in scientific conferences, visiting venues where the latest discoveries are presented
- You will have the possibility to share the results of your research in the scientific community, and by taking part in the scientific conferences (oral presentations, poster sessions). You will develop your scientific career, as well as Allegro's presence in the science community
- In your daily work you will expand your knowledge by cooperating with people who have hands-on experience in implementation of the ML models at scale unprecedented anywhere else in Poland
What we offer:
- Well-located office (with fully equipped kitchens and bicycle parking facilities) and excellent working tools (height-adjustable desks, interactive conference rooms)
- Annual bonus up to 10% of the annual salary gross (depending on your annual assessment and the company's results)
- A wide selection of fringe benefits in a cafeteria plan – you choose what you like (e.g. medical, sports or lunch packages, insurance, purchase vouchers)
- English classes that we pay for related to the specific nature of your job
- Working in a team you can always count on — we have on board top-class specialists and experts in their areas of expertise
- A high degree of autonomy in terms of organizing your team’s work; we encourage you to develop continuously and try out new things
- Hackathons, team tourism, training budget and an internal educational platform, MindUp (including training courses on work organization, means of communications, motivation to work and various technologies and subject-matter issues)
We are looking for people who:
- Have a master's or PhD in machine learning, mathematics, computer science, statistics or related fields
- Know the methodology of conducting scientific research and the use of iterative process of conducting experiments
- Have a good knowledge of machine learning techniques in the field of natural language processing (transformer architecture, tokenizers, BERT models, GPT models, etc.)
- Have experience in working with real data that deviate from the standard, well-developed collections used in research
- Know Python and libraries necessary to work with model development (PyTorch, Transformers, Pandas, Numpy, etc.)
The following are also a plus
- Prior experience in running large-scale computation on cloud platform (GCP, AWS or Azure)
- Prior experience in using LLMs for synthetic data generation/solving business problems