About our Team
We invite you to the one of the largest speech and language processing teams in Europe.
We deliver cutting-edge AI solutions deployed on top Samsung products. We participate in development of the Galaxy AI services available on all our phones and other devices.
You will be working with a team that develops next-generation Machine Translation tools. In our lab engineers, researchers, and linguists work together on innovative products dealing with languages from all over the world. With us you have unique opportunity to work on products available on a wide range of devices and used by millions of users worldwide.
Role and Responsibilities
- Management of Linux servers used for data processing and model training,
- Management of services used by the team (translation engines, search engines, Large Language Model AI assistant service etc.)
- Communication with IT team for smooth server operation (server maintenance, firewall exceptions)
- External cloud management
- Development and maintenance of data processing pipelines used for data preparation tasks
- Automation of repetitive tasks for Natural Language Processing (NLP), such as: retrieval of text data, text corpora management, model validation
- Significant impact on technological stack: this is R&D team and we can decide what technologies we use more freely than regular development teams
Technologies in use
- OS: Ubuntu (22.04 LTS, etc), Windows 11
- GPU computing: CUDA libraries, distributed processing
- Virtualization engines: docker, kubernetes
- Workload management: slurm , kubeflow
- System administration: Ansible, shell scripts
- Services: nginx, elasticsearch, mongodb, mysql, huggingface inference endpoints
- Text manipulation utilities: grep, sed, find, wc, sort, awk, cut, paste, etc.
- Programming languages: Python, C++, Java, JavaScript (react), kotlin
- Data Engineering & Data Science (variety of libraries for training & test data collection, data augmentation, text corpus processing)
Skills and Qualifications
- Bachelor's or master's degree in Computer Science, Mathematics, Physics, Telecommunications or related fields
- Working proficiency in Python and shell script
- Practical knowledge of Linux administration (including user access rights, backups, network file systems, cron tasks, etc.)
- Knowledge of Linux’s text manipulation utilities
- Regular expressions in practice
- Experience with docker, python virtualenvs
- Code/configuration versioning (git, github) as a part of daily routine
- Knowledge of English at a level that allows for easy communication
- Ability to write easy to read documentation in English (installation and usage instructions)
- Ability to focus on details, follow good practices, responsibility and reliability
- We are looking for a communicative, friendly person who is always willing to help
Nice to have
- Practical knowledge in Data Engineering and/or Data Science.
- Experience with:
o Slurm administration
o Ansible
o mongodb database
o processing of large text files
o distributed processing
- Creativity, ability to adapt knowledge to create innovation and open mind is a plus
We offer
- Team:
- Friendly working atmosphere
- Wide range of trainings (technical / soft-skills / e-learning platform)
- Opportunity to work in multiple projects
- Multidisciplinary team
- Working with the latest technologies on the market
- Monthly integration budget
- Possibility to attend conferences
- Equipment:
- Laptop and PC workstation + 2 external monitors
- OS: Windows, Linux
- Benefits:
- Private medical care (possibility to add family members)
- Multisport card
- Life insurance
- Lunch card
- A partial reimbursement of the cost of an English language course
- Possibility to learn Korean for free
- Variety of discounts (Samsung products, theaters, restaurants)
- Unlimited free access to Copernicus Science Center for you and your friends
- Possibility to test new Samsung products
- Location:
- Office in Warsaw Spire near metro station
- Hybrid work model - 3 days from the office per week