All offersWarszawaPythonData Engineer for Voice Assistant
Data Engineer for Voice Assistant
new
Python
Samsung R&D Institute Poland

Data Engineer for Voice Assistant

Samsung R&D Institute Poland
Warszawa
Type of work
Full-time
Experience
Mid
Employment Type
Permanent
Operating mode
Hybrid
Samsung R&D Institute Poland

Samsung R&D Institute Poland

Samsung R&D Institute Poland is one of the largest research and development centers in Poland. Our offices are located in Warsaw and Kraków. It is there that the high-quality software for Samsung Electronics products is created. The work of our engineers affects the future of operations, among other flagship models of smartphones and TV sets, mobile networks, multimedia and intelligent buildings.

Company profile

Tech stack

    PostgreSQL
    master
    Linux
    advanced
    Bash
    advanced
    Git
    advanced
    Jenkins
    advanced
    Docker
    advanced
    Voila
    advanced

Job description

Online interview

Data Engineer for Voice Assistant

About our Team

We invite you to the one of the largest speech and language processing teams in Europe. We work closely with other R&D teams to develop and test our next-generation personal Intelligent Assistant. In our lab engineers, researchers, and linguists work together on innovative products for the multilingual European market. We define the way users access, explore and interact with devices, knowledge, information, and services. With us you have unique opportunity to work on product available on a wide range of devices and used by millions of users.

 

Role and Responsibilities

  • Development and maintenance of dashboards and internal web services to present, access, annotate text or visualize usage data related to Voice Assistant,
  • Management of Linux servers used for data acquisition and processing,
  • Development and maintenance of data processing pipelines used for language analytics tasks,
  • Automation of repetitive tasks for Natural Language Processing (NLP), such as: retrieval of text data, text corpora management, text corpora annotation,
  • Exploration of available text data, to create meaningful reports (e.g. trends report, usage patterns report) and define metrics (e.g. end to end success rate) for other development teams,
  • Significant influence on the direction of work in the team, opportunity to participate in creation of project proposals, research and patent applications (especially in the field of data processing and analytics),Significant impact on technological stack: this is R&D team and we can decide what technologies we use more freely than regular development teams.

 

 Technologies in use

  • Python,
  • DevOps (Linux, Bash, git, Jenkins, Docker, Openstack, nginx, Ansible)
  • Data Engineering & Data Science (variety of libraries for training & test data collection, data augmentation, text corpus processing),
  • Databases (PostgreSQL, InfluxDB)
  • Data Visualization and dashboarding tools (Voila, Dash, Grafana, Flask, Jupyter, Python visualization stack)

 

Skills and Qualifications 

  • Bachelor's or master's degree in Computer Science, Mathematics, Telecommunications or related fields.
  • Proficiency in Python.
  • Practical knowledge of the Linux environment and Bash scripting.
  • Experience in Git, Github, Jenkins, Grafana, Docker or similar tools.
  • Knowledge of English at a level that allows for easy communication.
  • Creativity, ability to adapt knowledge to create innovation and open-mind is a plus.


Nice to have

  • Practical knowledge in Data Engineering and/or Data Science.
  • Experience in databases (especially Postgresql, InfluxDB).
  • Experience in any subdomain of Natural Language Processing (text classification, word & sentence embeddings, named entity recognition, information extraction, evaluation of machine learning models, sentiment analysis, deep learning methods).
  • Experience in human-computer interaction application development text or voice (Chatbot development, voice assistant, messenger bot, Alexa Skills development, Google Assistant Actions development etc.).
  • Ability to use data visualization and dashboarding tools in Python in practice.

 

We offer

  • Team:
  • Friendly working atmosphere
  • Wide range of trainings (technical / soft-skills / e-learning platform)
  • Opportunity to work in multiple projects
  • Multidisciplinary and multicultural team 
  • Working with the latest technologies on the market
  • Monthly integration budget
  • Possibility to attend local and foreign conferences
  • Opportunity to participate in science research (scientific papers, project proposals, patents applications, development of own side-projects)


  • Equipment:
  • Laptop and PC workstation + 2 external monitors
  • OS: Windows, Linux


  • Benefits:
  • Private medical care (possibility to add family members)
  • Multisport card
  • Life insurance
  • Lunch card
  • A partial reimbursement of the cost of an English language course
  • Possibility to learn Korean for free
  • Variety of discounts (Samsung products, theaters, restaurants)
  • Unlimited free access to Copernicus Science Center for you and your friends
  • Possibility to test new Samsung products


  • Location:
  • Office in Warsaw Spire near metro station / Office in Cracow Quattro Business Park
  • Hybrid work system (3 days per week from the office)