VirtusLab
Join the VLteam and elevate your career to new heights! Help us shape the future of software engineering with a team that values flexibility, fosters an open-minded culture, and delivers outstanding solutions. We have extensive expertise in Data Engineering & Data Science, Cloud-Native Services, Reactive Systems, Dev Tooling and Frontend. We are also worldwide experts in the Scala language, officially supporting its development and tooling.
We are #VLteam, tech enthusiasts constantly striving for growth. The team is our foundation, which is why we care most about a friendly atmosphere, plenty of self-development opportunities and good working conditions. Trust and autonomy are two essential qualities that drive our performance. We simply believe in the idea of “measuring outcomes, not hours”. Join us & see for yourself!
About the role
Join us to drive business innovation with production-grade ML pipelines. Become a key member of our team as you dive into Big Data, utilise Azure for cloud computing, and deploy solutions on edge devices. Collaborate with Data Scientists on impactful AI-powered projects.
Our projects are the tip of the iceberg – expect a landscape of more intriguing and diverse challenges ahead.
One of the available projects: T-Rex
Project Scope
In collaboration with a prominent UK-based retailer's data science team, we create solutions for personalisation challenges, such as web-page product recommendations. The main objective is to deliver well-engineered components that enable work on modelling and experimentation in a hybrid-cloud environment. The ultimate goal is to provide an end-to-end experience that stretches from raw data, through training different models, to creating APIs that deliver self-improving models.
We:
- Work on components to support Big Data processing on an on-prem Hadoop cluster.
- Ensure that high-quality code is delivered.
- Choose the best architecture and tools for the business requirement.
- Push the data to the cloud and select the best way to store and process it.
- Build robust code and architecture to allow easy productisation of data scientists' models with minimal time and effort.
- Enhance monitoring capabilities and reliability.
Tech Stack
- Python (3.7+) with complete typing
- PySpark: base for our ETL
- Azure (incl. Azure ML): model training and serving
- TensorFlow, pandas, scikit-learn, SciPy, NLTK
- Jenkins, Azure DevOps, Terraform, Git @ GitHub
Project Challenges
Personalisation models use tens of terabytes of input data. We leverage the Hadoop on-prem cluster to extract significant business features and transfer them to the cloud. We adopted the hybrid-cloud model to iterate faster on the business use cases given to us by data scientists.
Team
The team consists of a Technical Lead and 3-4 Engineers in Poland, who collaborate closely with the client's UK-based Data Science Unit.
For more projects, check out our website.
What we expect in general
Don’t worry if you don’t meet all the requirements. What matters most is your passion and willingness to develop. Moreover, B2B does not have to be the only form of cooperation. Apply and find out!
What's on offer?