Data Scientist
Data
Inuits

Kraków
Type of work: Undetermined
Experience: Mid
Employment Type: B2B, Permanent
Operating mode: Remote

We specialize in designing and building augmented teams of highly skilled professionals for long-term collaborations. Our approach is rooted in human-to-human interactions and thoughtful actions, resulting in customer intimacy and the organic development of success stories within our extended team.

Tech stack

    Python: advanced
    XML: advanced
    Airflow: regular
    Scala: regular
    Spark: regular

Job description

Online interview
Inuits? Who we are: 

We are Open Source enthusiasts providing digital solutions for clients in all sectors. We build innovative, tailor-made solutions using cutting-edge technologies. Our people are located in Belgium, the Netherlands, the Czech Republic, Poland and Ukraine. Our clients are private companies and public agencies.

For a project with our client, 1010data, we are looking for Data Scientists to join our Data Science team in Kraków.

1010data travels at the speed of thought to make Big Data discovery easy; they power sub-second responses to analyses run on billions of rows of data. 1010data is defining the way the world interacts with data. It is an essential tool for more than 700 of the world’s top retail, manufacturing, telecom, government, and financial services enterprises, including Shell, Nespresso, Dollar General, P&G, and Rite Aid. The 1010data platform is a highly differentiated product that is becoming the industry standard for Big Data Discovery and Data Sharing. With more than 30 trillion rows of data in a private cloud, 1010data is designed to scale to the largest volumes of granular data, the most diverse and varied data sets, and the most complex advanced analytics, all while delivering lightning-quick system performance.

We are seeking Data Scientists with solid skills in data analysis to create large-scale analytical solutions (data products and applications) to client-driven business problems. At the core of 1010data’s technology stack is a fast parallel processing database that powers all of our product offerings. Your work will involve utilising this core capability in designing high-performance data pipelines and the corresponding operational processes. You will be responsible for conducting analysis to further the features, functionality, and data quality of the analytical solutions.

What you will take on:

  • Become an expert user of the 1010data XML Macro Language to analyse and process massive datasets;
  • Help solve real-world analytical problems by translating them into ad-hoc analyses to produce solid findings and share them with your team;
  • Develop an understanding of the domain toward which 1010data’s products are geared and use your knowledge of the database and platform to solve data-driven problems;
  • Maintain production data pipelines to produce our well-known products with high data quality standards;
  • Find and implement optimisations, improvements, and design modifications for data engineering challenges, using primarily the 1010data XML Macro Language along with other technologies like Airflow, Scala, Spark, and Python;
  • Collaborate with the Data Engineering and Data Quality teams by conducting root cause analysis and taking corrective measures to improve the quality of data products;
  • Participate in Agile development sprints and share progress updates regularly;
  • Support clients, sales, and marketing teams as needed.

What you already have:

Education & Experience

  • BS/MS in a highly analytical discipline (Computer Science, Physics, Mathematics, Econometrics) or equivalent;
  • 1-3 years of professional experience in data analysis or practical experience building analytical products.

Skills

  • Database experience (understanding of database structures and query languages such as SQL);
  • Demonstrated experience with scripting languages and statistical software (R, SAS, SPSS, MATLAB);
  • Solid understanding of statistical concepts.

Desired

  • Experience developing data products using consumer spend data;
  • Experience working with parallel processing frameworks like Spark;
  • Experience constructing data pipelines using Airflow;
  • Background in vector/matrix arithmetic is a plus;
  • Experience with list/vector-based languages.

Communication and collaboration

  • Have strong, positive interpersonal skills;
  • Communicate clearly, consider options presented by others, and reach an informed, balanced technical opinion;
  • Create clear, concise memos, summaries, design documentation, and presentations.

What are the benefits?

  • sports subscription (MultiSport);
  • training budget;
  • private healthcare (Luxmed);
  • flat structure; 
  • small, international teams;
  • international projects;
  • many, many more ;) 

As the client is based in the US, the role requires alignment with the EST time zone.