Data Engineer (Spark)
GetInData | Part of Xebia

Salary: 4 630 - 6 730 USD net/month (B2B)
Type of work: Full-time
Experience: Mid
Employment type: B2B
Operating mode: Remote

Tech stack

    English: advanced
    Java: regular
    Apache Spark: regular
    Python: regular
    Microsoft SQL: regular

Job description

Online interview

Who are GetInData | Part of Xebia? 👩‍💻 👨‍💻 

GetInData | Part of Xebia is a leading data company working for international Clients, delivering innovative projects related to Data, AI, Cloud, Analytics, ML/LLM, and GenAI. The company was founded in 2014 by data engineers and today brings together 120 Data & AI experts. Our Clients are both fast-growing scaleups and large corporations that are industry leaders. In 2022, we joined forces with Xebia Group to broaden our horizons and bring new international opportunities.


What about the projects we work with?

We run a variety of projects in which our sweepmasters can excel: Advanced Analytics, Data Platforms, Streaming Analytics Platforms, Machine Learning Models, Generative AI, and more. We like working with top technologies and open-source solutions for Data & AI and ML/AI. In our portfolio, you can find Clients from many industries, e.g., media, e-commerce, retail, fintech, banking, and telcos, such as Truecaller, Spotify, ING, Acast, Volt, Play, and Allegro. You can read some customer stories here.


What else do we do besides working on projects?

We run many knowledge-sharing initiatives, such as Guilds and Labs. We build a community around Data & AI through our conference Big Data Technology Warsaw Summit, the meetup Warsaw Data Tech Talks, the Radio Data podcast, and the DATA Pill newsletter.


The Data & AI projects we run and the company's philosophy of sharing knowledge and ideas in this field make GetInData | Part of Xebia not only a great place to work but also a place that gives you a real opportunity to boost your career.

If you want to stay up to date with our latest news, please follow our LinkedIn profile.


About the role 💻

We are developing a modern platform for the collection and analysis of lineage metadata. As a Data Engineer, you will work closely with the GetInData and client teams on building and maintaining specifications and integrations for tools like Apache Airflow, Flink, or Spark, as well as developing the Marquez service for the collection, aggregation, and visualization of lineage metadata. Work happens in public GitHub environments, with discussions open to the rest of the world, which keeps the number of meetings to a minimum.


Responsibilities

  • Working with the GetInData and client teams and the open-source community to identify requirements and implement them
  • Writing simple, understandable, testable, and efficient code with a focus on solid technical stability and great user experience
  • Collaborating with stakeholders to understand project functional requirements and objectives from a data processing perspective
  • Ensuring that the data processing layer complies with industry standards and regulations on security and data privacy
  • Defining and implementing security measures according to best practices
  • Ensuring compliance with industry standards and regulations


Requirements

  • Very good knowledge of Java or Python, or both
  • Experience working with databases or data processing systems
  • Good knowledge of SQL
  • Experience working with Spark
  • Knowledge of the big data landscape
  • Ability to work independently and focus on impactful features
  • Strong B2/C1 English - ability to express complex ideas both in writing and in speech
  • Experience as a contributor to an open-source community is a plus
  • Ability to actively participate in or lead discussions with clients to identify and assess concrete and ambitious avenues for improvement


We offer 👩‍💻

  • Salary: 110-160 PLN net + VAT/h B2B (depending on knowledge and experience)
  • 100% remote work
  • Flexible working hours
  • Possibility to work from the office located in the heart of Warsaw
  • Opportunity to learn and develop with the best Big Data specialists in Poland
  • International projects
  • Possibility of conducting workshops and training
  • Certifications
  • Co-financed sports card
  • Co-financed health care
  • All equipment needed for work