Data Engineer PySpark
Job Title: Data Engineer
Location: Krakow, Poland (Hybrid)
Employment Type: Contract
About Hirexa Solutions:
Hirexa Solutions is a leading player in the recruitment ecosystem across the United States, United Kingdom, Europe, and India. As the fastest-growing next-generation provider of technology talent, we empower our clients to become resourceful, achieve higher productivity, adopt agile structures, and effectively execute project deliverables.
Envisioned and co-founded by veterans of the Information Technology industry, our mission is to make recruitment efficient, flawless, and cost-effective. Our unwavering commitment to strategic investments in intelligent technology underscores our passion for people and our dedication to helping organizations realize their true potential.
Position Overview:
For one of our partners, we are seeking a Data Engineer who will be responsible for Python coding and for building data pipelines using Spark and PySpark. The ideal candidate will possess the necessary skills and experience to contribute to the success of our partner organization.
Experience:
5–10 years of hands-on experience in the data analytics space as a data engineer, including familiarity with ETL, DQ, DM, and reject and recycling concepts.
A significant portion of this experience should involve building data analytics solutions in a big data environment using Hadoop clusters or a cloud environment.
Technical Skills:
Extensive experience in building data pipelines using Spark, particularly PySpark.
Candidates must have a minimum of 3 years of hands-on experience coding PySpark applications using RDDs, DataFrames, and Datasets, and NOT Spark SQL.
Candidates should have developed numerous Spark applications for various use cases involving the processing of large volumes of data, applied performance tuning, and worked extensively on complex transformations such as group and window operations.
Experience participating in a PySpark coding hackathon.
Please apply only if you are confident in writing decent PySpark code during the interview.
Strong proficiency in Python.
Write clean, efficient, and reusable Python code
Identify, troubleshoot, and fix bugs in programs to ensure code quality
Create scripts and tools to automate tasks and processes
Note: This role requires advanced Python coding skills. Candidates will be required to demonstrate their Python skills during the interview.
Familiarity or exposure to tools such as Airflow, Databricks, and Azure is a plus. The primary focus is on Spark, PySpark, and data engineering.
How to Apply:
If you are interested in this opportunity, please submit your resume. We look forward to hearing from you!