With 900+ employees in Poland supporting over 45 clients, we leverage our holistic portfolio of capabilities in consulting, design, engineering, operations, and emerging technologies to help clients realize their boldest ambitions and build future-ready, sustainable businesses. It is the freedom provided to every individual at Wipro to learn, grow and create a career path that makes us an organization of opportunities beyond boundaries. Beyond boundaries of region, age, gender, ability, and routine. We invite you to be a part of this experience!
Title: Site Reliability Engineer - Hadoop/ Kafka
Work Mode: Hybrid (2 - 3 days a week)
Employment Type: Employment Contract (UoP) OR B2B
What you'll do:
- Carry out SRE duties for Big Data on various open-source platforms such as Hadoop, Kafka, Spark, and HBASE.
- Keep an eye on the platforms and adhere to runbooks/SOPs to manage platform and application problems.
- Familiarize yourself with the cluster maintenance processes and implement changes as per the documented installation and validation plans.
- Showcase robust troubleshooting and debugging skills, aiming to pinpoint and rectify the issue, while also offering advice on how to prevent such problems in the future.
- Conduct thorough root cause analysis of major production incidents, document for future reference, and put in place proactive measures to enhance system reliability.
- Automate routine tasks using scripts or automation tools to lessen manual work, decrease the chance of human errors, and boost system reliability.
What you need to have:
- At least 2-3 years of experience for a junior level role and 5+ for mid-level/senior level working as a (Hadoop/ Kafka) site reliability engineer.
- High level Knowledge on Hadoop platforms and core Hadoop components.
- Troubleshooting both Hadoop platform service, application problems and identifying the root cause.
- Writing ansible playbooks and automate manual tasks using Ansible, shell scripting and python scripting.
- Should be familiar with Unix/Linux system internals, networking, and distributed systems.