Salary offered: 15 000 - 19 000 PLN gross per month (contract of employment)
Main purpose of the role & responsibilities in the business:
As a Senior Data Engineer, you will be responsible for designing and developing data processing and data persistence software components for solutions that handle data at scale.
Working in agile teams, Lead Data Engineers provide strong development leadership for team members and take responsibility for the quality of the codebase and for how well it matches user needs. You will work within a multi-skilled agile team to design and develop large-scale data processing software to meet user needs in demanding production environments.
Most of our work comes through repeat business and direct referrals, which comes down to the quality of our people. The success of our Data Engineering teams means that customers are bringing us an increasing number of exciting data projects using cutting-edge technology to solve real-world problems.
We are seeking more high-calibre people to join our Data & Analytics capability, where you will grow and contribute to industry-leading technical expertise.
Your responsibilities will include:
- Working to develop data processing software primarily for deployment in Big Data technologies. The role encompasses the full software lifecycle including design, code, test and defect resolution.
- Working with architects to ensure the software supports non-functional needs.
- Collaborating with colleagues to resolve implementation challenges and ensure code quality and maintainability remain high, leading by example in code quality.
- Working with operations teams to ensure operational readiness.
- Advising customers and managers on the estimated effort and technical implications of user stories and user journeys.
- Coaching and mentoring more junior technical staff.
Minimum requirements:
- Software development experience with distributed data processing technologies.
- Experience in leading the development of substantial components for large-scale data processing solutions, taking responsibility for non-functional needs of ETL/ELT data processing pipelines such as robustness, performance and security. ‘Development’ incorporates design, code, test, defect resolution and operational readiness, and includes setting the standards for these activities.
- Software development experience in open source technologies including JVM languages, Python, R, Ruby, Scala.
- Thorough understanding of ETL/ELT data processing pipelines.
- Experience in successfully deploying multiple large-scale data processing solutions.
- Thorough knowledge of RDBMS and NoSQL data stores and of working with data in a variety of structures including JSON and XML.
- Has coached and mentored junior and experienced technical staff.
- Able to make effective decisions within fast-moving Agile delivery.
- Able to write testable code, with experience of automated unit testing.
- Has experience using continuous integration tools and techniques (e.g. Jenkins) and is aware of best practice.
- Able to work collaboratively with others using version control (e.g. Git).
- Able to prioritise their own time and activities.
- Able to understand UML diagrams.
Desirable skills:
- Software development experience with Cloudera’s distribution of Apache Hadoop.
- Experience in performance tuning.
- Experience in data visualisation and complex data transformations.
- Experience with streaming and event-processing architectures including technologies such as Kafka and change-data-capture (CDC) products.
- Expertise in continuous improvement and sharing input on data best practice.
- Experience in unstructured text processing including Natural Language Processing.
- Understanding of data redaction implementations.
- Participation in the development and/or technology communities.
- Open source contributor.
What we offer:
- Company share scheme.
- Private medical insurance with Medicover.
- Group life insurance.
- 3000 pounds under the refer-a-friend scheme.
- Reimbursement of the costs of purchasing a chair and desk to be used in the home office.
- Access to Pluralsight online training.
- Core hours (starting work between 8 am and 10 am).
- 1.5% participation in PPK.