All offersKrakówDevOpsSite Reliability Engineer
Site Reliability Engineer
DevOps
Scale IT Up

Site Reliability Engineer

Scale IT Up
Kraków
Type of work
Undetermined
Experience
Senior
Employment Type
B2B
Operating mode
Office

Tech stack

    Kubernetes
    advanced
    Docker
    advanced
    AWS / Cloud
    advanced
    Terraform
    advanced
    Ansible
    regular

Job description

Role: Site Reliability Engineer

Location: Cracow, centre
Type of contract: b2b (perm is an option as well)
Budget: 18000-27000 pln net + benefits


Key mentions: product company, newly established business in Poland ~ intelligent transport industry ~ great brands ~ SaaS
Be part of a new product engineering team, based in Krakow, building a ‘software as-a-service’ platform to create a digital market for global ground-transportation.

 Role overview:

The project is just about to start, together with your colleagues you will create the product almost entirely from scratch!

We are the first fully-integrated global, neutral, booking platform for regulated Taxis and PHVs is funded by RCI Bank and Services, the financial services provider for the Groupe Renault brands in the world and for the Nissan Group brands mainly in Europe.

We are looking for an talented Platform Reliability Engineer who is passionate about  automation and continuous innovation. You will play a key role in improving the automated application development & delivery lifecycle for software releases across the enterprise, for multiple applications across the various Portfolios.

 
Your main responsibilities will be:

  • Automate, automate, automate;
  • Solve problems relating to mission critical services and building automation to prevent problem recurrence; 
  • Influence and create new designs, architectures, standards and methods for distributed systems, with a strong bias towards ‘Everything as Code’;
  • Embrace the ‘feedback loop’ by engaging with various engineering teams to transform concepts, requirements for services and tools;
  • Engage in capacity planning, software performance analysis and system tuning, ensuring we’re ready for Production;
  • Work collaboratively with all participants in software development activities and be supportive of developers and testers as they set up their build dev/test environments.
  • Occasional on-call duties to provide application support, incident management, and troubleshooting;
 
How?

  • Simplify complex systems
  • Optimise MTTD/MTTR/MTTF
  • Automate SOPs
  • Actually, automate everything
  • SLOs, SLIs & reliability budgets
  • Smart alerting & monitoring
  • Dashboards - visualise production
  • Educate SDE teams in what ‘Production ready’ means
  • Embrace Chaos Engineering
  • Employ ‘best practices’ for platform operations
  • Review and highlight any potential security risk or fragility within the existing platforms, and ongoing developments
  • ChatOps enhancement & optimisation
  • DR testing


Requirements:

  • Prior experience on similar role, especially with regards to reliability and chaos engineering;
  • Commercial experience with Kubernetes and cloud systems  (AWS/Azure/GDP/OpenStack);
  • Experience with Docker, Terraform, Ansible;
  • Solid familiarity with platform engineering (experience in working with PaaS solutions would be an asset);
  • Helm;

  • High communication skills;
  • Excellent command of English;