Site Reliability Manager
DevOps
Zoolatech

Wrocław
Type of work: Undetermined
Experience: Senior
Employment Type: Permanent
Operating mode: Remote

Tech stack

    Azure: advanced
    .NET: advanced
    Java: advanced
    Cloud: advanced
    Git: regular
    Security: regular
    ARM: regular
    AKS: regular

Job description

Online interview
Project Description

Our client is a Danish jewelry brand and one of the most famous jewelry brands in the world. We are building a dedicated team to help them improve their automation processes and to develop a strong, long-term partnership.

As an SRE manager, you'll be responsible for creating an SRE process from scratch for one of the biggest jewelry e-commerce projects in Europe. You will define the core SRE process, build the team structure, hire and develop the SRE team, and work closely with key stakeholders to improve the processes that need it.

Responsibilities:

  • Engage and influence development, operations, and product groups, evangelizing SRE practices to align technology service and solution delivery;
  • Drive quality accountability within the organization with well-defined processes, metrics, and goals for process quality. This includes leading effective postmortems and ensuring actions are followed up;
  • Manage the availability, latency, scalability, and efficiency of product applications by building reliability engineering into our development life cycle, with a focus on fault-tolerant approaches;
  • Drive capacity planning, performance analysis, instrumentation, and other non-functional systems requirements;
  • Define and report progress on strategic initiatives and project-level tasks to all stakeholders, including senior executives, communicating effectively with each audience;
  • Implement metrics-driven processes to ensure service quality targets are met.

Skills Required:

  • Expert knowledge in all aspects of designing, developing, and managing large real-time systems;
  • Strong experience driving the full product lifecycle, including defining and hitting success metrics;
  • Experience with project and process management and with agile software development methodologies;
  • Experience in software delivery using .NET, Java;
  • Experience with Azure (AAD, Security, ARM, AKS);
  • Strong hands-on technical experience in software deployment and operations on public cloud platforms, including CI/CD, deployment automation, and pipelines;
  • 4+ years of experience with container orchestration technologies, observability, incident management, and continuous integration and deployment best practices;
  • Leadership experience as a people manager leading a technical team.

Will be a plus:

  • Familiarity with Git;
  • Experience operating services at scale;
  • Experience with highly available systems at scale;
  • Experience with incident command/management.

We offer

  • Permanent employment or B2B contract;
  • Private medical care + dental care / compensation for insurance up to $500 per year;
  • Language courses: English and Polish;
  • Paid days off (20/26), sick leave, public holidays / paid days off (15), sick leave (5), public holidays;
  • Professional training and courses: compensation up to 70%;
  • Support for sports teams and events.