Site reliability engineer

DevOps

Site reliability engineer

DevOps
Centrum, Stockholm Metropolitan Area

emagine Polska

Full-time
Any
Mid
Hybrid

Job description

emagine is looking for a Site reliability engineer for our client in retail industry.

Period from: 2026-06-22
Period to: 2026-11-30


Job description:

  • Work in a cross functional team working with Reliability as Expertise in a product or a product area.

  • Apply Reliability engineering practices with support from SRE governance teams.

  • Ensure delivery quality and supply KPI reporting.

  • Collaborate closely within product teams to ensure predictable operations and minimal disruptions to Production.

  • Collaborate closely within your Capability, share best practices as well as discuss and improve on operations ways of working.

  • Work together in a cross-functional product team to monitor, manage, and resolve issues of the supported applications.

  • Technical analysis, troubleshooting of complex issues/Incidents in production.

  • Improve monitoring performance by focusing on preventive measures.

  • Product Improvements (code & log analysis).

  • Continuous improvement on proactive monitoring, housekeeping automation to proactively detect and avoid incidents.

  • Ensure environment stability and reliability.

  • Automate processes impacting development and production leveraging tools and building scripted solutions.

  • Participate in On-Call technical support to resolve business critical incidents

Profile / requirements:

  • 5+ years of experience in Site Reliability Engineering, maintenance & operations and/or development.

  • Strong working experience eCommerce.

  • Strong working experience in DevOps practices(automated testing, CI/CD etc.).

  • Experience within solutions architecture and how to fast pinpoint causes of issues.

  • Experience from working with API-based frameworks (e.g., Commerce tools or Fabric is ideal).

  • Experience from ITIL support processes and ITSM tools (e.g., ServiceNow) in a micro services context.

  • Experience of maintaining/supporting and/or developing desktop and mobile applications.

  • Knowledge of design principles and fundamentals of solutions architecture is a plus.

  • Understanding of performance engineering (Application Reliability).

  • Knowledge of multiple front-end languages and libraries (ReactJS, React Native, NodeJS).

  • Experience in building CI/CD workflows using GitHub Actions.

  • Experience working on cloud-based infrastructure e.g., Azure and GCP.

  • Experience in provisioning Infra resources leveraging Infra as Code (Terraform / Ansible).

  • A passion for problem solving with strong analytical capabilities.

  • Know at least one of {Python, Ruby, Java, C#, Go} at an intermediate level.

  • Experience in monitoring tools (Splunk, Grafana etc.).

  • Experience working through SRE Metrics such as SLI, SLO and Error Budget.

  • Experience with managed cloud Kubernetes services (e.g. AKS, GKE).

  • Familiarity with common tech stacks in Headless Ecommerce is a nice to have.

  • Knowledge of Azure DevOps and/or other cloud environments is nice to have.

Tech stack

    English

    B1

    API (Application Programming Interface)

    advanced

    Provisioning

    advanced

    Operations

    advanced

    maintenance

    advanced

    Java

    advanced

    Python

    advanced

    Ruby

    advanced

    C#

    advanced

    ITIL

    advanced

    CI/CD

    advanced

Office location

Site reliability engineer

Summary of the offer

Site reliability engineer

Centrum, Stockholm Metropolitan Area
emagine Polska
By applying, I consent to the processing of my personal data for the purpose of conducting the recruitment process. Informujemy, że administratorem danych jest emagine z siedzibą w Warszawie, ul.Domaniewskiej 39A (dalej jako "administrator"). Masz pr... MoreThis site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.