Site reliability engineer

DevOps

Centrum, Stockholm Metropolitan Area

emagine Polska

Full-time

Any

Mid

Hybrid

Job description

emagine is looking for a Site Reliability Engineer for our client in the retail industry.

Period from: 2026-06-22
Period to: 2026-11-30

Job description:

Work in a cross-functional team, providing reliability expertise for a product or product area.
Apply Reliability engineering practices with support from SRE governance teams.
Ensure delivery quality and provide KPI reporting.
Collaborate closely within product teams to ensure predictable operations and minimal disruptions to Production.
Collaborate closely within your Capability: share best practices, and discuss and improve operational ways of working.
Work together in a cross-functional product team to monitor, manage, and resolve issues of the supported applications.
Perform technical analysis and troubleshooting of complex issues and incidents in production.
Improve monitoring performance by focusing on preventive measures.
Contribute to product improvements through code and log analysis.
Continuously improve proactive monitoring and housekeeping automation to detect and avoid incidents early.
Ensure environment stability and reliability.
Automate processes that affect development and production by leveraging tools and building scripted solutions.
Participate in on-call technical support to resolve business-critical incidents.

Profile / requirements:

5+ years of experience in Site Reliability Engineering, maintenance & operations and/or development.
Strong working experience in eCommerce.
Strong working experience in DevOps practices (automated testing, CI/CD, etc.).
Experience with solutions architecture and the ability to quickly pinpoint the causes of issues.
Experience working with API-based frameworks (ideally commercetools or Fabric).
Experience with ITIL support processes and ITSM tools (e.g., ServiceNow) in a microservices context.
Experience maintaining, supporting, and/or developing desktop and mobile applications.
Knowledge of design principles and fundamentals of solutions architecture is a plus.
Understanding of performance engineering (Application Reliability).
Knowledge of multiple front-end languages and libraries (ReactJS, React Native, NodeJS).
Experience in building CI/CD workflows using GitHub Actions.
Experience working on cloud-based infrastructure (e.g., Azure and GCP).
Experience provisioning infrastructure resources using Infrastructure as Code (Terraform / Ansible).
A passion for problem solving with strong analytical capabilities.
Intermediate-level knowledge of at least one of Python, Ruby, Java, C#, or Go.
Experience with monitoring tools (Splunk, Grafana, etc.).
Experience working with SRE metrics such as SLIs, SLOs, and error budgets.
Experience with managed cloud Kubernetes services (e.g. AKS, GKE).
Familiarity with common tech stacks in headless eCommerce is nice to have.
Knowledge of Azure DevOps and/or other cloud environments is nice to have.

Tech stack

English

API (Application Programming Interface): advanced
Provisioning: advanced
Operations: advanced
Maintenance: advanced
Java: advanced
Python: advanced
Ruby: advanced
C#: advanced
ITIL: advanced
CI/CD: advanced
