#1 Job Board for tech industry in Europe

Senior Site Reliability Engineer

7 090 - 8 726 USDNet per month - B2B

5 890 - 7 253 USDGross per month - Permanent

DevOps

Senior Site Reliability Engineer

DevOps

-, Warszawa +4 Locations

AVSystem

Full-time

B2B, Permanent

Senior

Remote

7 090 - 8 726 USDNet per month - B2B

5 890 - 7 253 USDGross per month - Permanent

Job description

Senior Site Reliability Engineer

As a Senior Site Reliability Engineer, you’ll be at the heart of our most critical operations, building the fault-tolerant systems that serve our most important clients: Communications Service Providers (CSP).

We build, test, launch, and operate the complex, high-stakes systems for our global telco customers. Your mission is to ensure the reliability, efficiency, and performance of our core products (like UMP, CEM, BSAP, and DHCP) across both our cloud and complex on-premise deployments.

This is no small task. Our products handle hundreds of millions of devices in 100+ large deployments worldwide by industry giants like T-Mobile, Play and Vodafone, also via our Cloud offering.

As we embrace more cloud-native and Kubernetes-based deployments, we're facing new architectural challenges. We aren't just looking for someone to maintain systems; we are looking for an experienced engineer who loves to solve complex operational problems and is passionate about building the automation that will lead us forward.

Requirements

5+ years of professional experience in Site Reliability Engineering, DevOps, or a related role such as Systems or Software Engineering.
Advanced proficiency in a high-level language such as Python or Go, with the ability to design and build complex, maintainable automation services and frameworks.
Deep expertise with cloud infrastructure (e.g., GCP, AWS, or Azure) and container orchestration (Kubernetes).
Proven experience with Infrastructure as Code and configuration management (e.g., Terraform, Ansible).
Expert-level understanding of networking (TCP/IP, OSI) and Unix/Linux systems (Ubuntu, RHEL).
Expertise in designing, implementing, and managing monitoring and observability tools (e.g., Prometheus, Grafana, Zabbix).
A strong sense of ownership, and the ability to lead technical discussions and mentor other engineers.
Proficiency in English (B2+).

A huge plus if you have experience with:

A formal SRE role in a previous company.
Database setup and administration (e.g., MongoDB, Redis).
Performance tuning and debugging of JVM-based applications.
Building and scaling distributed systems.
Log management stacks (e.g., ELK, Loki).

Responsibilities

Design, build, and maintain complex, maintainable automation services and frameworks to eliminate toil and scale our operations.
Proactively identify, debug, and resolve complex performance and reliability issues within our core product codebases.
Communicate directly with technical customer teams to troubleshoot, manage, and resolve complex production issues.
Lead blameless postmortems and Root Cause Analyses (RCAs) for complex incidents, driving preventative measures.
Establish and monitor Service Level Indicators (SLIs) to align the team with availability and latency objectives.
Participate in an additionally paid 24/7 on-call rotation, responding to and resolving critical system issues.
Mentor junior and mid-level engineers through code reviews, design discussions, and pair programming.
Collaborate with development teams on feature design and architecture to ensure reliability, scalability, and operability from the start.
Set up and configure software, networks, and operating systems across bare metal, VMs, and cloud/Kubernetes infrastructure.
Drive improvements to our monitoring and observability stack (Prometheus, Grafana, Loki) to provide a comprehensive view of system health.

What we offer

Freedom and responsibility. Our goal is to inspire people more than manage them. We want our teams to do what is best for our products. This, in turn, generates a sense of responsibility which drives us to do great work.
Technical challenges: our customers rely on the reliability of our products to generate revenue in their business. The telco industry is ever-growing and needs us to support that growth.
Open-source contribution opportunities.
A team of highly skilled and humorous colleagues.
Access to the best tools and equipment available in the market.
A MacBook Pro / ThinkPad with 2 monitors.
Company events and team building activities.
Multiple career paths and employee development options – we want you to develop into a tech lead in the future, but we’ll support you in getting another dream role in site reliability, management, product development or sales.
Flexible working hours/remote work when you need it
Trainings and conferences
Multisport card
Kitchen full of snacks and treats (including Good Lood ice cream)
Car parking area and bike room
A relaxed work atmosphere – no dress code, no open space

Tech stack

Polish

English

DevOps

advanced

Cloud

advanced

IaC

advanced

Prometheus

advanced

Grafana

advanced

Software Development

advanced

Linux

advanced

Kubernetes

advanced

SRE

advanced

Networking

advanced

Office location

Senior Site Reliability Engineer

7 090 - 8 726 USDNet per month - B2B

Summary of the offer

Senior Site Reliability Engineer

-, Warszawa

AVSystem

7 090 - 8 726 USDNet per month - B2B

5 890 - 7 253 USDGross per month - Permanent

By applying, I consent to the processing of my personal data for the purpose of conducting the recruitment process. The administrator of personal data processed for recruitment purposes is AVSystem spółka z ograniczoną odpowiedzialnością based in Kra... MoreThis site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Check similar offers

Fibertide

Remote

Senior Site Reliability Engineer

6 817 - 8 726USD/month

Programming

AWS

Computer science

Cloud

IaC

Algorithms

GCP

Networks

English

Communication

SeniorSeniorMandate contract, B2BMandate, B2B

ADVERTISEMENT: Recommended by Just Join IT

Salary

7 090 - 8 726 USD

Net per month - B2B

5 890 - 7 253 USD

Gross per month - Permanent

Applied -

Check similar offers

Fibertide

Remote

Senior Site Reliability Engineer

6 817 - 8 726USD/month

Programming

AWS

Computer science

Cloud

IaC

Algorithms

GCP

Networks

English

Communication

SeniorSeniorMandate contract, B2BMandate, B2B

N-iX

Remote

Senior Site Reliability Engineer

5 000 - 6 000USD/month

Redis

Postgres SQL

CI/CD

Azure

Terraform

SeniorSeniorB2BB2B

Ness Solution

Remote

DevOps - Senior Site Reliability Engineer

35 - 44USD/h

Datadog

DNS

CI/CD

Bash

Grafana

Ansible

GitLab

Terraform

Python

HTTP

SeniorSeniorB2BB2B

Sigma Software

Remote

Principal Site Reliability Engineer

New

Undisclosed Salary

AWS

CI/CD

Terraform

Kubernetes

Python

SeniorSeniorB2B, PermanentB2B, Permanent

New

DCV Technologies

Remote

Platform/Site Reliability Engineer (SRE)

Undisclosed Salary

AWS

Azure

GCP

API

Terraform

Agile

Python

SeniorSeniorB2BB2B

ADVERTISEMENT: Recommended by Just Join IT