Site Reliability Engineer

DevOps

Centrum, Lisbon

emagine Polska

Full-time
Any
Senior
Remote

Job description

Role Overview

We are looking for a skilled and proactive Observability Engineer to implement, automate, and support enterprise-grade observability and monitoring solutions across cloud and application platforms. The ideal candidate should have strong AWS infrastructure knowledge, hands-on automation skills, and experience building reliable monitoring and alerting ecosystems for modern distributed applications.

The role involves working closely with Platform Engineering, Data Engineering, and Application teams to develop observability solutions that improve operational visibility, reliability, incident detection, and platform performance.

Main Responsibilities

·        Design, implement, and maintain observability solutions for cloud-native and distributed systems.

·        Build monitoring, logging, alerting, and dashboarding solutions across infrastructure and applications.

·        Develop automation scripts and tooling using Python.

·        Implement and maintain Infrastructure as Code (IaC) using Terraform.

·        Build and support CI/CD pipelines using Jenkins and Git-based workflows.

·        Configure and optimize monitoring for AWS services, Kubernetes workloads, APIs, databases, and applications.

·        Create actionable alerts and operational dashboards to improve incident response and system reliability.

·        Work with engineering teams to onboard applications into observability platforms.

·        Support troubleshooting, root cause analysis, and performance optimization initiatives.

·        Ensure observability standards, governance, and best practices are followed across projects.

Key Requirements

·        Strong hands-on experience with Amazon Web Services (AWS).

·        Solid Python development/scripting experience.

·        Strong experience with Terraform.

·        Experience building and maintaining CI/CD pipelines using Jenkins.

·        Experience with Elasticsearch / ELK Stack, including building queries.

·        Experience monitoring data platforms is preferred.

·        Experience with Linux systems and shell scripting.

·        Understanding of monitoring, logging, and alerting concepts.

·        Experience working in Agile/DevOps environments.

Nice to Have Skills

Experience with any of the following is highly desirable:

·        Snowflake

·        Databricks

·        dbt

·        Matillion

·        Grafana

·        New Relic

·        Datadog

·        Prometheus

·        Elasticsearch / ELK Stack

Note: We are looking for an engineer who loves to build. This is a highly technical role: 90% of the job is hands-on coding in Python and Terraform.

Tech stack

    English: B1
    Web Services: master
    Python: master
    Git: advanced
    Cloud: advanced
    Jenkins: advanced
    CI/CD: advanced
    Linux: advanced
    Agile: advanced
    Performance optimization: advanced
    Microsoft Platform: advanced

25 days left to apply (until 15.07.2026)