Site Reliability Engineer
Location: 100% Remote
Working hours: 9:00–17:00 CET
On‑call: Weekend rotation, approximately twice per quarter (with additional compensation)
Join a world‑class product engineering team that is redefining how global enterprises consume cloud services. We are looking for an experienced SRE to help build scalable, reliable and modern cloud solutions.
Tasks:
Architecture, Design & Development
Develop and implement DevOps and SRE strategies to optimize the end‑to‑end software delivery lifecycle.
Work with containerization (Docker) and orchestration (Kubernetes — key requirement) to support scalable and resilient application deployment.
Support application deployment and monitoring in Big Data environments.
Deliver solutions aligned with architectural principles and organizational guidelines.
Participate in system design discussions and peer code reviews.
Deployment & Operations
Design, build, manage and operate infrastructure and configuration for SaaS applications, with a strong focus on automation and Infrastructure as Code.
Implement and maintain CI/CD pipelines (e.g., Jenkins, GitLab CI).
Implement observability and monitoring using tools such as Cloudflare and AppDynamics.
Troubleshoot and resolve production incidents while meeting SLA requirements.
Automate operational processes (experience with Ansible is highly valued).
Collaboration
Work effectively in a globally distributed development environment with minimal supervision.
Collaborate closely with the Product Owner and development teams across multiple regions.
Engineers based in Poland work Polish business hours, even though the wider team spans multiple time zones.
Innovation
Evaluate and integrate tools that improve development and operational workflows (version control, build systems, deployment tools).
Investigate, analyze and adopt emerging technologies.
Identify opportunities for automation, optimization and continuous improvement.
Requirements:
Must‑have
Bachelor’s or Master’s degree in Computer Science, Software Engineering or a related field, with 5+ years of industry experience.
Minimum 2 years of hands‑on SRE and/or DevOps experience.
Strong expertise in Kubernetes (top priority) and Docker.
Experience with public cloud platforms: AWS is strongly preferred (Azure or GCP also acceptable).
Solid programming foundations: data structures, algorithms, concurrency, design patterns, best practices.
Proficiency in scripting languages: Bash, Python, Perl, PHP or Ruby.
Experience with CI/CD tools (e.g., Jenkins, GitLab CI).
Familiarity with at least one primary programming language — preference for Java.
Strong system administration skills and automation mindset.
Good understanding of continuous integration and deployment practices.
Nice‑to‑have
Experience with Big Data technologies (Flink, Kafka).
Knowledge of Redis, MongoDB and/or Cassandra.
Experience with Git and modern version control workflows.
Basic knowledge of enterprise Java stack (Spring Boot, REST APIs, JPA/Spring Data, Maven, JUnit).
Experience with performance and load testing tools (JMeter, LoadRunner).
Familiarity with Agile/Scrum.
GCP knowledge — welcome but not required.
Offer:
Multisport card
Private healthcare
Access to an e‑learning platform
Group life insurance