Site Reliability Engineer
At Pretius we are looking for Site Reliability Engineer. Join a team responsible for the maintenance and development of a global CDN platform supporting large-scale OTT streaming services. The project focuses on ensuring high availability, performance, and scalability of both internal and cloud-based content delivery infrastructure used to distribute video content worldwide.
Project / Role
Ensure the availability, performance, and reliability of CDN platforms (cloud services, bare-metal servers, networks, and ISP caches).
Monitor and analyze key performance metrics (latency, throughput, cache efficiency, error rates) and propose optimizations.
Support deployments, production rollouts, and incident response, including root cause analysis.
Build and maintain observability and monitoring solutions (logs, metrics, alerts) using tools such as Datadog.
Develop automation scripts and internal tools (Python, Bash, APIs) for monitoring, diagnostics, and traffic analysis.
Contribute to Infrastructure as Code, CI/CD pipelines, and operational automation.
Collaborate with engineering, network, DevOps, and international teams to improve CDN performance and reliability.
Requirements
4+ years of experience in SysOps, DevOps, or SRE roles.
Strong understanding of network fundamentals (DNS, TCP, HTTP, routing, caching).
Experience with monitoring/observability tools (e.g., Datadog, Grafana).
Knowledge of DevOps tools such as Terraform, Ansible, AWS, CI/CD pipelines.
Strong Linux/Unix administration skills.
Fluent English and strong communication skills.
Nice to have:
Experience with CDN technologies or OTT streaming.
What do we offer?
We focus on long-term relationships based on fair principles and reliability.
Co-financing of the Multisport card and Medicover private healthcare.
Modern office available.
Team bonding activities, internal courses, conferences, certifications.
Site Reliability Engineer
Site Reliability Engineer