SRE Engineer (Azure / AKS)
Site Reliability Engineer (Azure / AKS)
We are looking for a Site Reliability Engineer to join a platform team responsible for building and maintaining a cloud platform used by multiple development teams. The platform is based on Azure and Kubernetes and focuses on automation, reliability and standardization of environments. The team is also developing an Internal Developer Platform that allows development teams to manage infrastructure and environments in a self-service model.
Tech stack:
Azure
Azure Kubernetes Service (AKS)
Kubernetes
Terraform
GitHub Actions
Datadog
Prometheus
Responsibilities:
Building and maintaining infrastructure based on Azure and AKS
Developing and improving the Internal Developer Platform
Automating infrastructure and operational processes–
Supporting development teams through platform tooling
Improving observability, reliability and scalability of the platform
Working with monitoring solutions such as Datadog, Prometheus and Grafana
Must have:
Around 4-5 years of experience with Azure
Strong experience with Kubernetes / Azure Kubernetes Service (AKS)-
Experience with Infrastructure as Code (Terraform preferred)
Experience with CI/CD pipelines (GitHub Actions preferred)
Experience working with cloud infrastructure and containerized environments
English for daily communication
Nice to have:
Experience with ArgoCD
Experience with observability tools (Datadog, Prometheus, Grafana)
Experience with platform engineering or Internal Developer Platforms
Interest in automation or AI tools
Work model:
Hybrid work - 2 days per week in the Kraków office
SRE Engineer (Azure / AKS)
SRE Engineer (Azure / AKS)