Responsibilities
Platform Development & Operations
- Design and operate Azure Kubernetes Service (AKS) clusters with tenant separation.
- Develop and maintain Helm charts for application deployment.
- Implement and manage ingress controllers, policies (Kyverno), and alerting systems (Alertmanager).
Infrastructure as Code
- Use Terraform to manage Azure resources and automate infrastructure provisioning.
- Ensure best practices in code structure, reusability, and security.
Monitoring & Observability
- Set up and operate monitoring and logging tools (Grafana, Prometheus, Commvault, Stash).
- Optimize observability pipelines for application teams.
Collaboration
- Cooperate with cross-functional development teams to support deployment and performance optimization.
- Communicate with stakeholders regarding platform capabilities and enhancements.
Qualifications
- We are looking forward to your successfully completed university studies in computer science or comparable courses of study.
- 5+ years of experience in cloud platform engineering or DevOps roles.
- Deep knowledge of Kubernetes (AKS), container orchestration, and Helm.
- Proven experience with Terraform in production environments.
- Familiarity with logging/monitoring stacks (Grafana, Prometheus).
- Experience with service mesh (Istio) is a strong plus.
- Working knowledge of networking concepts in Azure.
- Comfortable working in managed services or platform engineering setups.
- English fluency is required. German - nice to have.
- Openness to visit Katowice office at least once per quarter.
About Cluster Reply
Cluster Reply is the Reply Group company specializing in consulting and system integration of Microsoft technologies. As a Microsoft partner, Cluster Reply is active in Germany, Austria and Switzerland and works within the Reply network with sister companies in Brazil, Great Britain, Italy as well as the USA. The company focuses on innovation and supports customers in their digital transformation. The solutions range from on-premises to cloud applications in the areas of modern workplace and security, business applications, applications and infrastructure as well as data and artificial intelligence.