SRE Tech Lead
We are looking for an experienced Senior SRE / DevOps Tech Lead who will combine strong hands-on expertise with technical leadership and team management. This role is responsible not only for the reliability and scalability of production systems, but also for guiding engineers, setting technical direction, and building a strong SRE culture across the organization.
RESPONSIBILITIES
🔧 Technical Leadership & Team Management
Leading and mentoring a team of SRE / DevOps engineers (technical guidance, code reviews, best practices)
Supporting team growth through coaching, knowledge sharing, and technical decision-making
Setting technical standards and ensuring consistency across environments and teams
Collaborating with Engineering Managers and Product teams on priorities and roadmap alignment
☁️ Infrastructure, Reliability & DevOps
Designing, implementing, and scaling resilient infrastructure in AWS (multiple accounts, production and pre-production environments)
Maintaining and evolving Kubernetes (EKS) environments using Helm, ArgoCD, and Terraform, ensuring predictable and auditable deployment processes
Owning SRE best practices: SLIs/SLOs, error budgets, reliability reviews, and capacity planning
Building and improving observability using Dynatrace, Grafana, cloud-native metrics, and open-source tooling
Optimizing Cloudflare configuration (WAF, cache and routing rules, perimeter security) to improve performance and security
Automating infrastructure, deployments, and operational tasks using GitHub Actions, Python, and Bash
Leading incident response, coordinating on-call activities, running post-mortems, and driving continuous improvement
REQUIREMENTS
Must-have
Minimum 6+ years of experience in SRE / DevOps roles in AWS-based production environments (AWS preferred, Azure acceptable)
Proven experience in a Tech Lead / Senior role, including mentoring or leading engineers
Strong proficiency with Terraform, Helm, ArgoCD, and GitHub Actions
Excellent knowledge of Kubernetes (EKS): autoscaling, rollout strategies, troubleshooting, cluster architecture
Experience building and maintaining observability pipelines (logs, metrics, traces, SLIs/SLOs, alerting)
Ability to design high-availability and fault-tolerant systems
Solid understanding of CI/CD principles and GitOps practices
Hands-on experience with Cloudflare (DNS, CDN, WAF, rulesets)
Practical experience with monitoring tools such as Dynatrace, Prometheus, and Grafana
Very good command of English (daily collaboration with teams in Europe and the US)
Experience in incident response: on-call rotations, RCA, post-mortems
WHY JOIN?
Stable, long-term B2B cooperation directly with the end client
Work on high-scale systems with real impact on a platform used by millions of users
Combination of hands-on engineering and technical leadership
Real influence over architecture, tooling, and reliability standards
100% remote work, flexible hours, and an async-friendly environment
Mature engineering culture with partnership-based collaboration
Work alongside senior experts from Europe and the US
Access to a modern tech stack: AWS, EKS, Terraform, ArgoCD, Cloudflare, Dynatrace, and cloud-native tooling
TQLO Sp. z o.o. – Employment Agency (KRAZ No. 33580)
Thank you for all applications. We will contact selected candidates.

TQLO SPÓŁKA Z OGRANICZONĄ ODPOWIEDZIALNOŚCIĄ
TQLO to dynamicznie rozwijająca się firma specjalizująca się w rekrutacji IT oraz outsourcingu usług technologicznych. Działa na polskim rynku, dostarczając lokalnie wykwalifikowanych inżynierów, którzy pomagają klientom...
SRE Tech Lead
SRE Tech Lead