Tech Lead Site Reliability Engineer
We’re looking for top-tier talent ready to make a real impact as a Technical Lead in a complex, high-traffic, business-critical environment, covering both internet-facing platforms and cloud-based systems. In this role, you will drive the team’s technical direction, coordinate tasks, and mentor engineers. You will also plan, design, and implement scalable local and wide-area network solutions across diverse platforms and protocols, including IP and VOIP.
You will be accountable for ensuring optimal system performance, proactively identifying and resolving network issues, and coordinating the installation of hardware such as routers and switches with external vendors. In addition, you will develop automation tools to streamline deployment, administration, and monitoring of network infrastructure, setting the standard for technical excellence within the team.
The Main Responsibilities
Guide the team’s technical direction, set standards for high-quality solutions, and support the professional growth of team members.
Ensure high availability and optimal performance of production applications, proactively identifying and resolving issues.
Monitor systems and respond swiftly to incidents using APM tools and log analysis.
Drive automation across deployments and processes using CI/CD, Jenkins, and Git to enhance efficiency and reliability.
Create and maintain technical documentation and runbooks.
Collaborate closely with development, QA, and infrastructure teams.
Troubleshoot complex performance issues, including JVM, database connections, and web containers.
Manage test and production environments, accounting for differences between them.
Lead postmortem analyses and implement preventive measures to continuously improve system resilience.
Requirements
Bachelor’s degree or equivalent in engineering, computer science, or a related field
10+ years of hands-on experience in software engineering, delivering complex, high-impact solutions
2+ years of proven experience leading a technical team, mentoring engineers, and driving technical strategy
Strong expertise in DevOps practices, Git, CI/CD, and related tools, with 2+ years applying them in real-world projects
6–8 years of experience building scalable systems with Java/J2EE, Microservices, Spring, SpringBoot, REST APIs, and Postman
Experienced in monitoring and optimizing system performance using APM tools, preferably AppDynamics or Dynatrace
Solid knowledge of core infrastructure technologies, including load balancers, SSL, API Gateways, and DNS
Highly autonomous, proactive, and able to make technical decisions that shape the team’s success
Nice to have:
Experience with AIOps tools like Splunk and Big Panda
Experience supporting containerized or cloud applications
What We Offer
Flexible working hours and remote options
Opportunity to work on high-impact global projects
Access to cutting-edge technologies and continuous learning
Collaborative, open culture with room to grow
Competitive salary and long-term career development