SRE Manager

Offer expired

DevOps

DCG

Wrocław

Type of work: Full-time
Experience: Senior
Employment Type: B2B
Operating mode: Remote

Tech stack

Puppet: advanced
Datadog: advanced
New Relic: advanced
Prometheus: advanced
Ansible: advanced
Cloud: advanced
DevOps: advanced
GCP: advanced

Job description

Online interview

DCG is a modern technology company that brings together IT professionals. Because of our continued growth and the large number of recruitment projects we carry out for our Partners, we are looking for an SRE Manager.

B2B contract, starting 15.01.2024
100% remote work
Long-term cooperation
This role cooperates with the client's worldwide teams, focusing on the US and LATAM. Willingness to work hours close to US time zones is a must.

DESCRIPTION

A Site Reliability Engineering (SRE) Sr. Manager leads a team of site reliability engineers, focusing on designing, implementing, and maintaining highly reliable and scalable systems. The role emphasizes operational excellence, monitoring, automation, and collaboration with development teams to ensure the stability and performance of critical applications and infrastructure across the organization. Combining strong technical expertise with leadership skills, the SRE Sr. Manager drives continuous improvement and maintains high service availability.

RESPONSIBILITIES

Team Leadership:

- Recruit, hire, and develop a high-performing SRE team.

- Provide mentorship and coaching to junior FTE SRE engineers within the client's teams.

- Set clear goals and expectations for the team.

- Develop and track objectives and key results (OKRs) for the team as a whole and for individual team members.

Technical Strategy:

- Define and implement SRE best practices, standards, and processes.

- Own end-to-end availability and performance of key services and build automation to prevent problem recurrence.

- Deliver end-to-end automation using Terraform within Google Cloud for tasks such as creating a new project, adding a user to an existing project, requesting access to a new service, and enabling a new Google service in an existing project.

- Expert knowledge of IAM, roles, and permissions within Google Cloud.

- Design various user roles that consider both security and user experience.

- Design and build monitoring systems to identify potential issues proactively.

- Establish Service Level Objectives (SLOs) for all offered services.

- Manage cloud expenses within the budget.

Collaboration:

- Bring a customer-obsessed attitude and create a seamless user experience for any team requesting infrastructure services.

- Work closely with development teams to identify potential reliability issues early in the development cycle.

- Collaborate with security teams to maintain system security and compliance.

- Excellent written and verbal communication skills.

Performance Optimization:

- Analyze system metrics to identify performance bottlenecks and opportunities for improvement.

- Implement capacity planning strategies to ensure system resilience under high load.

- Continuously monitor and optimize system performance.

REQUIREMENTS

Bachelor’s degree in Computer Science, a related field, or equivalent practical experience
8 years of experience with data structures or algorithms
5 years of experience with software development in one or more programming languages
3 years of experience managing people or teams, leading projects, and designing, analyzing, and troubleshooting distributed systems
Strong understanding of software development lifecycle (SDLC) and DevOps principles
Deep technical expertise in cloud computing platforms (GCP is our platform, but some services are hosted in Azure)
Proven experience with monitoring tools (Prometheus, Datadog, New Relic)
Experience with automation frameworks (Ansible, Puppet, Chef)
English level: C1, C2

OFFER

Private medical care
Co-financing of a sports card
Ongoing support from a dedicated consultant
