Senior Site Reliability Engineer (Ansible)
Do you enjoy collaborating with teams to solve complex challenges?
Do you have a passion for cutting-edge technologies?
Join our highly skilled Site Reliability Engineering team!
Our team designs, develops, and manages applications and infrastructure that support Akamai Cloud's products and services. Our SRE teams solve reliability, security, and usability challenges at scale for our global fleet while keeping Akamai's mission at the forefront of what we do: make life better for billions of people, billions of times a day.
Partner with the best
As a member of our ACDC SRE team, you will design, develop, and operate application and infrastructure deployment, configuration, and change orchestration for the Akamai Cloud.
As a Senior Site Reliability Engineer, you will be:
Designing, developing, testing, and operating essential services to enhance the reliability, scalability, and performance of infrastructure systems.
Designing and implementing observability solutions, including monitoring, logging, alerting, and telemetry, to identify and address issues before customer impact.
Enhancing reliability via automation, minimizing operational toil, and boosting resilience within engineering processes.
Developing extensive expertise in ACDC systems and acting as a reliable resource, guiding engineers and sharing effective practices team-wide.
Collaborating closely with software engineering, infrastructure, and platform teams to resolve complex production issues, determine root causes, and implement solutions.
Participating in an on-call rotation and delivering technical expertise during incidents, ensuring prompt restoration, clear communication, and post-incident enhancements.
Do what you love
To be successful in this role you will:
Demonstrate expertise in Ansible through playbook development, role creation, automation workflows, and enterprise-scale configuration management processes.
Manage Infrastructure as Code solutions utilizing tools like Terraform, SaltStack, Ansible, Chef, Puppet, or comparable technologies effectively and efficiently.
Design, develop, and deploy software and infrastructure at scale within a Linux environment with advanced-level expertise.
Demonstrate advanced experience in a site reliability or software engineering role, working with large-scale distributed systems.
Have great communication and interpersonal skills.
Demonstrate accountability for reliability, develop automation and monitoring, and collaborate effectively with an engineering team unfamiliar with SRE practices.
Build your career at Akamai
Our ability to shape digital life today relies on developing exceptional people like you. The kind that can turn impossible into possible. We’re doing everything we can to make Akamai a great place to work. A place where you can learn, grow and have a meaningful impact.
With our company moving so fast, it’s important that you’re able to build new skills, explore new roles, and try out different opportunities. There are so many different ways to build your career at Akamai, and we want to support you as much as possible. We have all kinds of development opportunities available, from programs such as GROW and Mentoring, to internal events like the APEX Expo and tools such as LinkedIn Learning, all to help you expand your knowledge and experience here.
Learn more
Not sure if this job is the right match for you, or want to learn more about it before you apply? Schedule a 15-minute exploratory call with the recruiter, who will be happy to share more details.