Performance Engineer - Akamai Inference Cloud
Opolska 100, Kraków
Akamai Technologies
Do you thrive on optimizing AI systems for peak performance?
Are you ready to push the boundaries of inference speed and efficiency?
Join the Akamai Inference Cloud Team!
The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. We build AI platforms for efficient, compliant, and high-performing applications. These platforms support customers in running inference models and empower developers to create advanced AI solutions effectively. #AIC
Partner with the best
The Performance Engineer ensures optimal benchmarking, tuning, and performance of an AI inference platform. Responsibilities include applying advanced optimization techniques to enhance throughput, reduce latency, and improve resource efficiency. The role involves working with models, hardware accelerators, and infrastructure. Expertise in AI/ML performance optimization, proficiency with inference frameworks, and a passion for maximizing hardware and software performance are essential.
As a Performance Engineer, you will be responsible for:
Benchmarking and profiling AI models and inference workloads across different hardware configurations, measuring latency, throughput, and resource utilization.
Researching and implementing model optimization techniques including quantization, pruning, distillation, and hardware-specific optimizations.
Optimizing inference frameworks and infrastructure to maximize performance, working with TensorRT, vLLM, TorchServe, Triton and other serving platforms.
Establishing performance baselines and monitoring for the platform, identifying and addressing performance regressions.
Collaborating with engineering teams to identify bottlenecks, recommend optimizations, and validate performance improvements.
Do what you love
To be successful in this role you will:
Have experience in performance engineering with hands-on expertise in AI/ML model optimization and inference performance tuning.
Demonstrate solid knowledge of inference optimization techniques including quantization (INT8, FP16), model compilation, hardware acceleration, and familiarity with compiler optimizations and ML compilers.
Show proficiency with GPU optimization and understanding of memory hierarchies and techniques to maximize hardware utilization.
Have experience with profiling and benchmarking tools for AI workloads, identifying performance bottlenecks in distributed systems.
Demonstrate problem-solving skills with ability to analyze performance data, communicate insights clearly, and drive optimization efforts.
Possess knowledge of distributed inference and model parallelism techniques.
Have experience with cost optimization for compute-intensive workloads.
Work in a way that works for you
FlexBase, Akamai's Global Flexible Working Program, is based on the principles that are helping us create the best workplace in the world. When our colleagues said that flexible working was important to them, we listened. We also know flexible working is important to many of the incredible people considering joining Akamai. FlexBase, gives 95% of employees the choice to work from their home, their office, or both (in the country advertised). This permanent workplace flexibility program is consistent and fair globally, to help us find incredible talent, virtually anywhere. We are happy to discuss working options for this role and encourage you to speak with your recruiter in more detail when you apply. Learn what makes Akamai a great place to work
Connect with us on social and see what life at Akamai is like!
We power and protect life online, by solving the toughest challenges, together.
At Akamai, we're curious, innovative, collaborative and tenacious. We celebrate diversity of thought and we hold an unwavering belief that we can make a meaningful difference. Our teams use their global perspectives to put customers at the forefront of everything they do, so if you are people-centric, you'll thrive here.
Working for you
At Akamai, we will provide you with opportunities to grow, flourish, and achieve great things. Our benefit options are designed to meet your individual needs for today and in the future. We provide benefits surrounding all aspects of your life:
Your health
Your finances
Your family
Your time at work
Your time pursuing other endeavors
Our benefit plan options are designed to meet your individual needs and budget, both today and in the future.
About us
Akamai powers and protects life online. Leading companies worldwide choose Akamai to build, deliver, and secure their digital experiences helping billions of people live, work, and play every day. With the world's most distributed compute platform from cloud to edge we make it easy for customers to develop and run applications, while we keep experiences closer to users and threats farther away.
Join us
Are you seeking an opportunity to make a real difference in a company with a global reach and exciting services and clients? Come join us and grow with a team of people who will energize and inspire you!
#LI-Remote

We’ve built what no one else has — a massively distributed edge and cloud platform for cloud computing, security, and content delivery. We pioneer solutions for the fast, vast, and volatile digital world.
Performance Engineer - Akamai Inference Cloud
Performance Engineer - Akamai Inference Cloud
Opolska 100, Kraków
Akamai Technologies