Senior Generative AI DevOps Engineer
We are seeking a Senior DevOps Engineer specialized in Generative AI Operations to design and maintain scalable AI/ML workloads and infrastructure.
Join our team to build robust, secure, and reliable cloud infrastructure that supports advanced physics simulations and AI model deployment. Apply now to contribute to cutting-edge AI solutions in a collaborative environment.
Responsibilities
Design and develop scalable AI/ML workloads and infrastructure
Create and maintain secure and reliable infrastructure to enhance client satisfaction
Automate infrastructure provisioning using Infrastructure-as-Code tools such as Terraform or Ansible
Develop infrastructure supporting physics simulations and model training for Physics-Informed Machine Learning applications
Deploy and maintain AI models, data pipelines, and cloud infrastructure
Implement observability frameworks for monitoring, logging, and tracing AI services in production
Mentor and coach team members to promote best practices and continuous improvement
Requirements
Bachelor's or Master's degree in Computer Science, Engineering, Data Science, or related field
3+ years of experience in infrastructure engineering with a focus on AI/ML
Strong expertise in Infrastructure as Code tools such as Terraform and Ansible
Proficiency with cloud platforms including Google Cloud Platform, Azure, or AWS
Experience in containerization and orchestration tools such as Docker and Kubernetes
Familiarity with infrastructure for High-Performance Computing (HPC) workloads
Experience with GPU-accelerated compute environments and AI-specific tools like NVIDIA Triton, Kubeflow, or MLflow
Strong problem-solving skills
Ability to work effectively in agile, cross-functional teams
Strong written and verbal English communication skills (B2+)
Nice to have
Proficiency in Python programming
Experience with AI/ML frameworks such as PyTorch, TensorFlow, Hugging Face, or scikit-learn
We offer
We gather like-minded people:
Engineering community of industry professionals
Friendly team and enjoyable working environment
Flexible schedule and opportunity to work remotely within Poland
Chance to work abroad for up to 60 days annually
Business-driven relocation opportunities
We provide growth opportunities:
Outstanding career roadmap
Leadership development, career advising, soft skills, and well-being programs
Certification (GCP, Azure, AWS)
Unlimited access to LinkedIn Learning, getAbstract, and A Cloud Guru
English classes
We cover it all:
Stable income (Employment Contract or B2B)
Participation in the Employee Stock Purchase Plan
Benefits package (health insurance, multisport, shopping vouchers)
Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
Referral bonuses
Corporate, social and well-being events
Please note:
The set of bonuses might vary based on the role you apply for – specifics will be discussed with our recruiter during the general interview
We will contact only selected candidates
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.