#1 Job Board for tech industry in Europe

Senior Site Reliability Engineer
Python

Senior Site Reliability Engineer

Kraków
Type of work
Undetermined
Experience
Senior
Employment Type
B2B
Operating mode
Remote

Tech stack

    Cloud Platforms

    advanced

    Python

    advanced

    Go

    advanced

    Rust

    advanced

    Kubernetes

    regular

    CI/CD

    regular

Job description

Online interview
Job Summary:

As a member of PubNub's Engineering organization, you will work alongside Engineers and Architects in designing, developing, operating and scaling PubNub’s Data Stream Network, with a focus on improving the reliability, scale and efficiency of our global Data Stream Network. The infrastructure you will manage creates billions of events and produces terabytes of data on a daily basis. You will have the unique opportunity to help architect PubNub's infrastructure to solve challenging problems related to distributed systems, real-time messaging, and large scale data management.

Responsibilities:

  • Design processes for improving operational stability of PubNub services
  • Collaborate with engineering teams, product owners, and other stakeholders to develop tooling and documentation to scale our distributed systems
  • Ensure and enforce best application and network security practices
  • Participate in incident management on-call rotation and drive root cause analysis
  • Help define Service Level Objectives to assess release readiness of all services
  • Support, monitor and manage cloud infrastructure and environments (AWS EC2, EKS, DNS, load balancers, and databases)
  • Identify, document and help improve performance and operational efficiency challenges

Experience & Skills Required:

  • 3+ years of cloud platform experience. AWS preferred
  • 3+ years of system programming  (Python, Go, Rust, or equivalent)
  • Experience developing, supporting or operating large-scale, distributed SaaS products
  • Container orchestration systems management (Kubernetes, Helm, Docker)
  • Configuration management and automation tools such as Ansible, Terraform, etc
  • Experience with CI/CD tools and implementing best practices
  • Solid principles in cloud resources such as networking, load balancing, DNS, and security

Preferred:

  • A passion for system stability, performance, scalability and customer success
  • Desire to automate tedious tasks and eliminate inefficiencies
  • Experience refactoring apps to fit cloud native paradigms and scaling cloud native apps
  • Previous participation in Incident Management teams
  • Experience with Redis/ Cassandra / Consul
  • Experience with Linux operating system internals and networking (file systems, system calls, tunables etc)
  • Experience with protocols (TCP, HTTP, HTTPS)
  • Experience with Secrets management / Vault, Amazon Secrets manager
  • Experience operating Kubernetes clusters at scale
  • BS or MS in Computer Science or a related technical field
  • Desire to automate tedious tasks and eliminate inefficiencies

Benefits:

  • Unlimited (within reason) paid time off
  • Flexible hours
  • Remote work 
  • Stock options
  • All the hardware that you need to work effectively
  • Monthly stipend 800 PLN net for: Medicover, Warta insurance, Internet, phone, gym membership, home office setup 
  • Brand new office in the center of Katowice
  • Monthly voucher for Pyszne.pl

Note:  This position is not eligible for Visa sponsorship and will be a B2B contract hire.