Senior Site Reliability Engineer

Python

Centrum, Warszawa

PubNub

Undetermined
B2B
Senior
Remote
6 104 - 7 353 USD
Net per month - B2B

Tech stack

  • Cloud Platforms: advanced
  • Python: advanced
  • Go: advanced
  • Rust: advanced
  • Kubernetes: regular
  • CI/CD: regular

Job description

Job Summary:

As a member of PubNub's Engineering organization, you will work alongside Engineers and Architects in designing, developing, operating and scaling PubNub’s global Data Stream Network, with a focus on improving its reliability, scale and efficiency. The infrastructure you will manage creates billions of events and produces terabytes of data on a daily basis. You will have the unique opportunity to help architect PubNub's infrastructure to solve challenging problems related to distributed systems, real-time messaging, and large-scale data management.

Responsibilities:

  • Design processes for improving operational stability of PubNub services
  • Collaborate with engineering teams, product owners, and other stakeholders to develop tooling and documentation to scale our distributed systems
  • Ensure and enforce best application and network security practices
  • Participate in incident management on-call rotation and drive root cause analysis
  • Help define Service Level Objectives to assess release readiness of all services
  • Support, monitor and manage cloud infrastructure and environments (AWS EC2, EKS, DNS, load balancers, and databases)
  • Identify, document and help resolve performance and operational efficiency challenges

Experience & Skills Required:

  • 3+ years of cloud platform experience (AWS preferred)
  • 3+ years of system programming (Python, Go, Rust, or equivalent)
  • Experience developing, supporting or operating large-scale, distributed SaaS products
  • Container orchestration systems management (Kubernetes, Helm, Docker)
  • Configuration management and automation tools such as Ansible, Terraform, etc.
  • Experience with CI/CD tools and implementing best practices
  • Solid grasp of cloud fundamentals such as networking, load balancing, DNS, and security

Preferred:

  • A passion for system stability, performance, scalability and customer success
  • Desire to automate tedious tasks and eliminate inefficiencies
  • Experience refactoring apps to fit cloud native paradigms and scaling cloud native apps
  • Previous participation in Incident Management teams
  • Experience with Redis / Cassandra / Consul
  • Experience with Linux operating system internals and networking (file systems, system calls, tunables, etc.)
  • Experience with protocols (TCP, HTTP, HTTPS)
  • Experience with secrets management (Vault, AWS Secrets Manager)
  • Experience operating Kubernetes clusters at scale
  • BS or MS in Computer Science or a related technical field

Benefits:

  • Unlimited (within reason) paid time off
  • Flexible hours
  • Remote work 
  • Stock options
  • All the hardware that you need to work effectively
  • Monthly stipend of 800 PLN net for: Medicover, Warta insurance, internet, phone, gym membership, home office setup
  • Brand new office in the center of Katowice
  • Monthly voucher for Pyszne.pl

Note: This position is not eligible for visa sponsorship and will be a B2B contract hire.
Published: 05.01.2022

Meet the company

PubNub

PubNub is a dynamic technology company that creates innovative solutions for modern business. We specialize in software development, mobile applications, and data management systems.
