Senior Site Reliability Engineer

PubNub

Kraków

Type of work

Undetermined

Experience

Senior

Employment Type

B2B

Operating mode

Remote

Tech stack

Cloud Platforms

advanced

Python

advanced

Go

advanced

Rust

advanced

Kubernetes

regular

CI/CD

regular

Job description

Online interview

Job Summary:

As a member of PubNub's Engineering organization, you will work alongside Engineers and Architects to design, develop, operate and scale PubNub's global Data Stream Network, with a focus on improving its reliability, scale and efficiency. The infrastructure you will manage generates billions of events and terabytes of data every day. You will have the unique opportunity to help architect PubNub's infrastructure to solve challenging problems in distributed systems, real-time messaging, and large-scale data management.

Responsibilities:

Design processes for improving operational stability of PubNub services
Collaborate with engineering teams, product owners, and other stakeholders to develop tooling and documentation to scale our distributed systems
Ensure and enforce best application and network security practices
Participate in incident management on-call rotation and drive root cause analysis
Help define Service Level Objectives to assess release readiness of all services
Support, monitor and manage cloud infrastructure and environments (AWS EC2, EKS, DNS, load balancers, and databases)
Identify, document and help resolve performance and operational efficiency challenges

Experience & Skills Required:

3+ years of cloud platform experience (AWS preferred)
3+ years of system programming (Python, Go, Rust, or equivalent)
Experience developing, supporting or operating large-scale, distributed SaaS products
Experience managing container orchestration systems (Kubernetes, Helm, Docker)
Experience with configuration management and automation tools such as Ansible and Terraform
Experience with CI/CD tools and implementing best practices
Solid grounding in cloud fundamentals such as networking, load balancing, DNS, and security

Preferred:

A passion for system stability, performance, scalability and customer success
Desire to automate tedious tasks and eliminate inefficiencies
Experience refactoring apps to fit cloud native paradigms and scaling cloud native apps
Previous participation in Incident Management teams
Experience with Redis, Cassandra, or Consul
Experience with Linux operating system internals and networking (file systems, system calls, tunables, etc.)
Experience with protocols (TCP, HTTP, HTTPS)
Experience with secrets management (Vault, AWS Secrets Manager)
Experience operating Kubernetes clusters at scale
BS or MS in Computer Science or a related technical field

Benefits:

Unlimited (within reason) paid time off
Flexible hours
Remote work
Stock options
All the hardware that you need to work effectively
Monthly stipend of 800 PLN net for: Medicover, Warta insurance, Internet, phone, gym membership, home office setup
Brand new office in the center of Katowice
Monthly voucher for Pyszne.pl

Note: This position is not eligible for Visa sponsorship and will be a B2B contract hire.
