Job Summary:
As a member of PubNub's Engineering organization, you will work alongside Engineers and Architects in designing, developing, operating and scaling PubNub's global Data Stream Network, with a focus on improving its reliability, scale and efficiency. The infrastructure you will manage generates billions of events and terabytes of data every day. You will have the unique opportunity to help architect PubNub's infrastructure to solve challenging problems in distributed systems, real-time messaging, and large-scale data management.
Responsibilities:
- Design processes for improving operational stability of PubNub services
- Collaborate with engineering teams, product owners, and other stakeholders to develop tooling and documentation to scale our distributed systems
- Ensure and enforce best application and network security practices
- Participate in incident management on-call rotation and drive root cause analysis
- Help define Service Level Objectives to assess release readiness of all services
- Support, monitor and manage cloud infrastructure and environments (AWS EC2, EKS, DNS, load balancers, and databases)
- Identify, document and help resolve performance and operational efficiency challenges
Experience & Skills Required:
- 3+ years of cloud platform experience (AWS preferred)
- 3+ years of systems programming (Python, Go, Rust, or equivalent)
- Experience developing, supporting or operating large-scale, distributed SaaS products
- Experience managing container orchestration systems (Kubernetes, Helm, Docker)
- Experience with configuration management and automation tools such as Ansible and Terraform
- Experience with CI/CD tools and implementing best practices
- Solid understanding of cloud fundamentals such as networking, load balancing, DNS, and security
Preferred:
- A passion for system stability, performance, scalability and customer success
- Desire to automate tedious tasks and eliminate inefficiencies
- Experience refactoring apps to fit cloud native paradigms and scaling cloud native apps
- Previous participation in Incident Management teams
- Experience with Redis, Cassandra, or Consul
- Experience with Linux operating system internals and networking (file systems, system calls, tunables, etc.)
- Experience with protocols (TCP, HTTP, HTTPS)
- Experience with secrets management (Vault, AWS Secrets Manager)
- Experience operating Kubernetes clusters at scale
- BS or MS in Computer Science or a related technical field
Benefits:
- Unlimited (within reason) paid time off
- Flexible hours
- Remote work
- Stock options
- All the hardware that you need to work effectively
- Monthly stipend of 800 PLN net for: Medicover, Warta insurance, Internet, phone, gym membership, home office setup
- Brand new office in the center of Katowice
- Monthly voucher for Pyszne.pl
Note: This position is not eligible for visa sponsorship and will be a B2B contract hire.