Senior Site Reliability Engineer (SRE)

DevOps

Rondo Daszyńskiego, Warszawa (+1 other location)

Grid Dynamics Poland

Full-time
Permanent, B2B
Senior
Hybrid

Job description

We are looking for an experienced Senior Site Reliability Engineer to join our team and oversee the reliability, resilience, and performance of our core enterprise products.

In this role, you will bridge the gap between infrastructure operations and software engineering. You won't just react to alerts: you will proactively analyze system architecture, build automation, and dive deep into the application code (Java/Spring Boot) to fix bugs and eliminate issues at their root.

Responsibilities:

  • Architecture & Reliability: Understand the end-to-end product topology from both infrastructure and application perspectives. Identify bottlenecks, scaling limitations, and unstable components, and drive long-term resolutions before these issues impact production.

  • Incident Response & RCA: Respond to outages, provide L3 on-call technical support (on rotation), and perform blameless Root Cause Analysis (RCA) to implement permanent fixes.

  • Hands-on Engineering: Address defects, fix code bugs directly in production, and recommend architectural improvements during incident analysis.

  • Security & Vulnerability Management: Oversee vulnerability management for applications and containers, manage patching processes, ensure compliance, and monitor certificate expirations and renewals according to global best practices.

  • SRE Advocacy & SDLC: Represent the SRE organization in design reviews, capacity planning, and operational readiness exercises. Partner closely with development teams to embed reliability best practices early in the SDLC.

  • Automation & Mentoring: Build automation tools to reduce manual toil and improve efficiency. Spread SRE culture, create standard documentation, and provide technical mentorship to junior team members.

  • System Health: Oversee the production environment by tracking availability, applying learnings from observability tools, and becoming a Subject Matter Expert (SME) on core issuing products.

Minimum requirements:

  • Experience: 5+ years in Site Reliability Engineering (SRE) or Platform Engineering roles.

  • Software Engineering: Strong proficiency in Java, Spring Boot, Hibernate, and Jenkins. Ability to read, analyze, and fix application code.

  • Containerization: Hands-on expertise with Docker and container orchestration using Kubernetes.

  • Infrastructure: Deep knowledge of Linux systems, networking, and distributed architectures.

  • Observability: Strong understanding of monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Splunk).

  • Education: Bachelor’s degree in Computer Science, Systems Engineering, or equivalent practical experience.

  • Soft Skills: Excellent problem-solving abilities and strong communication skills.

Would be a plus:

  • Infrastructure as Code & Cloud: Hands-on experience with tools like Terraform or Ansible, alongside familiarity with major public cloud providers (AWS, GCP, or Azure).

  • Advanced Networking & Service Mesh: Knowledge of service mesh technologies (e.g., Istio, Linkerd) for traffic management, security, and observability in microservices architectures.

  • Industry Experience: Previous background in the FinTech, payments, or banking sectors, with an understanding of high-security compliance standards (e.g., PCI-DSS).

We offer:

  • Opportunity to work on bleeding-edge projects

  • Work with a highly motivated and dedicated team

  • Competitive salary

  • Flexible schedule

  • Benefits package: medical insurance, sports

  • Corporate social events

  • Professional development opportunities

  • Well-equipped office

About us:

Grid Dynamics (NASDAQ: GDYN) is a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership in enterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud & DevOps, application modernization and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.

Tech stack

  • English: B2

  • Java: advanced

  • Spring Boot: advanced

  • Hibernate: advanced

  • Docker: advanced

  • Kubernetes: advanced

  • Linux: advanced
