Observability Engineer
Join us, and enhance system reliability with advanced monitoring solutions!
Krakow-based opportunity with a hybrid working model: 6 days in the office per month!
As an Observability Engineer, you will be working for our client, a leading global financial services organization that focuses on enhancing critical IT infrastructure services. The project you'll be working on is centered around monitoring and observability for one of their key applications. Your role will involve building and implementing monitoring solutions, providing consultancy, and collaborating with internal teams to ensure the performance and reliability of the system. The team is highly collaborative and committed to continuously improving the monitoring and observability services for mission-critical applications.
Your main responsibilities:
- Implementing and maintaining observability and monitoring frameworks
- Collaborating with application teams to set up observability for their infrastructure and applications
- Designing and optimizing dashboards, visualization, and self-healing solutions
- Building performance and tracing solutions using Splunk, AppDynamics, and ThousandEyes
- Engineering and establishing standards for functional components, including agent deployments and application tuning
- Automating operational tasks through scripting and seeking integration opportunities
- Assisting in training sessions to promote tool adoption and best practices
- Providing input for improving global monitoring and observability operating models
- Adhering to internal policies and raising concerns on potential issues
- Continuously evolving monitoring tooling towards a self-service automated platform
You're ideal for this role if you have:
- 2+ years of experience working with Splunk, AppDynamics, or ThousandEyes
- Experience with application development (preferably Java) at an enterprise level
- Knowledge of cloud technologies such as AWS or GCP
- Experience with Kubernetes, OpenShift, PCF, and other architecture tech stacks
- Familiarity with monitoring and observability solutions, including server and network performance
- Practical knowledge of distributed service design, messaging protocols, and autonomous software design practices
- Strong understanding of application performance metrics and KPIs
- Experience with event management tools and operational automation like AIOps
- Ability to develop and optimize monitoring extensions using REST API
- Excellent communication skills and the ability to work independently and in teams
It is a strong plus if you have:
- Experience working with ServiceNow, Confluence, and Jira
- Knowledge of machine learning and AI/ML concepts in relation to observability
- Familiarity with Elasticsearch, Grafana, Prometheus
- Experience defining and supporting monitoring dashboards for mission-critical applications
- Technical writing experience for queries, reports, and presentations
We offer you:
ITDS Business Consultants is involved in many various, innovative and professional IT projects for international companies in the financial industry in Europe. We offer an environment for professional, ambitious, and driven people. The offer includes:
- Stable and long-term cooperation with very good conditions
- Enhance your skills and develop your expertise in the financial industry
- Work on the most strategic projects available in the market
- Define your career roadmap and develop yourself in the best and fastest possible way by delivering strategic projects for different clients of ITDS over several years
- Participate in Social Events, training, and work in an international environment
- Access to attractive Medical Package
- Access to Multisport Program
- Access to Pluralsight
Internal job number #6198