Elastic Observability Specialist
Introduction & Summary
We are seeking an experienced Observability Specialist dedicated to ensuring the reliability and performance of our systems. This role involves collaborating with enterprise architects and IT professionals to design, implement, and oversee a scalable telemetry infrastructure. The ideal candidate will possess deep expertise in ELK or similiar technologies and modern telemetry standards.
Main Responsibilities
As our Observability Engineer, your core duties will include:
Architectural Collaboration: Partner with system architects and local engineering teams in Denmark to design resilient monitoring solutions.
Monitor Kubernetes environments with OpenTelemetry (OTel) standards for logs, traces, and metrics.
Manage centralized data collection and automate Elastic deployments using Ansible.
Utilize Elastic APM for identifying code-level bottlenecks and resolving latency issues.
Implement AIOps configurations for proactive anomaly detection and automated root-cause analysis.
Drive Site Reliability Engineering (SRE) methodologies across teams.
Elastic Stack Management: Deploy, scale, and maintain Elasticsearch, Logstash, and Kibana (ELK) environments.
Key Requirements
Cloud-Native Observability: Strong skills in monitoring Kubernetes (Openshift)
environments and integrating with major cloud providers.
APM & Distributed Tracing: Expertise in Application Performance Monitoring (APM) to
identify code-level bottlenecks and latency issues.
OpenTelemetry (OTel): Hands-on experience implementing OpenTelemetry (or similiar) standards
for logs, traces, and metrics to ensure vendor-neutral telemetry.
Infrastructure as Code (IaC): Proficiency in automating Elastic environments with
Ansible.
Performance Engineering: Expert-level knowledge of shard optimization, mapping,
and Index Lifecycle Management (ILM) to balance high performance with cost control.
SRE Methodology: Experience defining and monitoring Service Level Objectives
(SLOs) and managing Error Budgets.
Strong communication skills for collaboration with IT teams.
NIce to Have:
Elastic Stack Mastery: Deep expertise in architecting and managing Elasticsearch,
Logstash, and Kibana (ELK) at scale.
Data Ingestion & Fleet: Proven experience deploying Elastic Agent and Fleet for
centralized agent management and data collection.
AIOps & Machine Learning: Ability to configure Elastic ML models for proactive
anomaly detection and automated root cause analysis.
Other Details
This is position based in Warsaw, with 3 days Hybrid model, focused on leading-edge observability solutions in a dynamic and collaborative environment.
Elastic Observability Specialist
Elastic Observability Specialist