DevOps Observability Engineer
Summary: The DevOps Observability Engineer improves system monitoring and visibility through advanced tooling and customization, ensuring reliable performance and swift incident resolution.
Main Responsibilities:
Design, build, and maintain advanced dashboards using Grafana.
Develop and integrate custom Prometheus exporters to meet specific monitoring requirements.
Provide Level 3 (L3) support and troubleshooting for complex monitoring and observability incidents.
Maintain and continuously improve the observability stack: Prometheus, Grafana, and Loki.
Automate deployments, upgrades, and maintenance using Ansible (including exporters and observability components).
Key Requirements:
Strong expertise in Prometheus, Grafana, and Loki (integration, customization, and troubleshooting).
Solid scripting skills (Bash, Python, or similar) for automation.
Experience with Ansible for configuration management and deployment automation.
Strong automation mindset with a focus on reliability and efficiency.
Nice to Haves:
Experience with GitOps tools such as Argo CD.
Experience with container management platforms like Rancher.
Familiarity with Kubernetes environments and cloud-native architectures.
Other Details:
Profile: Passion for observability, monitoring, and logging systems.
Work Style: Autonomous, detail-oriented, and comfortable handling complex production issues.
Technical Growth: Ability to evolve a technical stack while ensuring high availability and stability.