Monitoring Engineer (Poland / Remote), Rzeszów
ZEN
We’re looking for a Monitoring Engineer to join us in our Cracow, Rzeszów, or Warsaw offices, or remotely. This role plays a critical part in early issue detection, incident response, and operational excellence by providing real-time insights into infrastructure, applications, and business-critical components.
Requirements
- A university degree in IT or a related field, and at least 3 years of experience in the IT industry.
- Hands‑on experience with monitoring tools such as Grafana, Prometheus, VictoriaMetrics, Zabbix, CheckMK, Datadog, Nagios, InfluxDB, or ELK Stack.
- Ability to configure alerts and dashboards.
- Understanding of protocols and data formats such as SNMP, ICMP, REST API, JSON, and YAML.
- Basic knowledge of Linux/Unix environments.
- Basic understanding of networking (TCP/IP, DNS, routing, load balancing).
- Ability to analyze logs and correlate events (e.g., using Grafana Loki, ELK Stack, or Graylog).
- Strong analytical and problem‑solving abilities.
- Ability to work independently and take full ownership of assigned tasks.
- Team player with excellent communication skills, able to collaborate with development and operations teams.
- Proficient in English (able to read technical documentation and communicate clearly).
Nice to Have
- Understanding of distributed systems and High Availability (HA) concepts.
- Experience with scripting languages (e.g., Bash, Python, or Go).
- Familiarity with incident and ticket management tools (e.g., Jira, Opsgenie, PagerDuty).
- Experience with event streaming or message queue systems (e.g., Kafka, RabbitMQ).
- Familiarity with CI/CD pipelines and containerization technologies (e.g., Docker, Kubernetes).
- Knowledge of ITIL, DevOps, or SRE principles.
- Experience working with high‑cardinality metrics and large‑scale telemetry data.
Responsibilities
- Design, implement, and maintain monitoring solutions for infrastructure, applications, and services.
- Create, improve, and manage dashboards, metrics, and alerting rules in tools like Grafana, Prometheus, VictoriaMetrics, or InfluxDB.
- Collaborate with development, DevOps, and operations teams to ensure observability of all critical systems.
- Analyze incidents and monitoring data to detect trends, performance issues, and anomalies.
- Optimize alerting strategies to reduce noise and improve detection of real issues.
- Integrate monitoring tools with other platforms (e.g., alerting systems like Opsgenie, ticketing tools like Jira).
- Maintain documentation related to monitoring processes, standards, and best practices.
- Continuously evaluate and improve monitoring infrastructure to support scalability, reliability, and efficiency.
- Work in a 24/7 shift system.
What we offer
- Real influence on shaping ZEN.COM.
- Work in an environment where innovation and effectiveness truly matter.
- Competitive salary and flexible working conditions.
- Private medical healthcare.
- Internal and external training opportunities.
Job offer posted 3 days ago