Assignment description
We are looking for an Observability and Monitoring Specialist to join a team working with incident management and system support for a business-critical workshop solution.
In this role, you will take ownership of monitoring and observability, ensuring that dashboards, alarms, and logging capabilities provide clear insights and enable early detection of issues. You will play a key role in improving system reliability and supporting efficient troubleshooting.
You will use monitoring as a primary tool for initial incident investigation and help route issues to the appropriate development teams. The role also includes supporting major incident handling and contributing to root cause analysis and continuous improvements.
You will collaborate closely with development teams and other stakeholders, acting as a bridge between support and engineering, and ensuring that monitoring is aligned with both technical needs and real user impact.
Qualifications:
- Experience working with monitoring or observability for software services.
- Experience with Datadog or comparable monitoring tools (e.g., Grafana, Prometheus).
- Strong interest in IT, systems, and troubleshooting.
- Ability to use monitoring and dashboards as a tool for initial incident investigation.