Quantcast
Channel: Monitoring-Portal Feed
Viewing all articles
Browse latest Browse all 1338

OMD 1.21.20150729 - Wato Notifications escalation questions

$
0
0
Hi everyone, I've been setting up OMD to monitor our internal infrastructure with good success so far. I've been setting up an escalation to send emails, and after 10 minutes, an SMS if the problem is still not taken care of or acknowledged. This works fine following the guideline on https://mathias-kettner.de/checkmk_flexi…ifications.html, by adding a notification interval, and also using nth to the mth notifications.

I've been hitting a problem with Nth to the Mth notifications though. Let's say I set it up on a notification interval of 10 minutes, and I have 2 flexible notifications: 1 to 5th by email, 2nd by SMS. If a service goes from OK to CRIT, stays that way for 20 minutes, and goes back to OK, everything is fine, and works like I want it to. However, if the problem stays for 120 mins, the number of notifications goes over 50 mins, I get the 5 notifications by email every 10 mins, I get the SMS on the 10th minute, but since I go over 50 minutes for the problem, I never get the notification that the service goes back to OK status.

I tried looking around for settings that would make the change from WARN or CRIT to OK as a new notification chain, not counting against the Nth to the Mth notifications set up in the emails, but I have yet to hit the jackpot. Ideally, I<d like to get the 1st notification by email, 2nd and 3rd by SMS, and only get notified by email when the problem is fixed (to get the OK email, which my manager will look for to know problem is fixed). Seems that I cannot easily get that behaviour as it is right now, if I limit Nth to the Mth for the email notification.

I'd appreciate your help if anyone has a few pointers for me, as I am not having much success setting it up to get the desired behaviour (1st warning by email, 2nd and 3rd by sms, and get the OK email when problem is fixed)

Thanks in advance for your time!

Viewing all articles
Browse latest Browse all 1338