Channel: Monitoring-Portal Feed
Viewing all 1338 articles
Browse latest View live

"Backend icinga is not running" PSQL - IcingaWeb2

Hi,

After adding all of my remote servers and setting up their checks via SNMP, I am now getting a notification that the backend "icinga" is not running.
The Icinga2 service is started and running. I noticed that some threads mention the MySQL database in this context, but I have deployed with PostgreSQL.

Below are the contents of /etc/icinga2/features-enabled/ido-pgsql.conf. I am not sure whether this is related to the issue. Any help is appreciated.

/**
 * The db_ido_pgsql library implements IDO functionality
 * for PostgreSQL.
 */

library "db_ido_pgsql"

object IdoPgsqlConnection "ido-pgsql" {
  //user = "icinga"
  //password = "icinga"
  //host = "localhost"
  //database = "icinga"
}

Icinga2 retention state - host unknown status when restarting icinga2

Hello,

I have been trying to find out how to turn on state retention for Icinga2, because after a restart some pings "fail" and hosts go to either DOWN or UNKNOWN.

Where can I configure this so that it doesn't happen when restarting Icinga2?

I'm on CentOS 7 with the latest Icinga2 and Icingaweb2.

Not all local checks executed when running a HA-cluster

Hello,

I work in a company that is using a non-clustered Icinga1. Now we would like to switch to HA Icinga2. I created a test setup with two machines and initialized the cluster. Clustering works fine and the "cluster-zone" check is green. We're using Debian Jessie and the current Icinga2 version from the debmon repository.

The docs state that the "cluster" and the "cluster-zone" checks should be configured locally. So I created a separate directory, included it in icinga2.conf and added the check conf files.

My problem now is that some local checks (ssh, cluster) stall after a few executions, while others (cluster-zone, icinga) keep running. There are no errors in the debug log, and I can see the output of e.g. the ssh check running successfully several times:

[2016-01-22 09:14:40 +0100] debug/CheckerComponent: Executing check for 'icinga2!ssh'
[2016-01-22 09:14:40 +0100] notice/Process: Running command '/usr/lib/nagios/plugins/check_ssh' '192.168.110.102': PID 24903
[2016-01-22 09:14:40 +0100] debug/CheckerComponent: Check finished for object 'icinga2!ssh'
[2016-01-22 09:14:40 +0100] notice/Process: PID 24903 ('/usr/lib/nagios/plugins/check_ssh' '192.168.110.102') terminated with exit code 0
[2016-01-22 09:14:40 +0100] debug/Checkable: No origin or local origin for object 'icinga2!ssh', setting icinga2 as check_source.
[2016-01-22 09:14:40 +0100] notice/Checkable: State Change: Checkable icinga2!ssh hard state change from UNKNOWN to OK detected.
[2016-01-22 09:16:47 +0100] debug/CheckerComponent: Executing check for 'icinga2!ssh'
[2016-01-22 09:16:47 +0100] notice/Process: Running command '/usr/lib/nagios/plugins/check_ssh' '192.168.110.102': PID 25116
[2016-01-22 09:16:47 +0100] debug/CheckerComponent: Check finished for object 'icinga2!ssh'
[2016-01-22 09:16:47 +0100] notice/Process: PID 25116 ('/usr/lib/nagios/plugins/check_ssh' '192.168.110.102') terminated with exit code 0
[2016-01-22 09:16:47 +0100] debug/Checkable: No origin or local origin for object 'icinga2!ssh', setting icinga2 as check_source.


But after a while some checks stall and are not executed anymore. If I stop the ssh daemon, there is no state change.
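
One way to quantify the stall is to compare the gap between two "Executing check" lines with the configured interval. The following is a minimal sketch, assuming the ssh service on icinga2 uses the same check_interval = 20s as the icinga1 definition shown in this post; the helper names are made up for illustration.

```python
from datetime import datetime

# Two "Executing check" lines copied from the debug log above.
LOG_LINES = [
    "[2016-01-22 09:14:40 +0100] debug/CheckerComponent: Executing check for 'icinga2!ssh'",
    "[2016-01-22 09:16:47 +0100] debug/CheckerComponent: Executing check for 'icinga2!ssh'",
]

CHECK_INTERVAL = 20  # seconds; assumed to match the "ssh" service definition


def execution_times(lines, checkable):
    """Return the timestamps at which `checkable` was executed."""
    marker = "Executing check for '%s'" % checkable
    times = []
    for line in lines:
        if marker in line:
            # The timestamp sits between the leading "[" and the first "]".
            stamp = line[1:line.index("]")]
            times.append(datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S %z"))
    return times


def gaps_in_seconds(times):
    """Return the distance between consecutive executions in seconds."""
    return [int((b - a).total_seconds()) for a, b in zip(times, times[1:])]


gaps = gaps_in_seconds(execution_times(LOG_LINES, "icinga2!ssh"))
late = [g for g in gaps if g > CHECK_INTERVAL]
print(gaps, late)
```

Running this over a longer stretch of the debug log would show whether the gaps grow steadily or whether the check simply stops being scheduled.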

My config files look like this:

zones.conf

object Endpoint "icinga1" {
	host = "192.168.110.101"
}

object Endpoint "icinga2" {
	host = "192.168.110.102"
}

object Zone "monitor-cluster" {
  endpoints = [ "icinga1", "icinga2" ]
}


icinga2.conf

include "constants.conf"
include "zones.conf"
include <itl>
include <plugins>
include "features-enabled/*.conf"
include_recursive "repository.d"
include_recursive "local.d"


local.d/icinga1.conf

object Host "icinga1" {
	check_command = "hostalive"
	address = "192.168.110.101"
}

object Service "cluster-zone" {
	check_command = "cluster-zone"
	check_interval = 5s
	retry_interval = 1s
	vars.cluster_zone = "monitor-cluster"
	host_name = "icinga1"
}

object Service "cluster" {
	check_command = "cluster"
	check_interval = 5s
	retry_interval = 1s
	host_name = "icinga1"
}

object Service "icinga" {
	check_command = "icinga"
	check_interval = 30s
	retry_interval = 10s
	host_name = "icinga1"
}

object Service "ssh" {
	check_command = "ssh"
	check_interval = 20s
	retry_interval = 10s
	host_name = "icinga1"
}


I created a test setup a while ago and had the same problem. I asked the same question in this forum then, but got no answer. After some googling I think I'm not alone with this problem. See:

http://www.monitoring-portal.org/wbb/ind…&threadID=32800
http://www.monitoring-portal.org/wbb/ind…&threadID=33066
http://www.monitoring-portal.org/wbb/ind…&threadID=32409
https://dev.icinga.org/issues/8052

But no solution was given.

Can you help me? If I should provide more information, please let me know.

Thanks, Dieter

check_logfiles multiple excludes

Hello everyone,

I am currently setting up a PoC for Icinga and am working on Windows systems and the event log.
For this I found the check_logfiles plugin, which is very powerful.
Unfortunately I don't fully understand its syntax yet. Maybe someone has been in the same situation and can help me out.
I want to monitor the Windows event log "System" and match on all errors.
I also already have an exclude list of errors that I do not want to monitor.
The filter should not only look at event IDs but be explicit, i.e. use:
Source, EventID, Level, Message

Is there a way to apply multiple excludes?

For example, I would like to exclude the following errors from the System log in a single query:

"System,TermServDevices",
"System,TermService",
"System,Print",
"System,LSASRV",
"System,WindowsUpdateClient,16",
"System,Ntfs,137",
"System,Ntfs,57",
"System,VDS Basic Provider,1",

Below is a draft of the configuration I used for testing.
What puzzles me is that the exclude defaults to "or".
When I set the operation to "and", it had no effect and the error was still reported as a match.

$options = 'report=long';
@searches = ({
  tag => 'system_events',
  type => 'eventlog',
  eventlog => {
    eventlog => 'application',
    include => {
      eventtype => 'error,warning',
    },
    exclude => {
      # Note: repeated keys in a Perl hash overwrite each other,
      # so only the last source/eventid pair below survives.
      source => 'VSS',
      eventid => '12289',
      source => 'Ntfs',
      eventid => '157',
      source => 'Ntfs',
      eventid => '137',
      operation => 'and',
    },
  },
  options => 'eventlogformat="%w src:%s id:%i %m",winwarncrit,lookback=24h',
  criticalpatterns => '.*',
});
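
The kind of multi-criteria exclusion asked for in this post can be sketched independently of the plugin. The example below is not check_logfiles syntax and not how the plugin works internally; it only illustrates the intended logic in Python, assuming each entry of the exclude list reads as "log,source[,eventid]" and that an event is dropped when any one rule matches all of the fields it specifies. The sample events are hypothetical.

```python
# Exclude rules taken from the list in the post: "log,source[,eventid]".
EXCLUDES = [
    "System,TermServDevices",
    "System,TermService",
    "System,Print",
    "System,LSASRV",
    "System,WindowsUpdateClient,16",
    "System,Ntfs,137",
    "System,Ntfs,57",
    "System,VDS Basic Provider,1",
]


def parse_rule(text):
    """Split "log,source[,eventid]" into a dict; a missing eventid means "any"."""
    parts = text.split(",")
    rule = {"log": parts[0], "source": parts[1]}
    if len(parts) > 2:
        rule["eventid"] = int(parts[2])
    return rule


RULES = [parse_rule(r) for r in EXCLUDES]


def is_excluded(event, rules=RULES):
    """True if at least one rule matches every field it specifies."""
    return any(
        all(event.get(key) == value for key, value in rule.items())
        for rule in rules
    )


# Hypothetical events for illustration only.
ntfs_137 = is_excluded({"log": "System", "source": "Ntfs", "eventid": 137})
ntfs_157 = is_excluded({"log": "System", "source": "Ntfs", "eventid": 157})
any_print = is_excluded({"log": "System", "source": "Print", "eventid": 4711})
print(ntfs_137, ntfs_157, any_print)
```

Within a rule the fields are combined with "and", while the rules themselves are combined with "or", which is the behaviour the exclude list describes.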


Thanks in advance for your help

Regards, Tobi

OMD - Internal Server Error - CentOS 7.2

Hello,
on a freshly installed CentOS 7.2 with OMD 1.30 from the ConsolLabs repo, I only get
an error 500 instead of the welcome screen (this also affects Check_MK).

/var/log/apache/error_log says:

lib-dynload', '/usr/lib64/python2.7/site-packages', '/usr/lib/python2.7/site-packages']".
[Fri Jan 22 11:21:01.675757 2016] [:error] [pid 32090] get_interpreter: no interpreter callback found.
[Fri Jan 22 11:21:01.675779 2016] [:error] [pid 32090] [client 127.0.0.1:56011] python_handler: Can't get/create interpreter.
[Fri Jan 22 11:21:01.689692 2016] [:error] [pid 32090] make_obcallback: could not import mod_python.apache.\n
Traceback (most recent call last):
  File "/omd/versions/1.30/lib/python/mod_python/apache.py", line 29, in <module>
	import cgi
  File "/usr/lib64/python2.7/cgi.py", line 50, in <module>
	import mimetools
  File "/usr/lib64/python2.7/mimetools.py", line 6, in <module>
	import tempfile
  File "/usr/lib64/python2.7/tempfile.py", line 35, in <module>
	from random import Random as _Random
  File "/usr/lib64/python2.7/random.py", line 49, in <module>
	import hashlib as _hashlib
  File "/omd/versions/1.30/lib/python/hashlib.py", line 115, in <module>
	f()
TypeError: 'frozenset' object is not callable
[Fri Jan 22 11:21:01.690253 2016] [:error] [pid 32090] make_obcallback: Python path being used "['/omd/versions/1.30/lib/python', '/omd/versions/1.30/lib/python', '/omd/sites/testsite130/lib/python/pyasn1-0.1.7-py2.7.egg', '/omd/sites/testsite130/lib/python/pysnmp-4.2.4-py2.7.egg', '/omd/sites/testsite130/lib/python/pycrypto-2.6.1-py2.7-linux-x86_64.egg', '/omd/sites/testsite130/lib/python/pysphere-0.1.7-py2.7.egg', '/omd/sites/testsite130/lib/python/setuptools-0.6c11-py2.7.egg', '/omd/sites/testsite130/lib/python/Pyro4-4.18-py2.7.egg', '/omd/sites/testsite130/lib/python/multiprocessing-2.6.2.1-py2.7-linux-x86_64.egg', '/omd/sites/testsite130/lib/python/pymongo-2.5.2-py2.7-linux-x86_64.egg', '/omd/sites/testsite130/lib/python', '/omd/sites/testsite130/local/lib/python', '/usr/lib64/python27.zip', '/usr/lib64/python2.7', '/usr/lib64/python2.7/plat-linux2', '/usr/lib64/python2.7/lib-tk', '/usr/lib64/python2.7/lib-old', '/usr/lib64/python2.7/lib-dynload', '/usr/lib64/python2.7/site-packages', '/usr/lib/python2.7/site-packages']".
[Fri Jan 22 11:21:01.690265 2016] [:error] [pid 32090] get_interpreter: no interpreter callback found.
[Fri Jan 22 11:21:01.690293 2016] [:error] [pid 32090] [client 127.0.0.1:56011] python_handler: Can't get/create interpreter.


Apparently the same thing happens with a 2.10 from ConsolLabs. I already tried to fix the problem with mod_python from http://dist.modpython.org/dist/, but that did not work on the first attempt.

The front ends of Thruk and Nagios work without problems. Does anyone have an idea?

Regards,
nasowas

High CPU load on OMD host

Hi,

I have recently run into a strange problem. We run OMD 1.30 here on a virtual machine with 1 GB of RAM and one socket with one core assigned. It monitors about 70 systems, mainly Windows servers.

Recently the browser (tested with Firefox and Chrome on Windows 7) has been responding very sluggishly; I noticed this when displaying the entries of a Windows event log.

When I look at the server, I see several Python processes and occasionally two Apache2 processes as well. Together these processes generate a load between 80% and 98%, which of course leads to the sluggish display in the browser. The load persists even when the browser is closed.

The system is a Debian 7 with OMD 1.3.

I cannot link the onset of the problem to any particular change; I work on the system every few weeks.

A tip on how to find out which calls are generating so much load would already be enough for me.

Thanks a lot in advance

Thomas

How to import an SNMP MIB for an AXIS camera

Determine the age of a file? checkdisk checkfiles checkfile2?

Hi,

I am slowly getting desperate over this problem:
The goal is to check a file with a variable name (always an .iso) for age and existence (and possibly size).

I have already looked at pretty much every thread on this about checkdisk, checkfiles and checkfile2.
I just can't get it to work :/

The nsclient version in use is: 0.4.1.102

Can someone explain what is going wrong here? I am also grateful for any other hint or alternative :)

The check:

/usr/lib/nagios/plugins/check_nrpe -H ****** -c checkfile2 -a path=**** pattern=*.iso MaxCrit=1 filter+creation=\>24h
CHECK_NRPE: No output returned from daemon.



Log file:

2016-01-22 16:49:30: d:D:\source\nscp\trunk\include\nrpe/server/protocol.hpp:66: Accepting connection from: ***********
2016-01-22 16:49:30: d:..\..\..\trunk\service\NSClient++.cpp:960: Injecting: checkfile2...
2016-01-22 16:49:30: d:..\..\..\trunk\service\NSClient++.cpp:985: Result checkfile2: BAD_CODE: -1
2016-01-22 16:49:30: e:..\..\..\..\trunk\include\nscapi\nscapi_core_wrapper.cpp:181: No handler for command 'checkfile2'.


Thanks & regards


//edit
Ini to follow

OMD 1.20 does not autostart on CentOS 7.2

Hello,

I currently have the problem that my freshly installed OMD 1.20 on CentOS 7.2 does not start automatically after a reboot.

OMD - Open Monitoring Distribution Version 1.20


chkconfig

omd         	0:off   1:off   2:on	3:on	4:on	5:on	6:off


omd status

[root@nagios python]# omd status
Doing 'status' on site site1:
rrdcached:  	stopped
npcd:       	stopped
nagios:     	stopped
apache:     	stopped
crontab:    	stopped
-----------------------
Overall state:  stopped


omd start

[root@nagios python]# omd start
Doing 'start' on site site1:
Starting rrdcached...OK
Starting npcd...OK
Starting nagios...OK
Starting dedicated Apache for site site1...OK
Initializing Crontab...OK

[root@nagios python]# omd status
Doing 'status' on site site1:
rrdcached:  	running
npcd:       	running
nagios:     	running
apache:     	running
crontab:    	running
-----------------------
Overall state:  running


Starting it manually works without problems. I cannot find anything unusual in the log files.

OS: CentOS Linux release 7.2.1511 (Core) (freshly installed, minimal)

Where else can I look for the cause?

Assigning variable to variable

I am wondering whether it is possible to assign a variable the value of another variable (specifically, to clone an object onto another variable, and more specifically without creating a reference, just a straight clone).

For example:

vars.chassis = {
    make = "Y"
    model = "x"
}
vars.newchassis = vars.chassis
vars.newchassis.make = "z"




where this would result in the following object:

vars.chassis = {
    make = "Y"
    model = "x"
}
vars.newchassis = {
    make = "z"
    model = "x"
}
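
The difference between a reference and a straight clone that this question hinges on can be illustrated outside of Icinga2. The sketch below uses Python dictionaries as a stand-in; it is an analogy for the desired behaviour and makes no claim about how the Icinga2 configuration language handles assignment.

```python
import copy

chassis = {"make": "Y", "model": "x"}

# Plain assignment only creates a second name for the same dictionary,
# so changing it through the alias also changes the original.
alias = chassis
alias["make"] = "z"
aliased_make = chassis["make"]

# Start over and clone instead of aliasing.
chassis = {"make": "Y", "model": "x"}
newchassis = copy.deepcopy(chassis)
newchassis["make"] = "z"

print(aliased_make, chassis, newchassis)
```

With the clone, the original keeps make = "Y" while the copy carries make = "z", which is the result described in the post.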

Fatal error: Class 'Zend_Controller_Request_Http' not found

Hello

This weekend I updated my Raspberry Pi 2, which has Icinga2 and Icingaweb2 installed. Unfortunately, Icingaweb2 has not been working since the update. I always get the error message:


Fatal error: Class 'Zend_Controller_Request_Http' not found in /usr/share/webapps/icingaweb2/library/Icinga/Web/Request.php on line 13

Setup:
[admin@alarmpi4 Web]$ php -version
PHP 7.0.2 (cli) (built: Jan 21 2016 10:23:54) ( NTS )
Copyright (c) 1997-2015 The PHP Group
Zend Engine v3.0.0, Copyright (c) 1998-2015 Zend Technologies

[admin@alarmpi4 Web]$ yaourt -Q zendframework
local/zendframework 1.12.17-1

[admin@alarmpi4 Web]$ sudo pacman -Q icingaweb2
icingaweb2 2.1.2-1

[admin@alarmpi4 Web]$ sudo pacman -Q apache
apache 2.4.18-1

[admin@alarmpi4 Web]$ uname -a
Linux alarmpi4 4.1.15-5-ARCH #1 SMP Wed Jan 20 14:05:45 MST 2016 armv7l GNU/Linux

What I have already done:
I've added the path to the zendframework library in the php.ini file:

[admin@alarmpi4 Web]$ php -r "echo get_include_path();"
.:/php/includes:/usr/share/zendframework/library

I also get the error when I run the file on the command line:
[admin@alarmpi4 Web]$ php Request.php
PHP Fatal error: Class 'Zend_Controller_Request_Http' not found in /usr/share/webapps/icingaweb2/library/Icinga/Web/Request.php on line 13

I am still quite new to this, so please do not hesitate to ask seemingly stupid questions or make such recommendations.

Regards
Michael

OK without PROBLEM at statehistory table

Hello,

Look at the example below. In the row with statehistory_id = 293470 we have an OK with state = 0 and last_state = 3, but as you can see there is no earlier event with state = 3 (UNKNOWN). The full chain is:
WARNING(soft) -> OK(soft) -> OK(hard) -> (???) -> OK(hard).


What could be the reason for this? Is it safe to just delete this "fake" OK? It happens from time to time with different hosts/services.


mysql> SELECT * FROM icinga_statehistory WHERE object_id = 669 AND state_time >= '2016-01-11 00:00:00' AND state_time <= '2016-01-11 23:59:59'\G
*************************** 1. row ***************************
statehistory_id: 292422
instance_id: 1
state_time: 2016-01-11 10:34:28
state_time_usec: 273118
object_id: 669
state_change: 1
state: 1
state_type: 0
current_check_attempt: 1
max_check_attempts: 3
last_state: 0
last_hard_state: 0
output: Load : 21.49 16.14 10.08 : 21.49 > 16 : WARNING
long_output:
check_source: some-host
endpoint_object_id: 113
*************************** 2. row ***************************
statehistory_id: 292432
instance_id: 1
state_time: 2016-01-11 10:39:28
state_time_usec: 256267
object_id: 669
state_change: 1
state: 0
state_type: 0
current_check_attempt: 1
max_check_attempts: 3
last_state: 1
last_hard_state: 0
output: Load : 7.25 13.00 10.68 : OK
long_output:
check_source: some-host
endpoint_object_id: 113
*************************** 3. row ***************************
statehistory_id: 292433
instance_id: 1
state_time: 2016-01-11 10:39:34
state_time_usec: 342102
object_id: 669
state_change: 1
state: 0
state_type: 1
current_check_attempt: 1
max_check_attempts: 3
last_state: 0
last_hard_state: 0
output: Load : 7.23 12.90 10.66 : OK
long_output:
check_source: some-host
endpoint_object_id: 113
*************************** 4. row ***************************
statehistory_id: 293470
instance_id: 1
state_time: 2016-01-11 17:51:28
state_time_usec: 257215
object_id: 669
state_change: 1
state: 0
state_type: 1
current_check_attempt: 1
max_check_attempts: 3
last_state: 3
last_hard_state: 0
output: Load : 4.61 5.40 5.16 : OK
long_output:
check_source: some-host
endpoint_object_id: 113
4 rows in set (0.00 sec)
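
The inconsistency described in this post can be located mechanically. The following is a minimal sketch that assumes the rows are ordered by state_time and that each row's last_state is expected to equal the state of the row before it; that expectation is what the post questions, not a documented guarantee of the IDO schema. The function name is made up for illustration.

```python
# (statehistory_id, state, last_state) copied from the four rows above.
ROWS = [
    (292422, 1, 0),
    (292432, 0, 1),
    (292433, 0, 0),
    (293470, 0, 3),
]


def find_gaps(rows):
    """Return (id, expected_last_state, recorded_last_state) for every row
    whose last_state differs from the state of the preceding row."""
    gaps = []
    for (_, prev_state, _), (row_id, _, last_state) in zip(rows, rows[1:]):
        if last_state != prev_state:
            gaps.append((row_id, prev_state, last_state))
    return gaps


result = find_gaps(ROWS)
print(result)
```

Applied to a full export of icinga_statehistory per object_id, the same check would show how often such unrecorded intermediate states occur before any rows are deleted.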

Logwatching access

English below

Hi,

I am currently stuck on the following problem:

User A should be able to see log errors AND clear the logs. Apparently, however, this only works at host level, so I now have to give the user access to the host.

How can I make sure that the user can manage the host WITHOUT immediately seeing all of its services again?


Example:

Host A
Check 1
Check 3
Check 3
LOGCheck 1

The user should see only LogCheck1 AND be able to "acknowledge" the error or clear the log. (I could live with the user also seeing the host itself, as a necessary evil.)

Thanks in advance.


Check_MK version 1.2.6p14 (OMD .deb installation, hosts maintained via mk files, NOT WATO)


-----

I have the following problem/question:
I want to allow a user to clear logs in Check_MK WITHOUT allowing the user to see/control all services of the specific host (because you need host access to clear logs). Is there any way to do that? I can at least live with access to the host itself.

Environment of the icinga2 daemon

Hi,

I have several printers that I query via the (SNMP) check_printer command. Since I also query a Symbol WLAN AP using its own MIB file, I had to make a change to the net-snmp library, among other things (parameter MAX_IMPORTS).

The command that gets executed is the following:

[2016-01-25 11:00:25 +0100] notice/Process: Running command '/usr/lib/nagios/plugins/check_printer' '192.168.100.121' 'public' 'v2c' 'toner' '3' '' '' '100': PID 29357
[2016-01-25 11:00:26 +0100] notice/Process: PID 29357 ('/usr/lib/nagios/plugins/check_printer' '192.168.100.121' 'public' 'v2c' 'toner' '3' '' '' '100') terminated with exit code 0


When I run the command as the nagios user, I get:

root@icinga:~# sudo su - nagios
nagios@icinga:~$ '/usr/lib/nagios/plugins/check_printer' '192.168.100.121' 'public' 'v2c' 'toner' '3' '' '' '100'
OK - Toner (Yellow) 47% (47 of 100 left) | toner=47;0;0;0;100 nagios@icinga:~$


=> Works perfectly.

In the icingaweb2 output, however, the original SNMP error message still shows up:
(see attached screenshot)

I have already restarted the icinga2 daemon...

Does anyone have an idea?

Thanks & regards

Sharing dashboards?

Hello everyone,

I have put together a dashboard in icingaweb2 and would like to make it available to other users as well. I think I've gotten lost; I can't find how to do this.

Does anyone have a tip for me? Is this possible at all?

Thanks & regards

Delay in Centreon

Hi,

I have a Centreon with passive monitoring configured, with roughly 74 hosts and 150 services. The problem is that the checks are delayed: at the moment, the checks being processed are from 12 hours ago, and the delay keeps growing. Centreon reports no latency, and it is not a RAM, CPU, or I/O problem; I've checked. I have tried mysqltuner and applied all of its tips, but nothing changed. The problem goes away when I restart the server or the Centreon service, but the delay grows again as time passes. My stats are as follows:

Quote

Centreon Engine Statistics Utility 1.4.14

Copyright 2003-2008 Ethan Galstad
Copyright 2011-2013 Merethis
License: GPLv2

CURRENT STATUS DATA
------------------------------------------------------
Status File: /var/log/centreon-engine/status.dat
Status File Age: 0d 0h 0m 52s
Status File Version: (null)

Program Running Time: 0d 0h 8m 12s
Centreon Engine PID: 27336
Used/High/Total Command Buffers: 0 / 0 / 4096

Total Services: 161
Services Checked: 151
Services Scheduled: 0
Services Actively Checked: 10
Services Passively Checked: 151
Total Service State Change: 0.000 / 6.120 / 0.400 %
Active Service Latency: 0.000 / 0.000 / 0.000 sec
Active Service Execution Time: 0.000 / 0.000 / 0.000 sec
Active Service State Change: 0.000 / 0.000 / 0.000 %
Active Services Last 1/5/15/60 min: 0 / 0 / 0 / 0
Passive Service Latency: 0.028 / 0.990 / 0.561 sec
Passive Service State Change: 0.000 / 6.120 / 0.426 %
Passive Services Last 1/5/15/60 min: 0 / 119 / 151 / 151
Services Ok/Warn/Unk/Crit: 150 / 2 / 0 / 9
Services Flapping: 0
Services In Downtime: 0

Total Hosts: 49
Hosts Checked: 39
Hosts Scheduled: 0
Hosts Actively Checked: 10
Host Passively Checked: 39
Total Host State Change: 0.000 / 0.000 / 0.000 %
Active Host Latency: 0.000 / 0.000 / 0.000 sec
Active Host Execution Time: 0.000 / 0.000 / 0.000 sec
Active Host State Change: 0.000 / 0.000 / 0.000 %
Active Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0
Passive Host Latency: 0.008 / 0.988 / 0.496 sec
Passive Host State Change: 0.000 / 0.000 / 0.000 %
Passive Hosts Last 1/5/15/60 min: 0 / 30 / 39 / 39
Hosts Up/Down/Unreach: 49 / 0 / 0
Hosts Flapping: 0
Hosts In Downtime: 0

Active Host Checks Last 1/5/15 min: 5 / 30 / 34
Scheduled: 0 / 0 / 0
On-demand: 5 / 30 / 34
Parallel: 0 / 0 / 0
Serial: 0 / 0 / 0
Cached: 5 / 30 / 34
Passive Host Checks Last 1/5/15 min: 0 / 16 / 16
Active Service Checks Last 1/5/15 min: 0 / 0 / 0
Scheduled: 0 / 0 / 0
On-demand: 0 / 0 / 0
Cached: 0 / 0 / 0
Passive Service Checks Last 1/5/15 min: 0 / 46 / 46

External Commands Last 1/5/15 min: 84 / 425 / 614
I have no more ideas to try. :(

Could someone help me?

NagiosQL support discontinued?

I tried installing NagiosQL. The installation seems to be successful, but there are some configuration problems.
What I noticed is that the official NagiosQL website is down.
Also there seem to be no new forum posts for a few months.

I think NagiosXI still has a modified version of NagiosQL.

Have the NagiosQL devs stopped updating and supporting NagiosQL?
Did Nagios purchase rights for NagiosQL?

Most important question though:

What configuration solution can you recommend if NagiosQL has no remaining support options?

I'm running Nagios Core 3.5.1 on Centos 6.4 with MKLivestatus and NagVis.
I'm looking to firstly automate some of the configuration (possibly modified autodiscovery) and secondly make configuration of hosts and services more easily accessible through a web browser for non Linux-enlightened people.

Thanks in advance for any assistance!

Connection to VCenter fails with secure credential mode

Hello,

I have PowerShell scripts that connect to my vCenter in order to monitor snapshots and datastores. My problem is that I do not want clear-text credentials in my PowerShell script.

So I tried to use the following code in my script:

 Get-Content C:\temp\nagios.cred
$vcenter = "XXX.YYY.110.26"
$vicred = New-Object System.Management.Automation.PsCredential "MyDomain\services", (Get-Content "C:\temp\nagios.cred" | ConvertTo-SecureString) 
Connect-VIServer $vcenter -Credential $vicred
     


It works if I run the script from the PowerShell environment.

But when Nagios executes it, I get the following errors:

[root@GLNXNGIOS plugins]# ./check_nrpe -H V2K8VENTER  -c check_vmware_snapshots -t 60 -p 5666
01000000d08c9ddf0115d1118c7a00c04fc297eb01000000c6d449a0c47eb84e87988133a085c5630000000002000000000003660000c00000001000000011b297f00128d157049212eb0ef1dbd40000000004800000a00000001000000066949d7b4ff46c1ad77358a31c01586118000000a0e10d3466daca0f6280553097c3b800c0f85f9a4e5122cb140000009e4a471cadc4576b49dcf7048e6f53289e02b5b0
ConvertTo-SecureString : Cle non valide pour l'utilisation dans l'etat
specifie.
Au caractere C:\Program Files\NSClient++\scripts\Snapshot.ps1:14 : 127
+ ... nagios.cred"| ConvertTo-SecureString)
+                    ~~~~~~~~~~~~~~~~~~~~~~
    + CategoryInfo          : InvalidArgument : (:) [ConvertTo-SecureString],
   CryptographicException
    + FullyQualifiedErrorId : ImportSecureString_InvalidArgument_Cryptographic
   Error,Microsoft.PowerShell.Commands.ConvertToSecureStringCommand
New-Object : Exception lors de l'appel de ".ctor" avec '2' argument(s): 
Impossible de traiter l'argument, car la valeur de l'argument password est
Null. Remplacez la valeur de l'argum
You have new mail in /var/spool/mail/root


I also tried the following code and got the same error:

$creds = Get-VICredentialStoreItem -file "C:\Program Files\NSClient++\scripts\cred.xml"

Connect-viserver -Server $creds.Host -User $creds.User -Password $creds.Password -WarningAction SilentlyContinue 


[root@GLNXNGIOS plugins]# ./check_nrpe -H V2K8VCENTER -c check_vmware_snapshots -t 60 -p 5666
Get-VICredentialStoreItem : Clé non valide pour l'utilisation dans l'état
specifie.
Au caractere C:\Program Files\NSClient++\scripts\Snapshot.ps1:7 : 10
+ $creds = Get-VICredentialStoreItem -file "C:\Program
Files\NSClient++\scripts\cr ...
+
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    + CategoryInfo          : NotSpecified: (:) [Get-VICredentialStoreItem], C
   ryptographicException
    + FullyQualifiedErrorId : Core_GetVICredentialStoreItem_ProcessRecordget,V
   Mware.VimAutomation.ViCore.Cmdlets.Commands.GetVICredentialStoreItem
Connect-VIServer : Impossible de valider l'argument sur le parametre Server.
L'argument est Null ou vide. Indiquez un argument qui n'est pas Null ou vide
et reessayez.
Au caractere C:\Program Files\NSClient++\scripts\Snapshot.ps1:8 : 26
+ Connect-viserver -Server $creds.Host -User $creds.User -Password
$creds.Password ...
+                          ~~~~~~~~~~~
    + CategoryInfo          : InvalidData : (:) [C



How can I fix this? Or is there another way to connect to the vCenter without clear-text credentials?

Mini-mini monitoring solution with Raspberry Pi 2

Hello everyone,

I have been thinking about running my Raspberry Pi 2 as a small monitoring server (for a Synology NAS, router, etc. at home).

Are there any special distributions you can recommend, or even OMD? I once found that there was an OMD for the Pi 1; what about the Pi 2?

Or simply

# apt-get install nagios


I hope one or the other of you has an idea :)

Icinga2 distributed setup

Hi,

After getting Icinga2 installed with Icingaweb2 on the same server, I discovered that this is not the recommended way to install it. I am currently trying to install Icinga2 in a distributed setup but am having difficulty.

I have 5 data centers that I am currently monitoring, and this will expand. I would like an Icinga2 server in each data center that monitors the local systems and pushes that information back to a single Icingaweb2 server for front-end visibility.

Is this setup possible? Please explain how the zones and nodes should be set up correctly. So far I have not been able to get it to work properly following the documents https://wiki.icinga.org/display/howtos/I…l+with+CentOS+6 or http://docs.icinga.org/icinga2/snapshot/…gh-availability

Also, I do not want an HA setup, just an Icinga2 server in each DC to reduce load and push that data back to an Icinga2 web server for viewing.
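
One possible layout for the setup asked about here is a single master zone with one child zone per data center, each child zone holding exactly one endpoint (so no HA inside a zone). The sketch below generates such a zones.conf from a small table. It uses only the Endpoint `host` and Zone `endpoints`/`parent` attributes; all zone names, endpoint names and addresses are hypothetical, and this is an illustration of the idea rather than a verified configuration for any particular Icinga2 version.

```python
# Hypothetical names and addresses; replace with real ones.
MASTER = ("master1", "10.0.0.10")
DATACENTERS = {
    "dc1": ("satellite-dc1", "10.1.0.10"),
    "dc2": ("satellite-dc2", "10.2.0.10"),
}


def endpoint(name, host):
    """Render one Endpoint object."""
    return 'object Endpoint "%s" {\n  host = "%s"\n}\n' % (name, host)


def zone(name, endpoints, parent=None):
    """Render one Zone object, optionally attached to a parent zone."""
    lines = ['object Zone "%s" {' % name]
    quoted = ", ".join('"%s"' % e for e in endpoints)
    lines.append("  endpoints = [ %s ]" % quoted)
    if parent:
        lines.append('  parent = "%s"' % parent)
    lines.append("}\n")
    return "\n".join(lines)


def render_zones(master, datacenters):
    """Render the master zone plus one child zone per data center."""
    parts = [endpoint(*master), zone("master", [master[0]])]
    for dc, (name, host) in sorted(datacenters.items()):
        parts.append(endpoint(name, host))
        parts.append(zone(dc, [name], parent="master"))
    return "\n".join(parts)


zones_conf = render_zones(MASTER, DATACENTERS)
print(zones_conf)
```

Adding a sixth data center then only means adding one entry to the table and regenerating the file.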

Thanks in advance.