Network and Systems Management

Check_iDRAC for DELL iDRAC

Description:

Check_iDRAC for DELL
Latest: 2.1
https://github.com/dangmocrang/check_idrac
http://www.cloudarch.club

Current Version

2.2rc4

Last Release Date

2016-08-19

Compatible With

  • Nagios 3.x
  • Nagios 4.x
  • Nagios XI

License

GPL


Nagios CSP

Meet The New Nagios Core Services Platform

Built on over 25 years of monitoring experience, the Nagios Core Services Platform provides insightful monitoring dashboards, time-saving monitoring wizards, and unmatched ease of use. Use it for free indefinitely.

Monitoring Made Magically Better

  • Nagios Core on Overdrive
  • Powerful Monitoring Dashboards
  • Time-Saving Configuration Wizards
  • Open Source Powered Monitoring On Steroids
  • And So Much More!
Project Notes
This check monitor idrac 7 hardware status via SNMP: - Virtual Disk - Physical Disk - Memory - CPU - Power Supply - Power Unit - Fan - Battery - Temperature Sensor check out MANUAL on github ------------------------ 1.0 - Deprecated ------------------------
Reviews (9) Add a Review
Good, but needs optimization on output and alert management
by Setti, January 31, 2017
Hi all.
The plugin extract the needed data, but the amount of output can be confusing.
For example, checking all memory banks will return the correct status, but the first line will always correspond to the first DIMM.
Instead, you should use the first line to give an overview of the status (i.e. checking all memory could return: "WARNING - Dimm3 nonok"), and then all the other output lines to give the status of each component. In this way, SMS and other messages will return the correct component with warn or crit state.
Owner's Reply:

Hi, script is not intended to be used this way, it was intended to check only one device, i.e one Memory dimm at a time.


0 of 1 found this review helpful.
Helpful? Yes  No 1
Error witch idrac 7
by raprop, November 30, 2016
Excellent!! work fine with idrac8, with idrac7 show this message when check PU

PU 1: UNKNOWN(!)/OTHER(!), RedundancyStatus: OTHER(!), SystemBoard Pwr Consumption: 168 W | PwrConsumption=168;;;;

Is possible fixed?

Thx
Owner's Reply:

Because your device returns output that way :) script simply maps the OID with MIB and translate into human readable. You should investigate why it returns that output


Helpful? Yes  No 
Error in Minimal Warnung for Fac
by snh, July 31, 2016
Hallo, ich an Fan goes unterminimal Warning. The Script raises an error in case of the the value get in "(!)" attached and is not in int.

Traceback (most recent call last):
File "/usr/lib/nagios/plugins/check_idrac", line 841, in
result, exit_code = PARSER().main()
File "/usr/lib/nagios/plugins/check_idrac", line 664, in main
hw_dict, exit_code = self.raise_alert(hw_dict, value_on_alert)
File "/usr/lib/nagios/plugins/check_idrac", line 494, in raise_alert
if int(tmp[key][stat_t]) >= conf['fan_thresholds'][1]:
ValueError: invalid literal for int() with base 10: '3600(!)'
Owner's Reply:

Hi,

Fixed with new merge (from another guy).


Helpful? Yes  No 
Functionnal and usfull
by shiryu, May 31, 2016
The tool make job, it miss only on small tutorial for multi line output on Nagios front end, if any body have one good documentation about that I take!!
Helpful? Yes  No 
Very good, one feature request
by praines, August 31, 2015
Got the plugin from master on GitHub (there is an error in the include conf file on consumption line)

One feature request is that a nagios alert could be done on a whole group instead of each individual item. Mainly for FAN, MEMORY, SENSOR and PDISK groups

For example doing "-w MEM#0" would return something like

OK - All memory ENABLED/OK

or

CRITICAL - DIMM Socket B7 failed ...

while "-w MEM" would still return the non-alert full info like it does now.
Owner's Reply:

Thanks,

script currently limited support for global health, it will return with exit code of alert but now print prefix (OK, CRIT, WARN) and not combine check result, this can use with multi line check view in front-end


Helpful? Yes  No 
PSU bug
by henri5672, July 31, 2015
I have a problem where if power supplies are removed this will not detect a problem. A removed PSU shows up with status of "lost", which I think is the issue.
Owner's Reply:

If removed PSU does not remove SNMP OID in idrac, then you must add 'lost' to state alert for script to know.

If SNMP OID removed also, then script will say that hardware not exist


Helpful? Yes  No 
Good start, but some refinement would help
by kalju, June 30, 2015
The plugin seems to do a decent job at picking up the correct information from the iDRAC controller.
However, at least in my experiments the no_alerts option seemed to be switched on by default and that means that even though the HW issues (like an unplugged PS) were detected, they did not trigger a Nagios alert.
For example:
$ ./check_idrac -f ./check_idrac.conf -H 10.0.0.1 -w ps
PS 1: CRITICAL, Volt I/O: 264 V/(N/A) V, Current: (N/A) A, Watt I/O: 900.0 W/750.0 W
PS 2: OK, Volt I/O: 264 V/246.0 V, Current: 0.4 A, Watt I/O: 900.0 W/750.0 W
$ echo $?
0

So the correct exit code was not set.

I could find no other way to fix it than to change the code:
179c179
opts.no_alert = True

Also the PS and PU options should be better implemented / documented. Both, the wat-warn and the wat-crit need to be defined, otherwise you'll get a parsing error:
$ ./check_idrac -f ./check_idrac.conf -H 10.0.0.1 -w ps --wat-warn=100,500
Error parsing threshold.

I believe the options should be parsed in separate code blocks or at least the error should be more precise. (This specific code starts on the line 256).

Thank you for the plugin and hope these issues will get addressed to make it a little bit more user friendly.
Owner's Reply:

Hi,

You should visit my github, there is a manual.

The reason script does not return alert because you are using it wrong way :x

Regards,


1 of 2 found this review helpful.
Helpful? Yes 1 No 1
Thanks for check_idrac
by egalstad, November 30, 2014
Works fine, but i have a problem with the return status. I always get OK.

Here´s one example (1 PS is not connected, so it should get critical)

./check_idrac_2.py -H 10.xx.xx.xx -c public -m /usr/local/icinga/libexec/iDRAC-SMIv2.mib -w PS#2 --conf /usr/local/icinga/libexec/check_idrac.conf
OK - PS 2: CRITICAL, Volt I/O: 264 V/0.084 V, Current: 8.4 A, Watt I/O: 900.0 W/750.0 W

Also when I change the OK/WARN/CRIT Parameters in the Config, it also show´s alway OK.
With version 2.0b2 it works fine.

I think with the bugfix 2.0b3 the script ignors the config?
Owner's Reply:

It's realy strange. I tested with CRIT changed and it work fine.
Does 2.0b4 work for you?


2 of 2 found this review helpful.
Helpful? Yes 2 No 0
2.0b: Not ready yet
by tarvin, September 30, 2014
Thanks for the check_idrac plugin. A review on 2014-09-10:

On the plugin's page, it's unclear which one to download, "check_iDRAC.tar.gz" or "idrac_2.0b.tar.gz"? Is check_iDRAC.tar.gz version 1? (It turns out: Yes, check_iDRAC.tar.gz is version 1.x.)

At the time of writing this, the files are weird, in that they are double-gzipped(?).

I think it's a shame that generation 2 demands a configuration file. I suggest that the general case be made simple (and not require a configuration file): Check all hardware's status using SMNP version 2c with "public" as community shouldn't require anything but a "-H ".

I couldn't get version 2.0b to work. After having created the configuration file, I'm trying:
$ python idrac/idrac_2.0/idrac_2.0b.py -c check_idrac.conf -H foobar
Unknown flag passed to -C: u

I don't understand that, as I didn't put in a "-C" argument.

I'm giving a rating of Average, as generation 1 is fine, but generation 2 is bad at the time of writing.
Owner's Reply:

Thanks for your review. I will check the "-C" error and consider authen snmp as option.


3 of 3 found this review helpful.
Helpful? Yes 3 No 0
Add a Review

You must be logged in to submit a review.

Thank you for your review!

Your review has been submitted and is pending approval.

Recommend

To:


From:


Thank you for your recommendation!

Your recommendation has been sent.

Project Stats
Rating
3.8 (10)
Favorites
0
Views
87,743