Dell

check_dell_openmanage

Description:

This plugin checks the status of objects monitored by Dell OpenManage via SNMP and returns OK, WARNING, CRITICAL or UNKNOWN. If a failure occurs it will describe the subsystem that failed and the failure code.

As featured in Linux Journal issue 182
http://www.linuxjournal.com/issue/182

Current Version

1.3

Last Release Date

2010-02-31

Compatible With

  • Nagios 2.x
  • Nagios 3.x

Project Files
Project Notes

INFORMATION ABOUT THIS PLUGIN

This plugin checks the status of objects monitored by Dell OpenManage via SNMP and returns OK, WARNING, CRITICAL or UNKNOWN. If not OK it returns text describing which subsystem failed. This plugin focuses on low bandwidth and speed. Questions, comments and criticisms are appreciated.

As featured in Linux Journal issue 182 http://www.linuxjournal.com/issue/182

This program is written and maintained by: Jason Ellison - infotek(at)gmail.com http://www.jasonellison.net/

t is based on check_snmp_temperature.pl plugin by: William Leibzon - william(at)leibzon.org

-= SETUP NOTES =-

Copy this file to your Nagios installation folder in "libexec/". Rename to "check_dell_openmanage.pl".

You must have Dell OpenManage installed on the server you wish to monitor. You must have enabled SNMP on the server and allow SNMP queries. On the nagios server that will be running the plugin you must have the perl "Net::SNMP" module installed.

perl -MCPAN -e shell
cpan> install Net::SNMP

Check Dell OpenManage locally on the monitored host for alert threshholds like min/max fan speeds...

To do a test run using SNMPv1 try:

./check_dell_openmanage.pl -H 192.168.1.250 -C public -T test

This will report which OID's your system supports.

-= OIDS Checked by System Type=-

System Types

"dellom" monitors the following OID's:

systemStateChassisStatus .1.3.6.1.4.1.674.10892.1.200.10.1.4.1
systemStatePowerSupplyStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.9.1
systemStateVoltageStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.12.1
systemStateCoolingDeviceStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.21.1
systemStateTemperatureStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.24.1
systemStateMemoryDeviceStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.27.1
systemStateChassisIntrusionStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.30.1
systemStateEventLogStatus .1.3.6.1.4.1.674.10892.1.200.10.1.41.1

"dellom_storage" test all of the OID's "dellom" tests plus the storage OID

systemStateChassisStatus .1.3.6.1.4.1.674.10892.1.200.10.1.4.1
systemStatePowerSupplyStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.9.1
systemStateVoltageStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.12.1
systemStateCoolingDeviceStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.21.1
systemStateTemperatureStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.24.1
systemStateMemoryDeviceStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.27.1
systemStateChassisIntrusionStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.30.1
systemStateEventLogStatus .1.3.6.1.4.1.674.10892.1.200.10.1.41.1
StorageManagement-MIB::agentGlobalSystemStatus .1.3.6.1.4.1.674.10893.1.20.110.13.0

"blade"

systemStateChassisStatus .1.3.6.1.4.1.674.10892.1.200.10.1.4.1
systemStateVoltageStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.12.1
systemStateTemperatureStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.24.1
systemStateMemoryDeviceStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.27.1
systemStateEventLogStatus .1.3.6.1.4.1.674.10892.1.200.10.1.41.1
StorageManagement-MIB::agentGlobalSystemStatus .1.3.6.1.4.1.674.10893.1.20.110.13.0

"global" monitors

GlobalSystemStatus .1.3.6.1.4.1.674.10892.1.200.10.1.2.1

"chassis" monitors

ChassisStatus .1.3.6.1.4.1.674.10892.1.200.10.1.4.1

"custom" is meant to be customised by advanced users

-= TODO =-

* Fix bugs and make more user friendly.

-= REVISION =-

ver 1.3
* If non-numeric codes are returned just add the text to the statusinfo
* This was done to allow adding machine information Dell Model Number and Service Tag to output.

ver 1.2
*major rewrite. simplified the way new systems are defined.
*added system type "test" which can be used to easly generate new system definitions.

ver 1.1
*formating of text output
*add blade system type... blades apparently do not support systemStatePowerSupplyStatusCombined,
systemStateCoolingDeviceStatusCombined or systemStateChassisIntrusionStatusCombined

ver 1.0
*while in verbose mode report which OID failed in a more readable manner.
*add "global", "chassis", and "custom" system type.
*removed system type pe2950.

ver 0.9
*added type dellom_storage as this is more accurate. this plugin works with all PowerEdge servers it has been tested with. left pe2950 type in for compatibility.
*remove min max int options from help text as they are no longer relevant

ver 0.8
*check that perl environment has "Net::SNMP" if not found complain.
missing "Net::SNMP" is the most common issue users report.

ver 0.7
*removed ucdavis definition. Added note about SNMPv3 dependencies

Dell OpenManage Server Administrator Version 5.4 SNMP Reference Guide

http://support.dell.com/support/edocs/software/svradmin/5.4/en/snmp/pdf/om_54_snmp_ref_gd.pdf

Dell OpenManage Server Administrator Version 1.0 to 6.2 SNMP Reference Guide

http://support.dell.com/support/edocs/software/svradmin/6.3/en/SNMP/1.0-6.2/PDF/SNMP.pdf

Preparing the Dell Openmange server for monitoring:

Linux: http://www.ubergeek.co.uk/blog/2008/05/dell-openmanage-on-linux-debian/

Windows: http://docs.google.com/View?docID=dg2fzp3d_4d2pgb3cp

Reviews (4) Add a Review
error dellom
by jriker1, January 31, 2016

When I run the test it scrolls thru fine except for one item. systemStateCoolingDeviceStatusCombined always shows NO RESPONSE. If I put the same OID in there twice and name the second like systemStateCoolingDeviceStatusCombinedd with two d's, both return OK. Take either out and it returns NO RESPONSE. Also when running dellom I get: Use of uninitialized value in concatenation (.) or string at ./check_dell_openmanage.pl line 491. Use of uninitialized value within %dell_oids in string at ./check_dell_openmanage.pl line 492.



And then...
by carekapt, February 28, 2015

[root@nagios]# ./check_dell_openmanage.1.3.pl -H 10.1.1.1 -C public -T test -t 60 TEST MODE: Trying all preconfigured Dell OID's against target... StorageManagementGlobalSystemStatus (.1.3.6.1.4.1.674.10893.1.20.110.13.0) ERROR: Alarm signal (Nagios time-out) Even after changing time-out, i keep getting this...any hint?



It makes my life easier
by imagine1437, August 31, 2010

I have around 200+ Dell server here and it helps me to monitor them easily. Jason is really helpful as well.



Dell PowerEdge 1850
by christimpaler, February 28, 2010

Works excellent on a Dell PowerEdge 1850. This plugin is interoperating with Dell OpenManage 5.5 on Red Hat Enterprise Linux 5. The documention is quite good and it's easy to adjust the plugin to your personal needs. Good work. Tahnk you.



Add a Review

You must be logged in to submit a review.

Thank you for your review!

Your review has been submitted and is pending approval.

Recommend

To:


From:


Thank you for your recommendation!

Your recommendation has been sent.

Project Stats
Rating
4.5 (6)
Favorites
4
Views
148,842