In the past few weeks I've received a few mails indicating a problem when the Nagios plugin check_esxi_hardware is launched against a Dell PowerEdge R620 server.
The plugin will exit in Warning state and the following warning will be in the output:
WARNING : System Board 1 Riser Config Err 0: Connected - Server: Dell Inc. PowerEdge R620 s/n: xxxxxx System BIOS: 1.1.2 2012-03-08
The reason is that the CIM element shown as warning (System Board 1 Riser Config Err 0) uses an exit code different than 2:
Element Name = System Board 1 Riser Config Err 0: Connected
Element Op Status = 12
By CIM definition the exit code 12 means No Contact.
I'm still waiting for an official statement of DELL why this single element returns the exit code 12 instead of 2 (typo maybe??). So far it seems that only Dell PowerEdge R620 servers are affected.
As a workaround, check_esxi_hardware users may just use the ignore parameter to exclude this element from being checked:
./check_esxi_hardware.py -H esxi-server-ip -U username -P mypass -V dell -i "System Board 1 Riser Config Err 0: Connected"