Hi,
I'm new here, so please bear with me.
How can I tell from the output of the esxcli hardware commands that a hardware component has failed? Which field or indicator shows that a failure has occurred? I'm writing an automated script that will periodically log in, run the command, test for a failure condition, and send an SMS to the on-call engineer's handset when one occurs. The only problem is that I don't know what in the output indicates a failure. Does anyone have experience with this? I've searched all the PDFs and documents on esxcli I could find, but nothing answers my question. Is it that failure information simply isn't included in this type of command output?
For example:
~ # esxcli hardware ipmi sdr list
Node-Sensor  Description                                       Entity-Instance  Computed Reading  Base Unit    Raw Reading  Timestamp/Comment    Raw
-----------  ------------------------------------------------  ---------------  ----------------  -----------  -----------  -------------------  ---
0.16         Memory Device 1 04-P1 DIMM 1-3                    32.1             30                degrees C    30           2013-08-08T12:51:49
0.17         Memory Device 2 05-P1 DIMM 4-6                    32.2             31                degrees C    31           2013-08-08T12:51:49
0.18         Memory Device 3 06-P1 DIMM 7-9                    32.3             26                degrees C    26           2013-08-08T12:51:49
0.19         Memory Device 4 07-P1 DIMM 10-12                  32.4             27                degrees C    27           2013-08-08T12:51:49
0.20         Memory Device 5 08-P2 DIMM 1-3                    32.5             36                degrees C    36           2013-08-08T12:51:49
0.21         Memory Device 6 09-P2 DIMM 4-6                    32.6             38                degrees C    38           2013-08-08T12:51:49
0.22         Memory Device 7 10-P2 DIMM 7-9                    32.7             31                degrees C    31           2013-08-08T12:51:49
0.23         Memory Device 8 11-P2 DIMM 10-12                  32.8             30                degrees C    30           2013-08-08T12:51:49
This command only lists the sensor data repository (SDR), i.e. the current sensor readings. I would recommend checking for alerts on the Hardware Status tab, or asking your hardware vendor whether they offer a monitoring tool that meets your requirement.
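If you still want your script to poll the SDR output, one option is to compare the Computed Reading column against a limit you choose yourself, since the listing shown here has no failure flag of its own. Below is a minimal sketch in Python, assuming the temperature rows look like the ones posted above. The names `parse_temperatures` and `over_limit` and the 35 degrees C limit are made up for illustration; they are not part of esxcli, and the right limit depends on your hardware. How the script collects the command output (SSH or otherwise) is left out.

```python
import re

# Matches one temperature row of the `esxcli hardware ipmi sdr list` output
# as posted above: Node-Sensor, Description, Entity-Instance, Computed Reading.
ROW = re.compile(
    r"^(?P<sensor>\d+\.\d+)\s+"
    r"(?P<description>.+?)\s+"
    r"(?P<entity>\d+\.\d+)\s+"
    r"(?P<reading>-?\d+(?:\.\d+)?)\s+degrees C\b"
)


def parse_temperatures(output):
    """Return (sensor, description, reading) tuples for each temperature row."""
    readings = []
    for line in output.splitlines():
        match = ROW.match(line.strip())
        if match:  # header, separator and non-temperature rows are skipped
            readings.append(
                (match["sensor"], match["description"], float(match["reading"]))
            )
    return readings


def over_limit(readings, limit_c):
    """Return the readings strictly above limit_c (a limit you pick yourself)."""
    return [r for r in readings if r[2] > limit_c]


# Three rows copied from the output in the question.
SAMPLE = """\
Node-Sensor Description Entity-Instance Computed Reading Base Unit Raw Reading Timestamp/Comment Raw
0.16 Memory Device 1 04-P1 DIMM 1-3 32.1 30 degrees C 30 2013-08-08T12:51:49
0.20 Memory Device 5 08-P2 DIMM 1-3 32.5 36 degrees C 36 2013-08-08T12:51:49
0.21 Memory Device 6 09-P2 DIMM 4-6 32.6 38 degrees C 38 2013-08-08T12:51:49
"""

if __name__ == "__main__":
    # Hypothetical limit of 35 degrees C, purely for illustration.
    for sensor, description, temp in over_limit(parse_temperatures(SAMPLE), 35.0):
        print(f"ALERT sensor {sensor} ({description}): {temp} degrees C")
```

The alert line is where the SMS hook would go. Note that this only catches readings drifting past your own limit; it will not report a failed component as such, which is why the Hardware Status tab or a vendor tool is still the better source for real failure alerts.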