Is there some command that I can run on the command line on an ESXi server (such as vim-cmd or esxcli) that will tell me the health status of the server? I don't need detailed information. "Healthy" or "Not Healthy" are good enough. If my script detects "Not Healthy", I'll manually take a closer look through the vSphere client or the Web GUI.
I'm running the HP custom image on the HP servers. I have the generic image on a few old Dell servers that will be replaced soon, so I'm not too concerned about those.
There is no vCenter in the environment. I have a mix of versions of ESXi (5.0, 5.5, 6.0, 6.5) due to the fact that some of the older hardware won't run with newer versions of ESXi. (The main problem being that the RAID isn't recognized.)
Thanks.
What is the RAID controller model installed on those servers !
Please specify what exactly you want to monitor? CPU usage, RAM, disk or servers availability at all?
Here a couple of links to the same discussions. Hope they will be useful.
[SOLVED] Hardware Level Monitoring for VMWare ESXi 5.0 Hosts??? - Spiceworks
Monitor DELL hardware on VMware ESXi 5.5 server - Server Fault
The RAID controller varies depending on the make/model. Here are some:
H240ar
P410i
I only care about disk failures at this time. I have ways to monitor other things, but I don't currently have a convenient way to individual disks that are part of a RAID setup.
I can detect some errors (such as power supply failures) with this command:
esxcli hardware ipmi sel list
However, that doesn't show anything if I pull out a disk.