I recently built 5 ESXi 5.5 servers to replace some old ESX 4.o servers, I am using the same HW and the VM's were originally part of the ESX 4.0 builds. I have recently had 2 of the ESXi 5.5 servers reboot(crash) themselves, I am looking for some troubleshooting methods as I am unsure where to look.
All 5 servers are built the same!
HP Proliant DL380 G6
2 x internal SAS drives for the OS
2 Qlogic HBA's - Connected to 2 fabrics
HP EVA4100 Storage array contains the VM datastores
Any help would be appreciated, I know I mentioned this but only 2 of the 5 have had this issue (so far).....
Have you used Update Manager to bring all of the installs up to the most current version? If not, I recommend installing all the patches as one of them may fix the issue.
Have you run any hardware diagnostics to make sure you're not dealing with a faulty device?
What did the logs say? You could open a case with VMware support and send in the logs to be analyzed.
i would suggest you run a firmware upgrade on affected servers, especally I/O modules and BIOS
that solution has worked for me wih HP and ESXi more times than i can Count
Like MattiasN81 said,
Get all the servers up to the latest firmware, drivers, and ESXi updates. Also setup a syslog collector and dump collector. Then if some of the servers are still crashing you will need to check the logs to see why they are randomly rebooting. The syslog and dump collectors are great tools as they store the logs for you that are usually purged on a reboot.