VMware Cloud Community
Jimgil
Contributor
Contributor

2 of 5 recently built ESXi 5.5 servers randomly recycle

Good Day,

I recently built 5 ESXi 5.5 servers to replace some old ESX 4.o servers, I am using the same HW and the VM's were originally part of the ESX 4.0 builds. I have recently had 2 of the ESXi 5.5 servers reboot(crash) themselves, I am looking for some troubleshooting methods as I am unsure where to look.

All 5 servers are built the same!

HP Proliant DL380 G6

64GB RAM

ESXi 5.5

2 x internal SAS drives for the OS

2 Qlogic HBA's - Connected to 2 fabrics

HP EVA4100 Storage array contains the VM datastores

Any help would be appreciated, I know I mentioned this but only 2 of the 5 have had this issue (so far).....

Thanks,

Reply
0 Kudos
4 Replies
BenLiebowitz
Expert
Expert

Have you used Update Manager to bring all of the installs up to the most current version?  If not, I recommend installing all the patches as one of them may fix the issue. 

Have you run any hardware diagnostics to make sure you're not dealing with a faulty device? 

Ben Liebowitz, VCP vExpert 2015, 2016, & 2017 If you found my post helpful, please mark it as helpful or answered to award points.
Reply
0 Kudos
BenLiebowitz
Expert
Expert

What did the logs say?  You could open a case with VMware support and send in the logs to be analyzed. 

Ben Liebowitz, VCP vExpert 2015, 2016, & 2017 If you found my post helpful, please mark it as helpful or answered to award points.
Reply
0 Kudos
MattiasN81
Hot Shot
Hot Shot

i would suggest you run a firmware upgrade on affected servers, especally I/O modules and BIOS

that solution has worked for me wih HP and ESXi more times than i can Count Smiley Happy

VMware Certified Professional 6 - DCV VMware VTSP Software Defined Storage Dell Blade Server Solutions - EMEA Certified Dell PowerEdge Server Solutions - EMEA Certfied Dell Certified Storage Deployment Professional Dell EMC Proven Professional If you found my answers useful please consider marking them as Helpful or Correct
Reply
0 Kudos
JPM300
Commander
Commander

Like MattiasN81 said,


Get all the servers up to the latest firmware, drivers, and ESXi updates.  Also setup a syslog collector and dump collector.  Then if some of the servers are still crashing you will need to check the logs to see why they are randomly rebooting.  The syslog and dump collectors are great tools as they store the logs for you that are usually purged on a reboot. 

Reply
0 Kudos