Hi,
I have 4 new servers(HPE ProLiant DL560 Gen10)(Intel(R) Xeon(R) Gold 5118 CPU @ 2.30GHz).
I installed esxi6.7 for them.
Then they restart at irregular intervals.
The first server,about restart 2-3 times a day.
Second server,about restart 15-20 times a day.
Third server,about restart 4-6 times a day.
Fourth server,about restart 1-2 times a day.
There is no warning before restarting.
Every time it's a sudden restart.
The day before yesterday, I installed centos7 on the second servers above, and now it has been running normally for more than 2 days.
So it should not be the problem of the server itself.
How can I find out why they are restarted when they install esxi6.7?
Thanks.
OK!
Here there is the solution that worked x my 6 HP ProLiant DL560 Gen10:
On BIOS (according with HP) I've forced the power management in
"Virtuality - Max Power".
No more reboots after 20 days.
If all the hosts are restarting, looks like there is a compatibility issue somewhere. To start of with, you can check the KB - VMware Knowledge Base and try to identify if there is anything substantial in the logs. In the meantime, you can perform a reinstall of one host using the HPE custom image for 6.7 (if they are currently on VMware standard image) and have all the BIOS/firmware updated to the latest version. If the issue still persist, try installing ESXi 6.5 on at least one host and see if that remains stable.
Cheers,
Supreet
Have you check IML, or AHS logs ? Log into ILO administration page and see if you find there any hardware events.
Same problem here,
6(!) HPE ProLiant DL560 Gen10 with ESX 6.7 (HP 6.7.0.10.3.0.30, build 8169922).
They random reboot 5-10 times per day, also with no VMs running with no logged errors.
The ILO log indicates "power failure", but we have two dedicated AC lines, in absolute contiunity too!
This morning, a 6.7 U1 version magically appeared on VMWare repositories.
Could it be the solution?
Hi,
There is a service pack released by HPE, kindly check it might have fixed the issue now.
https://support.hpe.com/hpsc/doc/public/display?docId=emr_na-a00050181en_us
Regards,
Do you have ASR enabled?
- MS
Maybe PSOD issues?
Check to see if your ESXi system is generated dump files due to failures of this type.
Usually these files are generated on the scratchlog config location, unless you manually changed the location.
In my prod environment, we always configure Sratchlog to be on a persistent location.
Cheers,
Mario
Hi, thank you.
According to HP customer service a new SPP has been uploaded to their site (2018.09) and yesterday I started the deploying.
It SEEMS the problem has been resolved, but I prefer to wait for a while.
Ok....at least one server is still rebooting. So, the SPP is not the solution.
The only server that remains up&running without rebooting is the only with a VM runnnig.
If I disable ASR feature on BIOS do you know if there is some more setting to do in ESX?
OK!
Here there is the solution that worked x my 6 HP ProLiant DL560 Gen10:
On BIOS (according with HP) I've forced the power management in
"Virtuality - Max Power".
No more reboots after 20 days.