bhuqwl
Contributor
Contributor

Automatic restart of esxi6.7 host,How to find out why?

Jump to solution

Hi,

I have 4 new servers(HPE ProLiant DL560 Gen10)(Intel(R) Xeon(R) Gold 5118 CPU @ 2.30GHz).

I installed esxi6.7 for them.

Then they restart at irregular intervals.

The first server,about restart 2-3 times a day.

Second server,about restart 15-20 times a day.

Third server,about restart 4-6 times a day.

Fourth server,about restart 1-2 times a day.

There is no warning before restarting.

Every time it's a sudden restart.

The day before yesterday, I installed centos7 on the second servers above, and now it has been running normally for more than 2 days.

So it should not be the problem of the server itself.

How can I find out why they are restarted when they install esxi6.7?

Thanks.

1 Solution

Accepted Solutions
Blueyes
Contributor
Contributor

OK!
Here there is the solution that worked x my 6 HP ProLiant DL560 Gen10:

On BIOS (according with HP) I've forced the power management in

"Virtuality - Max Power".

No more reboots after 20 days.

View solution in original post

0 Kudos
9 Replies
SupreetK
Commander
Commander

If all the hosts are restarting, looks like there is a compatibility issue somewhere. To start of with, you can check the KB - VMware Knowledge Base and try to identify if there is anything substantial in the logs. In the meantime, you can perform a reinstall of one host using the HPE custom image for 6.7 (if they are currently on VMware standard image) and have all the BIOS/firmware updated to the latest version. If the issue still persist, try installing ESXi 6.5 on at least one host and see if that remains stable.

Cheers,

Supreet

vijayrana968
Virtuoso
Virtuoso

Have you check IML, or AHS logs ? Log into ILO administration page and see if you find there any hardware events.

0 Kudos
Blueyes
Contributor
Contributor

Same problem here,

6(!) HPE ProLiant DL560 Gen10 with ESX 6.7 (HP  6.7.0.10.3.0.30, build 8169922).

They random reboot 5-10 times per day, also with no VMs running with no logged errors.

The ILO log indicates "power failure", but we have two dedicated AC lines, in absolute contiunity too!

This morning, a 6.7 U1 version magically appeared on VMWare repositories.

Could it be the solution?

0 Kudos
rajen450m
Hot Shot
Hot Shot

Hi,

There is a service pack released by HPE, kindly check it might have fixed the issue now.

https://support.hpe.com/hpsc/doc/public/display?docId=emr_na-a00050181en_us

Regards,

Raj M Please mark helpful or correct if my answer resolved your issue. Visit www.hypervmwarecloud.com for my blog posts, step-by-step procedures etc.,
msripada
Virtuoso
Virtuoso

Do you have ASR enabled?

- MS

0 Kudos
marioxherrera
Contributor
Contributor

Maybe PSOD issues?

Check to see if your ESXi system is generated dump files due to failures of this type.

Usually these files are generated on the scratchlog config location, unless you manually changed the location.

In my prod environment, we always configure Sratchlog to be on a persistent location.

Cheers,

Mario

0 Kudos
Blueyes
Contributor
Contributor

Hi, thank you.
According to HP customer service a new SPP has been uploaded  to their site (2018.09) and yesterday I started the deploying.

It SEEMS the problem has been resolved, but I prefer to wait for a while.

0 Kudos
Blueyes
Contributor
Contributor

Ok....at least one server is still rebooting. So, the SPP is not the solution.

The only server that remains up&running without rebooting is the only with a VM runnnig.

If I disable ASR feature on BIOS do you know if there is some more setting to do in ESX?

0 Kudos
Blueyes
Contributor
Contributor

OK!
Here there is the solution that worked x my 6 HP ProLiant DL560 Gen10:

On BIOS (according with HP) I've forced the power management in

"Virtuality - Max Power".

No more reboots after 20 days.

View solution in original post

0 Kudos