Hi,
one of my host is showing as not responding in my vCenter on testing environment.
I can able to ping it which is interesting
Hi pkmr,
Done with analyzing on your PSOD issue, please follow this KB for resolution "https://kb.vmware.com/s/article/2140848?language=en_US"
Solution: please upgrade to latest update of ESX 6.0 or at-least go to ESX 6.0 U2 to fix this bug
Reason for issue:
PCPU becomes too busy logging all the correctable error messages to perform routine background tasks which leading ESXi to assume that PCPU is unresponsive. and finally it is causes a purple diagnostic screen and showing as host not responding in vCenter.
Please let me know if you have any questions?
Quick questions,
Please open SR with vmware if it is on high priority
Please provide me your ESX version, vCenter version and is there any other host is experiencing this issue? who is hardware vendor?
please message me your log's if you feel not posting to this thread.
Regards,
VKmr.
Hi,
Please check below.
1) Is the ESXi host accessible from SSH
2) Is the ESXi host accessible as standalone host from vSphere client.
3) Whats the status of VM, are they online or in orphened status.
4) Have tried to reconnect the host from vCenter.- If yes, whats the error you are getting .
5) Please see into the ESXi hostd.logs and found if there is any hostd non-responsive status mentioned.
6) If possible attached the hostd.log, vobd.log, vmkwarning.logs
-Sachin
Hi Vkmr,
Thanks for responding, this was happened in my development environment so it was not high priority
I moved all VMs to other host and kept it under maintenance mode, I am analyzing hostd log's as bhards4 said, as of now I didn't find anything
I am using ESX 6.0 U1 and my hardware vendor is Cisco, please find the log's that messaged you hardware and vmware logs.
thinking we have some issues with DIMMs
1) Is the ESXi host accessible from SSH
I can not
2) Is the ESXi host accessible as standalone host from vSphere client.
No, but after restart I can able too
3) Whats the status of VM, are they online or in orphened status.
it is showing as disconnected.
4) Have tried to reconnect the host from vCenter.- If yes, whats the error you are getting .
after restarting it was added automatically.
.
5) Please see into the ESXi hostd.logs and found if there is any hostd non-responsive status mentioned.
I am looking into it, think some issue with DIMMs based on my environment.
6) If possible attached the hostd.log, vobd.log, vmkwarning.logs
I will send you all VMware support logs for this host
Hi Pkmr,
Thanks for providing information, I believe you as you said some issue with DIMMs, I will start with chassis log's
also when you launch your KVM are you seeing any purple screen? I need this information to analyze your VMware log's
Regards,
Vkmr.
I think it was purple screen
cool thanks for quick update as of now I didn't find any issues with DIMM's let me dig into more
can you follows the processes contained in the following KB and extract the details of the PSOD
Hi pkmr,
Done with analyzing on your PSOD issue, please follow this KB for resolution "https://kb.vmware.com/s/article/2140848?language=en_US"
Solution: please upgrade to latest update of ESX 6.0 or at-least go to ESX 6.0 U2 to fix this bug
Reason for issue:
PCPU becomes too busy logging all the correctable error messages to perform routine background tasks which leading ESXi to assume that PCPU is unresponsive. and finally it is causes a purple diagnostic screen and showing as host not responding in vCenter.
Please let me know if you have any questions?
Hi Vkmr,
I was seeing your message, it was clear to me, thanks for taking time on my issue,
Do you know why it [bug] happened to only this host but not other even though few other hosts are running same ESX 6.0 U1 version?
Thank you
pkmr.