VMware Cloud Community
AlbertWT
Virtuoso
Virtuoso

PSoD on ESXi after hours ?

People,

I need your assistance in deciphering and getting more understanding regarding the PSOD on one of my ESXi host on HP BL 465c G7 (Blades) as per below screenshot:

image001.png

this problem was brought up to me after hours after I got the following alarm email from VCenter:

([Event alarm expression: Cannot connect host -  incorrect Ccagent] OR [Event alarm expression: Cannot connect host - network error] OR [Event alarm expression: Cannot connect host - time-out] OR [Event alarm expression: Cannot connect host - time-out] OR [Event alarm expression: Host connection lost])


Any kind of help and comments would be greatly appreciated.


Thanks

/* Please feel free to provide any comments or input you may have. */
9 Replies
DavoudTeimouri
Virtuoso
Virtuoso

Hi,

Do you have any VM with E1000 NIC?

Also please check this article: VMware KB: VMware ESXi 5.x host experiences a purple diagnostic screen mentioning E1000PollRxRing an...

Seems, you ave to applying ESXi patch on your server.

-------------------------------------------------------------------------------------
Davoud Teimouri - https://www.teimouri.net - Twitter: @davoud_teimouri Facebook: https://www.facebook.com/teimouri.net/
AlbertWT
Virtuoso
Virtuoso

Hi Davoud,

Not that I know of, I'll have a look at it tomorrow.

So which logs can I review or investigate further ?

/* Please feel free to provide any comments or input you may have. */
Reply
0 Kudos
DavoudTeimouri
Virtuoso
Virtuoso

Hi,

You can find it on 'vmkernel.log" and it's available on your "syslog" folder, if you have it or core dump file on your server.

Please read this for extracting log files from core dump file: VMware KB: Extracting a core dump file from the VMKCore diagnostic partition following a purple diag...

-------------------------------------------------------------------------------------
Davoud Teimouri - https://www.teimouri.net - Twitter: @davoud_teimouri Facebook: https://www.facebook.com/teimouri.net/
AlbertWT
Virtuoso
Virtuoso

Ok, does generating log through the vsphere console SSH can cause outage?

/* Please feel free to provide any comments or input you may have. */
Reply
0 Kudos
DavoudTeimouri
Virtuoso
Virtuoso

I don't know buddy.

We can talk about that after checking log files and check VMs that they have E1000 NIC.

-------------------------------------------------------------------------------------
Davoud Teimouri - https://www.teimouri.net - Twitter: @davoud_teimouri Facebook: https://www.facebook.com/teimouri.net/
AlbertWT
Virtuoso
Virtuoso

Hi Davoud,

The VM in the PSoD DCTS837 does not have e1000 vNIC, it is using VMXNet3, I'll look into the ESXi log now.

/* Please feel free to provide any comments or input you may have. */
Reply
0 Kudos
Dee006
Hot Shot
Hot Shot

Hi Albert,

Here is general KB article for troubleshooting the exception 14 PSOD

http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=102018...

Do you have set an Bandwidth limit in the vds switch?

If Yes,remove the limit and try to update the patch Build Number 1157734

AlbertWT
Virtuoso
Virtuoso

Thanks Dee,

No i do not implement vDS in the VCenter.

/* Please feel free to provide any comments or input you may have. */
Reply
0 Kudos
Borja_Mari
Virtuoso
Virtuoso

Hello,

IMHO it sounds like a hardware issue. Maybe the problem is with RAM or some core/cpu ...

I'd recommend you to open a case with vmware support.

Can you take photograph of another PSOD?

Best regards,

Pablo

------------------------------------------------------------------------------------------------- PLEASE CONSIDER AWARDING any HELPFUL or CORRECT reply. Thanks!! Por favor CONSIDERA PREMIAR cualquier respuesta ÚTIL o CORRECTA . ¡¡Muchas gracias!! VCP3, VCP4, VCP5-DCV (VCP550), vExpert 2010, 2014 BLOG: http://communities.vmware.com/blogs/VirtuallyAnITNoob