The_Moves
Contributor
Contributor

ESXi 5.5 U1 Realease-1746018 Purple/Pink Screen of Death

I am having issues making sense of this pink screen of my ESXi 5.5 install, can someone help?

I've only had the system up for a week now. All RAM tested fine. So far, this seems to appear randomly. I'm considering going back to 5.1.

photo.JPG

Tags (2)
0 Kudos
6 Replies
NealeC
Hot Shot
Hot Shot

There are existing KBs on this

http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=102021...

That one describes what you are seeing, but recommends you collect and analyse the logs to find the root cause using

VMware KB: Extracting the log file after an ESX or ESXi host fails with a purple screen error

However it does appear that it thinks one of your CPUs if faulty.

Do you have hardware monitoring/logs such as an IML on an HP server?

That may also tie in with the above logs to show when a hardware failure/issue/overheat occurred.

Regards

Chris

-------------- If you found this or any other answer useful please consider the use of the Helpful or Correct buttons to award points. Chris Neale VCIX6-NV;vExpert2014-17;VCP6-NV;VCP5-DCV;VCP4;VCA-NV;VCA-DCV;VTSP2015;VTSP5;VTSP4 http://www.chrisneale.org http://www.twitter.com/mrcneale
The_Moves
Contributor
Contributor

I will make sure to grab a core dump/extract logs the next time this happens, per your link. In the mean time, I will run some I/O intensive tasks to see if I can get it to 'purple screen'.

I had also looked the KB article before for the error I am experiencing. My 'purple screen' is much smaller. I have downgraded to ESXi 5.1u1, and will step it up to new u2 once I can assume it is stable.

0 Kudos
epa80
Hot Shot
Hot Shot

We're seeing the identical PSOD. We've gotten it on 2 Blade (BL460c Gen6) in our server cluster in the past 4 weeks. We've opened a ticket with VMware regarding it. Did you ever manage to get a reolution? Our other datacenter server cluster is on the same version of ESX, but (so far) hasn't had the issue. Those are G7s though.

0 Kudos
vfk
Expert
Expert

I would suggest patching your systems to the latest patches and also open SR with VMware to perform RCA, PSOD is not something you can easily troubleshoot, messages can be cryptic.  But looking the attached screen you may be having some issues with your memory or memory lookup.

VMware KB: Understanding a Failed to ack TLB invalidate purple diagnostic screen

http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=200009...

vfk

--- If you found this or any other answer helpful, please consider the use of the Helpful or Correct buttons to award points. vfk Systems Manager / Technical Architect VCP5-DCV, VCAP5-DCA, vExpert, ITILv3, CCNA, MCP
0 Kudos
vlho
Hot Shot
Hot Shot

0 Kudos
epa80
Hot Shot
Hot Shot

Thanks for the link. it doesn't QUITE match our's, but, who knows maybe it's quite similar. Here's what we have seen:

Capture.JPG

0 Kudos