I have a problem where my home ESXi 5.0.0 (build 623860) host crashes randomly. When it crashes, all my VMs stop responding. I can still connect to ESXi from the vSphere Client, but I cannot shut down the VMs or ESXi itself.
The following warning shows up at random times in my log:
Device
t10.ATA_____WDC_WD20EARX2D00PASB0_____
____________________WD2DWCAZAF671421
performance has deteriorated. I/O latency
increased from average value of 1200
microseconds to 36610 microseconds.
warning
14.07.2013 00:09:38
localhost.localdomain
This is an RDM disk that I have forwarded to a WHS 2011 VM. However, this warning does not appear in the log today, when the system crashed.
Now that the system has crashed, these are the last entries in the vSphere log:
Logging to storage has failed. Logs are no longer
being stored locally on this host.
error
01.08.2013 04:34:33
localhost.localdomain
Lost connectivity to storage device t10.ATA_____
OCZ2DAGILITY3___________________________
_OCZ2D5HWPJN6TFPA32Z12. Path vmhba32:C0:
T0:L0 is down. Affected datastores: "",
"datastore1".
error
01.08.2013 04:36:10
localhost.localdomain
(This is the SSD that ESXi is installed on; it also hosts some VMs.)
Lost access to volume 504bbe23-8c141760-08fe-
c860008c3e4b (datastore1) due to connectivity
issues. Recovery attempt is in progress and
outcome will be reported shortly.
info
01.08.2013 04:36:10
datastore1
Lost connection to server 192.168.10.201 mount
point /mnt/homepool/datastore mounted as
2203d1fd-c4cfd23f-0000-000000000000 (NAS).
error
01.08.2013 05:03:10
localhost.localdomain
(This mount point is on a PIKE card forwarded to a FreeNAS VM; FreeNAS is installed on the SSD hosting ESXi.)
Asus P8B 4E/L mobo
16 GB ECC RAM
Xeon E3-1230 (Sandy Bridge)
Asus 2008 PIKE
Could it be a storage controller issue? The RDM hard disk (hosting the WHS 2011 VM) and the SSD (hosting ESXi plus some VMs) are on the same motherboard controller.
My system is rather complex, with a lot of PCI devices forwarded to VMs. I can explain it in more detail if someone has hints toward a solution.