After updating a mixture of G7, G8 and G9 VMHosts to 5.5 update 3a and Cookbook release September 2015 (SPP JUN 15). I started having host errors specifically on my G7 hardware. One host went so far as to disconnect from VCenter.
I went the full boat update on the OS to bring it fully in line with HP's recipe. This was the first time hitting drivers in quite a while. So when the first errors started coming in, I immediately suspected the HPSA v106 ( Version:184.108.40.206-1OEM ) which was updated from v50 ( Version:220.127.116.11-1OEM)
VCenter is reporting errors
Lost access to volume 56424481-7f094eb0-8ee6-
80c16e6e15e0 (VMHost_local) due to
connectivity issues. Recovery attempt is in
progress and outcome will be reported shortly.
info 2/2/2015 9:00:45 AM (VMHost_local)
VMKernel.log was reporting some conflict's with claim rules between PowerPath and the NMP for the local disk but that was cleared.
2015-11-29T07:53:04.162Z cpu22:33327)WARNING: LinScsi: SCSILinuxAbortCommands:1843: Failed, Driver hpsa, for vmhba1
2015-11-30T19:37:48.614Z [248C4B70 info 'Vimsvc.ha-eventmgr'] Event 1820 : Lost access to volume 4ffd89b4-760e9689-81e3-e83935a81a45 (gldpiesx002_local) due to connectivity issues. Recovery attempt is in progress and outcome will be reported shortly.
2015-11-30T19:37:48.615Z [248C4B70 info 'Vimsvc.ha-eventmgr'] Event 1821 : Successfully restored access to volume 4ffd89b4-760e9689-81e3-e83935a81a45 (gldpiesx002_local) following connectivity issues.
I opened tickets with both HP and VMWare
VMWare came back with the first fix which was to upgrade the hpsa driver(hpsa 18.104.22.168-1OEM) which can be downloaded from https://my.vmware.com/web/vmware/details?downloadGroup=DT-ESXI55-HP-HPSA-550114-1OEM&productId=353. however overnight , the errors returned.
HP has suggested back-revving to 22.214.171.124-1(1 Oct 2014) but from a previous discussion here [ Datastore / Disk latency problems with HP ProLiant DL380 G7 - HP Smart Array P410i controller after ... ] I believe that was also a bad version.
I'm going to wander down the road a bit and see where it leads and if necessary go back to the 70 or even 50 release
Thanks for pointing on this version.
ProLiant DL370 G6 with P410i - RAID 5 + spare disk.
SPP 2016.04.0, firmware 6.64
ESXi 6.0 U1b
Running with scsi-hpsa_126.96.36.199 for about 4 hours - no warnings in logs, no latency peaks. 🙂
Let's wait, if it is not causing PSOD in few days...
Correct me if I'm wrong, but what I remember, working version was not released in past 2 years.