VMware Cloud Community
andvm
Hot Shot
Hot Shot

Predictive disk failure

Hi,

idrac detected disk issue but vSphere vSAN disks all look healthy.

server is in MM and will migrate off all data in 1hr as default.

how can I identify the disk as think I should remove it from disk group before shutting down for replacement?

its part an AF disk group.

thanks

Reply
0 Kudos
2 Replies
TheBobkin
Champion
Champion

Hello andvm,

Assuming it is an LSI based controller, you should be able to use storcli via SSH to the host to get the naa and/or mount point from the slot details in iDRAC:

How do you identify the physical disk (LSI raid controller) in a VMware vSAN environment?

Other than that, you might be able to identify the disk from whatever metrics iDRAC uses to identify a problem disk (possibly smartinfo).

Even just check vmkernel.log/dmesg on the host might make it apparent if it is logging problem sense codes (e.g. 0x3 0x11).

Bob

Reply
0 Kudos
andvm
Hot Shot
Hot Shot

Thanks, managed to get the SAS Address from iDRAC and then went through all vsan disks naa’s Until I got a match using:

esxcli storage core path list -d naa.....

Since it’s an AF with Dedup and comp I will have to remove the disk group and not just this disk

Reply
0 Kudos