VMware Cloud Community
MC1903
Enthusiast

Add VI Workload Domain fails on subtask "Validate vSAN disks for ESXi Host(s)"

Hi,

I am trying to complete an Add VI Workload Domain (vSAN) operation in my lab with 4 nested ESXi hosts, and it stops at the subtask "Validate vSAN disks for ESXi Host(s)" with this error:

Description: Validate vSAN disks for ESXi Host(s)
Progress Messages: vSAN Disks Validation on the ESXi Host(s) failed: ESXi Host vcf01-m01-esx05.vcf.momusconsulting.com does not have valid boot disk (Expecting at least one), ESXi Host vcf01-m01-esx06.vcf.momusconsulting.com does not have valid boot disk (Expecting at least one), ESXi Host vcf01-m01-esx07.vcf.momusconsulting.com does not have valid boot disk (Expecting at least one), ESXi Host vcf01-m01-esx08.vcf.momusconsulting.com does not have valid boot disk (Expecting at least one)
Error

Message: vSAN Disks Validation on the ESXi Host(s) failed: ESXi Host vcf01-m01-esx05.vcf.momusconsulting.com does not have valid boot disk (Expecting at least one), ESXi Host vcf01-m01-esx06.vcf.momusconsulting.com does not have valid boot disk (Expecting at least one), ESXi Host vcf01-m01-esx07.vcf.momusconsulting.com does not have valid boot disk (Expecting at least one), ESXi Host vcf01-m01-esx08.vcf.momusconsulting.com does not have valid boot disk (Expecting at least one)
Remediation Message: Make sure that ESXi Hosts disks are eligible for use by vSAN

Reference Token: BJ47Q8

Cause:

 

SDDC Manager Version: 4.5.0

ESXi Version: 7.0.3-20328353

 

My nested ESXi hosts are booted and thus have valid boot disks, so I am lost as to why this error is being kicked out.

Each host has 4 disks: 1 x 12GB boot disk, 1 x 100GB vSAN cache disk and 2 x 250GB vSAN capacity disks.

The vSAN capacity disks have been tagged as IsCapacityFlash = 1.

I have tried manually creating the /store/.capacityflash.json files on each host, as per https://kb.vmware.com/s/article/52586 

 

[root@vcf01-m01-esx05:/vmfs/volumes/631a0f8a-7e2b33b5-a1fb-005056a94fe1/store] vdq -q > /store/.capacityflash.json
[root@vcf01-m01-esx05:/vmfs/volumes/631a0f8a-7e2b33b5-a1fb-005056a94fe1/store] cat /store/.capacityflash.json
[
{
"Name" : "naa.6000c29580337e490eeef16ca096f84c",
"VSANUUID" : "",
"State" : "Ineligible for use by VSAN",
"Reason" : "Has partitions",
"IsSSD" : "1",
"IsCapacityFlash": "0",
"IsPDL" : "0",
"Size(MB)" : "12288",
"FormatType" : "512n",
"IsVsanDirectDisk" : "0"
},

{
"Name" : "naa.6000c290d6ff5b9c627918f381ab94f0",
"VSANUUID" : "",
"State" : "Eligible for use by VSAN",
"Reason" : "None",
"IsSSD" : "1",
"IsCapacityFlash": "1",
"IsPDL" : "0",
"Size(MB)" : "256000",
"FormatType" : "512n",
"IsVsanDirectDisk" : "0"
},

{
"Name" : "naa.6000c29dc55120c5120f4ea2b0819a08",
"VSANUUID" : "",
"State" : "Eligible for use by VSAN",
"Reason" : "None",
"IsSSD" : "1",
"IsCapacityFlash": "1",
"IsPDL" : "0",
"Size(MB)" : "256000",
"FormatType" : "512n",
"IsVsanDirectDisk" : "0"
},

{
"Name" : "naa.6000c292c78fe2a69bd08fafb498c4c6",
"VSANUUID" : "",
"State" : "Eligible for use by VSAN",
"Reason" : "None",
"IsSSD" : "1",
"IsCapacityFlash": "0",
"IsPDL" : "0",
"Size(MB)" : "102400",
"FormatType" : "512n",
"IsVsanDirectDisk" : "0"
}

]
[root@vcf01-m01-esx05:/vmfs/volumes/631a0f8a-7e2b33b5-a1fb-005056a94fe1/store]
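
For anyone comparing several hosts, the `vdq -q` output above can be summarised with a short script. This is only a minimal sketch, assuming the output parses as JSON exactly as pasted; `summarize_disks` is a hypothetical helper name, not part of any VMware tool, and the sample is trimmed to two of the four disks and a subset of their fields.

```python
import json

# Trimmed sample of the `vdq -q` output shown above (two of the four disks).
SAMPLE = '''
[
{
"Name" : "naa.6000c29580337e490eeef16ca096f84c",
"State" : "Ineligible for use by VSAN",
"Reason" : "Has partitions",
"IsSSD" : "1",
"IsCapacityFlash": "0",
"Size(MB)" : "12288"
},
{
"Name" : "naa.6000c290d6ff5b9c627918f381ab94f0",
"State" : "Eligible for use by VSAN",
"Reason" : "None",
"IsSSD" : "1",
"IsCapacityFlash": "1",
"Size(MB)" : "256000"
}
]
'''


def summarize_disks(vdq_json):
    """Return (name, size_gb, eligible, reason) for each disk in vdq -q output."""
    rows = []
    for disk in json.loads(vdq_json):
        size_gb = int(disk["Size(MB)"]) / 1024  # vdq reports sizes in MB
        eligible = disk["State"].startswith("Eligible")
        rows.append((disk["Name"], size_gb, eligible, disk["Reason"]))
    return rows


if __name__ == "__main__":
    for name, size_gb, eligible, reason in summarize_disks(SAMPLE):
        print(f"{name}  {size_gb:.0f} GB  eligible={eligible}  reason={reason}")
```

In this sample, the only disk that already has partitions is the 12288 MB (12GB) boot disk; the other disk is reported as eligible for vSAN.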

 

Additionally, what is the Reference Token: BJ47Q8 referring to? Is there a lookup or log I can correlate it with?

Any help or suggestions are very much welcome.

Cheers,

M

 

5 Replies
RajeevVCP4
Expert

Are you able to add the hosts without the SDDC Manager process? I mean, simply add the hosts to a cluster on another vCenter Server.

https://vkernel.nl/wp-content/uploads/2020/07/VCF-Lab-Constructor-Install-Guide-4Rev4.pdf

 

Rajeev Chauhan
VCIX-DCV6.5/VSAN/VXRAIL
Please mark as helpful or correct if my answer is useful to you.
mannharry
Hot Shot

Can you check and confirm that there is no underlying issue with the disks on any of the ESXi hosts?

https://kb.vmware.com/s/article/89738

AbbedSedkaoui
Enthusiast

Hi @MC1903, set the boot disk on each ESXi host to 32GB; that will do it. 👍

https://kb.vmware.com/s/article/82515
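
The size check suggested here can be sketched against the `vdq -q` fields from the original post. This is a rough illustration, not what SDDC Manager actually runs: the 32GB threshold is the value given in this reply, `undersized_boot_disks` is a hypothetical helper, and treating any disk reported as "Has partitions" as the boot disk is an assumption.

```python
MIN_BOOT_DISK_GB = 32  # value suggested in this reply, not read from SDDC Manager


def undersized_boot_disks(disks, min_gb=MIN_BOOT_DISK_GB):
    """Return names of likely boot disks that are smaller than min_gb.

    Assumption: a disk that vdq reports with Reason "Has partitions"
    is the boot disk.
    """
    small = []
    for disk in disks:
        if disk.get("Reason") != "Has partitions":
            continue
        size_gb = int(disk["Size(MB)"]) / 1024  # vdq reports sizes in MB
        if size_gb < min_gb:
            small.append(disk["Name"])
    return small


if __name__ == "__main__":
    # Two disks from the original post: the 12288 MB boot disk and the cache disk.
    disks = [
        {"Name": "naa.6000c29580337e490eeef16ca096f84c",
         "Reason": "Has partitions", "Size(MB)": "12288"},
        {"Name": "naa.6000c292c78fe2a69bd08fafb498c4c6",
         "Reason": "None", "Size(MB)": "102400"},
    ]
    print(undersized_boot_disks(disks))
```

With the disks from the original post, only the 12GB boot disk is flagged; a boot disk resized to 32GB (32768 MB) would pass.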

P.S. If you're using William Lam's script, download the latest version, as it was updated recently: https://github.com/lamw/vcf-automated-lab-deployment

Cheers

Abbed

MC1903
Enthusiast

Apologies for the delayed response.

@RajeevVCP4 - I didn't even think to try them as a normal vSAN Cluster; very good shout! 

@mannharry - I did see KB89738; I didn't have mismatched vSAN disk sizes across the nested hosts.

@AbbedSedkaoui - Yes; I think the boot disk size was the problem. It's a shame that the errors SDDC Manager kicks out are so vague.

Thank you all for your suggestions. 

I blew the entire VCF 4.4 environment away and will rebuild it as VCF 4.5 (if I can get past another issue with Cloud Builder; if not, I fear another VMTN post soon).

Cheers all.

M

AbbedSedkaoui
Enthusiast

You're welcome.

If it occurs again during the VCF 4.5 management domain bring-up, the Cloud Builder debug log

/opt/vmware/bringup/logs/vcf-bringup-debug.log

will show:

"VSAN_MIN_BOOT_DISKS.error"
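
A search for that key could look something like the sketch below. The log path and the key are quoted from this reply; `find_lines` is a hypothetical helper, and the sketch assumes Python 3 is available wherever the log is read (or that the log has been copied to another machine).

```python
from pathlib import Path

# Path and key are quoted from this reply.
LOG_PATH = "/opt/vmware/bringup/logs/vcf-bringup-debug.log"
ERROR_KEY = "VSAN_MIN_BOOT_DISKS.error"


def find_lines(log_path, needle):
    """Return (line_number, text) for every line in log_path containing needle."""
    hits = []
    with open(log_path, encoding="utf-8", errors="replace") as handle:
        for number, line in enumerate(handle, start=1):
            if needle in line:
                hits.append((number, line.rstrip("\n")))
    return hits


if __name__ == "__main__":
    if Path(LOG_PATH).exists():
        for number, text in find_lines(LOG_PATH, ERROR_KEY):
            print(f"{number}: {text}")
    else:
        print(f"{LOG_PATH} not found; run this where the log is available")
```

The function is a plain substring match, so it can be pointed at a copy of the log under a different path.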

Note: If your query is answered, please mark this response as "Correct" or "Helpful".

Cheers

Abbed

strivevirtually.net 
