VMware Cloud Community
Alhazred242
Contributor
Contributor

VSCA Update failed again. (6.7.0.2xxx->6.7.0.32...)

Hello,

same problem like some month before.

We habe here a VSCA (Deploy from a BareMetal with the Migration Manager). I try to Install the last Update (i try the last 3) and the VMWare vCenter-Server (and 2 other Services) crash at Start.

I check one Logfile and see only a exit 10.

I need some Tipps for Troubleshooting. I have the SupportBundle for Investigation and start a Rollback.

Reply
0 Kudos
19 Replies
NathanosBlightc
Commander
Commander

Hi

Give us more information about your problem's situation please

Did you upgrade the VCSA to up3 completely and then issue has been occurred?

Did you check remaining spaces of your VCSA virtual Disks (VMDK)?? (df -h)

and one point: you said bare metal... do you means your old VCSA is a physical/Windows-Based vCenter server?

Please mark my comment as the Correct Answer if this solution resolved your problem
Reply
0 Kudos
Alhazred242
Contributor
Contributor

Hi,

here some more Information:

We use VMWare ESXi since 8 years or so. The vCenter Server was long time a physical Server (2008R2). At the last big Upgrade we use the Chance and switch with the Migration Assistant to a VSCA.

The Appliance works fine. The Conditions are good. The Management Interface looks fine.

The Installed Version is 6.7.0.2100. I Update the Server over the Management Port.

First i try the update to .3000 and it fails. After the Restart, the Center Server crash direct at the Start. AFAIK ( in can check it tomorrow) i see in the vpxd.log Exit Code 1.

I have restore the Backup. Restart the Server and go back to the "daily doing".

I try then the next Update in May (.3100?) and the last time the Update mid July. Same Problem. The most Services Starts fine, Center Server crashes at start.

The Disks looks good (i check it tomorrow again). I think, there is a bad Config since long time (we Migrate the Server from 5->5.5->6.0->6.5). The only 3rd Party Plugin is IBM Spectrum Protect (v 8.4.1).

Reply
0 Kudos
IRIX201110141
Champion
Champion

2 Questions:

- Is your Host based on a Intel Skylake or newer?

- If you have updated VCSA vHW to higher than 10/13?

Regards,
Joerg

Reply
0 Kudos
NathanosBlightc
Commander
Commander

Did you check the firstboot.log? There may be some information about all problematic state of your VCSA.

Is your PSC embedded or external? did you check its services?

What's the status of license of vCenter? Did you check its correctness after upgrade?

If you migrate your VCSA's VM to another ESXi, is the problem still exist or not?

Please mark my comment as the Correct Answer if this solution resolved your problem
Reply
0 Kudos
Alhazred242
Contributor
Contributor

Processor is Xeon E5-3650 (Sandy Bridge)

VSCA HW is 13.

The Hostsystem is a ESX 6.5 on IBM X Series.

I try at the First Tests a nother ESXi Server, and same Problem.

Licences are ok (but this Service also Crash)

I check now the firstboot log.

Reply
0 Kudos
Alhazred242
Contributor
Contributor

On another Host it is the same.  PSC is embedded. I can try another Update today, and make a List which Services are crashed.

Reply
0 Kudos
IRIX201110141
Champion
Champion

In April this year i updated a customer environment

- 6.5/6.7? -> VCSA 6.7latest

- Replace Dell Server with R740(Skylake)

All went very well and runs 2 days.... until i schedule the vHW upgrade of the VCSA to the latest/highest and pressing the restart button.  After that our VCSA reacts as same as hours which mean a lot of servcies starting very well but at the end some are crashing or not starting and already started services disapeers.

We opened a #SR and it took a long fight with GSS and after 3 days they found the solution

- As we knows within the vHW also its hardcoded which CPU features are presented to the VM(GuestOS)

- The VCSA contains libraries which acts differently when seeing a Skylake and the Features are enabled. Certains code contains bugs the the services crashed or never starts

The engineer de-register the VCSA VM, edit the *.vmx and change vHW to 10. Register and starts the VM and our VCSA was back into the game and runs perfectly.

Your CPUs are older so its another issure youre facing.  I suggest to open a #SR.

GSS als says they seen a lot of problem when the VCSA is jonied a Windows AD. The suggest to use the "LDAP" option to connect to AD rather than using the kerberos integration.

Regards,
Joerg

Reply
0 Kudos
Alhazred242
Contributor
Contributor

i can check the HW. I upgrade the HW for a last change yesterday. The redeployment from the Backup with the old vHW ist easier.

I restore the VSCA from the Backup. Select Medium Type for more resources. vHW Type is 10.

Shutdown the VM for Snapshot.

Reboot and Install the Upgrade at the WebUI.

Same.

Failed Services:

Content Library

Image Builder (we don´t use it)

vCenter Server

Update Manager

Profile Storage Service

I don´t have access to Licence Server to check it. I try at Monday Backup and Restore a VSCA to a new ESXi with the .3200 Version.

I check the first logs, and its look like a fail from vmware-sps. I think a need disable the 3rd Party Plugin and Check the Configs.

Reply
0 Kudos
RajeevVCP4
Expert
Expert

How you are trying to update

By command line or GUI.

What error you are getting , provide full name of ISO which you are using for upgrade.

Rajeev Chauhan
VCIX-DCV6.5/VSAN/VXRAIL
Please mark help full or correct if my answer is use full for you
Reply
0 Kudos
Alhazred242
Contributor
Contributor

I use the WebUI (Port 5480)

Patch for VMWare VCSA 6.7.0.32000 from Jul 16.

Reply
0 Kudos
ChrisFD2
VMware Employee
VMware Employee

Have you logged a support call?

Regards,
Chris
VCIX-DCV 2024 | VCIX-NV 2024 | vExpert 6x | CCNA R&S
Reply
0 Kudos
Alhazred242
Contributor
Contributor

Yes i have.

I start later with the next try.

Reply
0 Kudos
Alhazred242
Contributor
Contributor

So, some testing and clean up and found a Problem.

I can't delete one Datacenter. There are a datastore who has no Host and some old Networks (without any hosts).

Any hint how can I delete these? I can access the DB or shell if we need it.

Reply
0 Kudos
NathanosBlightc
Commander
Commander

Use in CLI (Shell/SSH) following commands to show your datastore's attributes:

esxcli storage vmfs extent list

esxcli storage core device list

Then copy naa.xxx of considered for next command:

esxcli storage core device detached remove -d naa.xxx

Please mark my comment as the Correct Answer if this solution resolved your problem
Reply
0 Kudos
NathanosBlightc
Commander
Commander

And Also if you cannot remove it with ESXCLI, you can try this if you connect to the VCSA DB by management tools like PgAdmin and remove the specified datastore object.

I explained how to do it in the following link:

Connect and Manage VCSA Database (PostgreSQL)

Please mark my comment as the Correct Answer if this solution resolved your problem
Reply
0 Kudos
Alhazred242
Contributor
Contributor

I need to try it with the DB connect. The host with the Datastore are no exist anymore and I don't have esxcli on the VCSA right? I have found also some Networks, which are no more attached to anything and I cannot remove then.

After I remove the datastore, it is the same. I cannot remove the Datacenter and the Update failed.

I check at this time the Database. Also I check the Backupfiles, when I create a new one, without any stats. I think, it is the best way, to Backup all Settings and deploy a new clean VM, without any Config Errors.

Reply
0 Kudos
NathanosBlightc
Commander
Commander

Yes you don't have ESXCLI in VCSA, it's just for ESXi host. So you have to only remove your ORPHANED objects (like that datastore) from vCenter Server database.  But before that, I guess you know, you can remove datastore with PowerCLI but you need to mentioned host's name (required syntax) , So if you can, please try this PowerCLI too:

Remove-Datastore -Datastore Datastore -VMHost ESXi_IPAddress -Confirm:$false

and then you can remove Datacenter object:

Remove-Datacenter Datacenter

Also in following link describe about how to remove datastore forcefully with PowerCLI

PowerCLI script to forcibly delete a Datastore that is reported as in use. – The Lowercase w

Please mark my comment as the Correct Answer if this solution resolved your problem
Reply
0 Kudos
Alhazred242
Contributor
Contributor

esxcli I used in the past for a similar Problem.

Here its looks like a little bit more chaos.

The Datastore are not attached to any Host. In the Datacenter are no hosts attached. The hosts that I removed, don't attached to this datastore. The Networks are from an old Project. I cannot remove the Networks, the option is not active. I think, the other guys just shutdown the Hosts and don't cleanup the environment. No VMs are attached to the datastore/network (the lists are empty).

I check the folders with the apiexplorer for the Datastore. I think I try to remove the artifacts from the DB, but IMHO it is the best, I save the Configs, Keys, DRS Settings and Setup a new VCSA. The VCSA was in the past a Windows System (since v5) and we have so many changes since then.

Reply
0 Kudos
Alhazred242
Contributor
Contributor

I give it up. I remove the datastore from the DB, and can't remove the Datacenter. Try to remove the Networks (also refresh the PG Views for entity and so), the Appliance are broken.

I setup my new VCSA (with the evaluation License), apply my ESXi Server License and move all active Hosts (create the Clusters, Resource Pools and so on) After then, I only change the IP.

I think now the Config is clean (I check the datastores and VM for old ISO mounts etc.).

I take my initial Backup. Tomorrow I only need add IBM Spectrum Protect. and then everything is work back again.

Thanks for your your help.

Reply
0 Kudos