ccalvetbeta's Posts

Update: There was an issue with the vcenter service account used by NSX ALB. It has been fixed and the cluster creation reach new steps now.
Thank you for the reply. I can confirm that the "machine" is in pending state. I have also noticed pending task at the load balancer level. I do not know if they are at the origin of this issue ... See more...
Thank you for the reply. I can confirm that the "machine" is in pending state. I have also noticed pending task at the load balancer level. I do not know if they are at the origin of this issue or a symptom of it. No member in the pool so maybe they are not there because machine is in pending state, or the machine is in pending state because the pool doesn't have the members.  
Attached are the logs generated from https://github.com/vmware/cloud-provider-for-cloud-director/blob/main/scripts/generate-k8s-log-bundle.sh  I have stopped the cse service to keep access to the ep... See more...
Attached are the logs generated from https://github.com/vmware/cloud-provider-for-cloud-director/blob/main/scripts/generate-k8s-log-bundle.sh  I have stopped the cse service to keep access to the ephemeral VM.
I just noticed from Software requirements: VCD 10.3.3.1 (tested). Will work with VCD 10.3.1 or newer NSX-T 3.1.1 Avi 20.1.3 Does CSE next works with newer version of NSX and AVI as well? (So it... See more...
I just noticed from Software requirements: VCD 10.3.3.1 (tested). Will work with VCD 10.3.1 or newer NSX-T 3.1.1 Avi 20.1.3 Does CSE next works with newer version of NSX and AVI as well? (So it is a minimum version and not an exact version requirement) (Theoretically, CSE is only supposed to communicate with Cloud Director and let cloud director communicate with the other.) In our environment which was supported for cse 3.1.3 we are using: NSX-T 3.1.3.4 AVI 21.1.1 And the NSX-T version is managed by the Vcloud Foundation so a downgrade is not an option.
Hi, The cluster creation never completed and the cluster never ready. Then vapp and vm are removed and recreated later in loop. Result of a "cat /var/log/cloud-final.err" Note: I am not sure ... See more...
Hi, The cluster creation never completed and the cluster never ready. Then vapp and vm are removed and recreated later in loop. Result of a "cat /var/log/cloud-final.err" Note: I am not sure if we can login as root with ssh, so screenshot is from the console All pods seems ok. And from the journal i don't see relevant error But few errors earlier of type "412 Precondition Failed"   Seems the latest error in event then is associated to the event of deleting vapp and load balancer     Any suggestion of what should be the next step in troubleshooting?  
Hi, Would it be possible to add pre-requisite health check in the gui? (similar to Distributed Switch health check) It would avoid trying to fix a deployment not working. There could be multiple ... See more...
Hi, Would it be possible to add pre-requisite health check in the gui? (similar to Distributed Switch health check) It would avoid trying to fix a deployment not working. There could be multiple level of health check: Infrastructure: Confirm that all objects have been properly created in the API. (Maybe the logged in user would not have the right to see such settings so this test should run under a different account) User: Confirm the logged in user has all prerequisites permissions. Then the user select in which organization network to simulate a deployment: Confirm network has an ip pool configured with enough free IP address Confirm DNS are configured. Confirm the EDGE is properly configured with access to a load balancer, Confirm Enough external IP addresse available Confirm enough VIP available in the edge. Confirm enough capacity (CPU/Memory/Storage) Confirm sizing policies created. Then deploy a test VM in this network (similar to how the ephemeral VM would be created) Confirm that the test VM has access to DNS server That the VM has access to all URLS needed. (List should be provided in documentation, not all environment can provide full internet access) That the VM has access to Cloud Director (and eventually if certificates are trusted) List non-exhaustive If all test results passed/failed are visible in the gui, it would be easy to pinpoint wrong settings and fix them before even trying to deploy a cluster. Regards,
Thanks for the reply. Unfortunately if I stop CSE service too early, maybe the task that trigger the issue will not be executed in the VM making the VM useless for troubleshooting. This future feat... See more...
Thanks for the reply. Unfortunately if I stop CSE service too early, maybe the task that trigger the issue will not be executed in the VM making the VM useless for troubleshooting. This future feature is really a must have for the GA,
Issue: The vapp is created, then the VM "EPHEMERAL_TEMP_VM". Later the VAPP and VM are removed and the cycle is then repeated. I would like to analyse all logs in the VM. But when it is deleted i... See more...
Issue: The vapp is created, then the VM "EPHEMERAL_TEMP_VM". Later the VAPP and VM are removed and the cycle is then repeated. I would like to analyse all logs in the VM. But when it is deleted i may have missed the latest. Is there an equivalent to the setting "rollback:false" in the legacy CSE?
Hi, So far I have not managed to create a cluster. Creating CAPVCD cluster task stuck at 1%. I am trying to troubleshoot but it is difficult to identify what is wrong from the task/logs without kno... See more...
Hi, So far I have not managed to create a cluster. Creating CAPVCD cluster task stuck at 1%. I am trying to troubleshoot but it is difficult to identify what is wrong from the task/logs without knowing how it is supposed to look like. Would it be possible to provide the following information from a working environment? First step, deploy new clusters (management and workload) from the the new beta plugin and when they are ready: Export all related cloud director tasks Export all related cloud director event. And provide a copy of logs: cse.log journalctl -axel -u cse Such information would be very helpful for troubleshooting. Would it be also possible to provide step by step chart of what is happening when creating a new cluster? Addition: Would it be possible to add the list of all URL that need to be allowed from the vm/nodes?