VMware Cloud Community
JonGarlock
Contributor
Contributor
Jump to solution

replication suddenly failing: status "not active", error "invalid state"

I have been replicating between sites for some time.  Today, all but 1 of the VM replication states changed to "Not Active" .. and later, of course, to "Not Active (RPO Violation)".  When I try to force a sync to happen now, web client gives me an error that a sync is already in progress.  in vCenter, it reports that the "VM is in an invalid state".

I can't find anything about this anywhere.  have you seen this before?  I'm not sure what next steps I can take here;

Reply
0 Kudos
1 Solution

Accepted Solutions
LachezarKozhuha
Contributor
Contributor
Jump to solution

Hi

In order to investigate we need support bundles. Could you file a SR and attach the support bundles? Most likely it's a connectivity problem between between the host and the replica site.

Thanks,

Lachezar

View solution in original post

Reply
0 Kudos
3 Replies
JonGarlock
Contributor
Contributor
Jump to solution

hostd.log

2013-04-18T20:04:16.026Z [78CA8B90 info 'TaskManager' opID=3440367b-3b03-4f3c-b4de-04676ebdf35c-26-e4] Task Created : haTask--vim.HbrManager.createInstance-395847520
2013-04-18T20:04:16.027Z [78CA8B90 error 'Hbrsvc' opID=3440367b-3b03-4f3c-b4de-04676ebdf35c-26-e4] ReplicationGroup (groupID=GID-b22ccbe1-3ff3-432c-b4bf-0343dadbe967) Failed to start instance, group is in incorrect state: lwd delta
2013-04-18T20:04:16.028Z [78CA8B90 info 'ha-eventmgr' opID=3440367b-3b03-4f3c-b4de-04676ebdf35c-26-e4] Event 17120 : Failed to start delta for virtual machine XXXXX on host XXXXX in cluster XXXXXX in ha-datacenter: Virtual machine is in an invalid state.
2013-04-18T20:04:16.028Z [78CA8B90 info 'Default' opID=3440367b-3b03-4f3c-b4de-04676ebdf35c-26-e4] AdapterServer caught exception: vim.fault.ReplicationVmFault
2013-04-18T20:04:16.028Z [78CA8B90 info 'TaskManager' opID=3440367b-3b03-4f3c-b4de-04676ebdf35c-26-e4] Task Completed : haTask--vim.HbrManager.createInstance-395847520 Status error
Reply
0 Kudos
LachezarKozhuha
Contributor
Contributor
Jump to solution

Hi

In order to investigate we need support bundles. Could you file a SR and attach the support bundles? Most likely it's a connectivity problem between between the host and the replica site.

Thanks,

Lachezar

Reply
0 Kudos
JonGarlock
Contributor
Contributor
Jump to solution

I don't have bundles but I wish I had seen your reply earlier.  It turned out that the recovery site VSR appliance couldn't ping 2 of the hosts from the protected site.  Just those 2 sites.  After a bunch of network troubleshooting, we simply rebooted the affected hosts and connectivity was restored.

Reply
0 Kudos