<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:clearspace="http://www.jivesoftware.com/xmlns/clearspace/rss" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:opensearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>VMware Communities: Message List - Virtual Machine corrupted during snapshot removal?</title>
    <link>http://communities.vmware.com/community/vmtn/general/vm-guest?view=discussions</link>
    <description>Most recent forum messages</description>
    <language>en</language>
    <pubDate>Sun, 02 Aug 2009 05:38:46 GMT</pubDate>
    <generator>Clearspace 1.10.12 (http://jivesoftware.com/products/clearspace/)</generator>
    <dc:date>2009-08-02T05:38:46Z</dc:date>
    <dc:language>en</dc:language>
    <item>
      <title>Re: Virtual Machine corrupted during snapshot removal?</title>
      <link>http://communities.vmware.com/message/1327051?tstart=0#1327051</link>
      <description>&lt;br /&gt;
I've run into this same problem. Since I don't see anyone from VMWare providing a useful answer of any kind, I'll respond, even though this is an old thread.&lt;br /&gt;
&lt;p /&gt;
&lt;p /&gt;
&lt;p /&gt;
The issue is that the above user didn't wait long enough for his snapshot commit to complete. Basically, it takes roughly 4 hours for a 300GB snapshot to commit to the prime vmdk. This is pretty linear of a size/time ratio, assuming modern hardware. See: &lt;br /&gt;
&lt;p /&gt;
&lt;a class="jive-link-external" href="http://communities.vmware.com/thread/182276"&gt;http://communities.vmware.com/thread/182276&lt;/a&gt;&lt;br /&gt;
&lt;p /&gt;
&lt;p /&gt;
&lt;p /&gt;
If you do not wait for the commit to complete, essentially you are corrupting the primary vmdk that any later snapshot depends on. So, even though you may have a proper running snapshot file, any disruption of the primary vmdk will render it useless. Remember: snapshots are essentially based on a differential from the initial vmdk. &lt;br /&gt;
&lt;p /&gt;
Once you get the later errors observed in this post, your only solution is to restore from backup. But hopefully this message will prevent anyone from doing anything drastic like rebooting their ESX platform because of a perceived "hang" at 95%.&lt;br /&gt;
&lt;p /&gt;
&lt;p /&gt;
&lt;p /&gt;
-W &lt;br /&gt;
&lt;p /&gt;
&lt;p /&gt;
&lt;p /&gt;
&lt;br /&gt;</description>
      <pubDate>Sun, 02 Aug 2009 05:38:46 GMT</pubDate>
      <author>warrenv</author>
      <guid>http://communities.vmware.com/message/1327051?tstart=0#1327051</guid>
      <dc:date>2009-08-02T05:38:46Z</dc:date>
      <clearspace:dateToText>3 months, 3 weeks ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: Virtual Machine corrupted during snapshot removal?</title>
      <link>http://communities.vmware.com/message/1111708?tstart=0#1111708</link>
      <description>&lt;div class="jive-quote"&gt;Consolidate Helper- 0 was created by a backup tool. Usually VCB.&lt;/div&gt;
&lt;br /&gt;
Thanks for the info.&lt;br /&gt;
&lt;br /&gt;
I've noticed that a lot of my VMs have a Consolidate Helper snapshot in their VMSD file, even though no snapshots are listed in the VI client and no delta files exist. Some also have other snapshots I have created in the past (and since committed) in their VMSD files, again no listing in VI Client nor any delta files. &lt;br /&gt;
&lt;br /&gt;
Is this normal?&lt;br /&gt;
&lt;br /&gt;
&lt;div class="jive-quote"&gt;One of the delta files could have been locked. Not sure why that would be the case. The errors you see in the vmware.log are related to the delta deletion problem and your restore was the proper thing to do.&lt;/div&gt;
&lt;br /&gt;
Thanks for the info. Any tips of how to track down the underlying cause of the problem?&lt;br /&gt;
&lt;p /&gt;
Cheers,&lt;br /&gt;
Matt Kilham</description>
      <pubDate>Mon, 01 Dec 2008 02:17:35 GMT</pubDate>
      <author>mattjk</author>
      <guid>http://communities.vmware.com/message/1111708?tstart=0#1111708</guid>
      <dc:date>2008-12-01T02:17:35Z</dc:date>
      <clearspace:dateToText>12 months, 1 day ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: Virtual Machine corrupted during snapshot removal?</title>
      <link>http://communities.vmware.com/message/1111515?tstart=0#1111515</link>
      <description>Hello,&lt;br /&gt;
&lt;br /&gt;
Moved to the VI: Virtual Machine and Guest OS forum.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="jive-quote"&gt;Looking at the snapshot data in the VM's .vmsd file after the above occurred, there were two snapshots listed - one was the original snapshot that I'd tried to delete, and the other named "Consolidate Helper- 0". I have no idea where there "Consolidate Helper- 0" one came from, and unfortunately am also not sure if it was present before the attempt to delete snapshots or not - it certainly wasn't listed in the "Snapshot Manager" in VC though.&lt;/div&gt;
&lt;br /&gt;
Consolidate Helper- 0 was created by a backup tool. Usually VCB.&lt;br /&gt;
&lt;br /&gt;
One of the delta files could have been locked. Not sure why that would be the case. The errors you see in the vmware.log are related to the delta deletion problem and your restore was the proper thing to do.&lt;br /&gt;
&lt;br /&gt;
&lt;br&gt;Best regards,&lt;br /&gt;
Edward L. Haletky&lt;br /&gt;
VMware Communities User Moderator&lt;br /&gt;
====&lt;br /&gt;
Author of the book 'VMWare ESX Server in the Enterprise: Planning and Securing Virtualization Servers', Copyright 2008 Pearson Education.&lt;br /&gt;
SearchVMware Blog: &lt;a class="jive-link-external" href="http://itknowledgeexchange.techtarget.com/virtualization-pro/"&gt;http://itknowledgeexchange.techtarget.com/virtualization-pro/&lt;/a&gt;&lt;br /&gt;
Blue Gears Blogs - &lt;a class="jive-link-external" href="http://www.itworld.com/"&gt;http://www.itworld.com/&lt;/a&gt; and &lt;a class="jive-link-external" href="http://www.networkworld.com/community/haletky"&gt;http://www.networkworld.com/community/haletky&lt;/a&gt;&lt;br /&gt;
As well as the Virtualization Wiki at &lt;a class="jive-link-external" href="http://www.astroarch.com/wiki/index.php/Virtualization"&gt;http://www.astroarch.com/wiki/index.php/Virtualization&lt;/a&gt;</description>
      <pubDate>Sun, 30 Nov 2008 16:01:51 GMT</pubDate>
      <author>Texiwill</author>
      <guid>http://communities.vmware.com/message/1111515?tstart=0#1111515</guid>
      <dc:date>2008-11-30T16:01:51Z</dc:date>
      <clearspace:dateToText>12 months, 1 day ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Virtual Machine corrupted during snapshot removal?</title>
      <link>http://communities.vmware.com/message/1111483?tstart=0#1111483</link>
      <description>Hi,&lt;br /&gt;
&lt;p /&gt;
&lt;br /&gt;
We just had the nasty experience of ESX (3.5 update 2) corrupting one of our Virtual Machines while trying to remove a snapshot. I'm interested to see if anyone on here can shed some light on why it might have happened - there are quite a few threads on similar issues but none seem directly equivilent to our experience.&lt;br /&gt;
&lt;br /&gt;
This evening while performing some other work I noticed that the VM in question (W2K8 64-bit, if it matters) had a snapshot that we'd forgotten to remove - the snapshot was over a month old, but relatively small (only 800MB or so in the delta-disks).&lt;br /&gt;
&lt;br /&gt;
I deleted the SS via VirtualCenter, which "worked" on it for a few seconds before erroring out with the message "Doing an online commit, cannot power off". At the same time the Virtual Machine stopped (was powered on when I started the SS commit).&lt;br /&gt;
&lt;br /&gt;
Subsequent attempts to power on the Virtual Machine resulted in VC showing errors like "Failed to power on xxxx on xxx in xxx: A general ssystem error occurred: Internal error". Examining the VM's log files showed the problem was to do with inconsistencies between the delta disks and the main disk:&lt;br /&gt;
&lt;br /&gt;
Nov 30 18:20:57.718: vmx| DISK: Cannot open disk "/vmfs/volumes/9a9d0976-a1cfd695/xxx/xxx-000003.vmdk": The parent virtual disk has been modified since the child was created (18).&lt;br /&gt;
Nov 30 18:20:57.718: vmx| Msg_Post: Error&lt;br /&gt;
Nov 30 18:20:57.718: vmx| &lt;a class="jive-link-external" href="http://msg.disk.noBackEnd"&gt;http://msg.disk.noBackEnd&lt;/a&gt; Cannot open the disk '/vmfs/volumes/9a9d0976-a1cfd695/xxx/xxx-000003.vmdk' or one of the snapshot disks it depends on.&lt;br /&gt;
Nov 30 18:20:57.718: vmx| &lt;a class="jive-link-external" href="http://msg.disk.configureDiskError"&gt;http://msg.disk.configureDiskError&lt;/a&gt; Reason: The parent virtual disk has been modified since the child was created.&lt;br /&gt;
&lt;br /&gt;
Also, the VM's log file at the time that the snapshot deletion was running shows lots of opening &amp;#38; closing of the delta disk and base VMDK files, followed by:&lt;br /&gt;
&lt;br /&gt;
Nov 30 18:11:29.726: vmx| DISKLIB-LINK  : Attach: Content ID mismatch (9bb515b0 != 95b90c26).&lt;br /&gt;
Nov 30 18:11:29.736: vmx| DISKLIB-CHAIN : "/vmfs/volumes/9a9d0976-a1cfd695/xxx/xxx.vmdk" : failed to open (The parent virtual disk has been modified since the child was created).&lt;br /&gt;
Nov 30 18:11:29.738: vmx| DISKLIB-VMFS : "/vmfs/volumes/9a9d0976-a1cfd695/xxx/xxx-000003-delta.vmdk" : closed.&lt;br /&gt;
Nov 30 18:11:29.738: vmx| DISKLIB-VMFS : "/vmfs/volumes/9a9d0976-a1cfd695/xxx/xxx-flat.vmdk" : closed.&lt;br /&gt;
Nov 30 18:11:29.738: vmx| DISKLIB-LIB   : Failed to open '/vmfs/volumes/9a9d0976-a1cfd695/xxx/xxx-000003.vmdk' with flags 0xa (The parent virtual disk has been modified since the child was created).&lt;br /&gt;
Nov 30 18:11:29.738: vmx| DISK: Cannot open disk "/vmfs/volumes/9a9d0976-a1cfd695/xxx/xxx-000003.vmdk": The parent virtual disk has been modified since the child was created (18).&lt;br /&gt;
Nov 30 18:11:29.738: vmx| Msg_Post: Error&lt;br /&gt;
Nov 30 18:11:29.738: vmx| &lt;a class="jive-link-external" href="http://msg.disk.noBackEnd"&gt;http://msg.disk.noBackEnd&lt;/a&gt; Cannot open the disk '/vmfs/volumes/9a9d0976-a1cfd695/xxx/xxx-000003.vmdk' or one of the snapshot disks it depends on.&lt;br /&gt;
Nov 30 18:11:29.738: vmx| &lt;a class="jive-link-external" href="http://msg.disk.configureDiskError"&gt;http://msg.disk.configureDiskError&lt;/a&gt; Reason: The parent virtual disk has been modified since the child was created.&lt;br /&gt;
&lt;br /&gt;
Looking at the snapshot data in the VM's .vmsd file after the above occurred, there were two snapshots listed - one was the original snapshot that I'd tried to delete, and the other named "Consolidate Helper- 0". I have no idea where there "Consolidate Helper- 0" one came from, and unfortunately am also not sure if it was present before the attempt to delete snapshots or not - it certainly wasn't listed in the "Snapshot Manager" in VC though.&lt;br /&gt;
&lt;p /&gt;
No attempts on my behalf managed to get it "working again", so we restored the most recent SAN (NetApp) snapshot (luckily taken 5 minutes before the problem), which seems to have brought the VM back to life again without issues (aside from having to remove from VC / re-add). &lt;br /&gt;
&lt;br /&gt;
Post-restore, a subsequent attempt to delete the ESX snapshot succeeded, with the VM showing no ill effects.&lt;br /&gt;
&lt;p /&gt;
So - can anyone suggest why this may have occurred? While we suffered no data loss I did lose 4 hours of my time - on a Sunday night no less - and would prefer this didn't happy again!! &lt;img class="jive-emoticon" border="0" src="http://communities.vmware.com/images/emoticons/happy.gif" alt=":-)" /&gt;&lt;br /&gt;
&lt;p /&gt;
Cheers,&lt;br /&gt;
Matt Kilham&lt;br /&gt;
&lt;br /&gt;
Message was edited by: mattjk</description>
      <pubDate>Sun, 30 Nov 2008 12:16:14 GMT</pubDate>
      <author>mattjk</author>
      <guid>http://communities.vmware.com/message/1111483?tstart=0#1111483</guid>
      <dc:date>2008-11-30T12:16:14Z</dc:date>
      <clearspace:dateToText>12 months, 1 day ago</clearspace:dateToText>
      <clearspace:replyCount>3</clearspace:replyCount>
    </item>
  </channel>
</rss>

