Skip navigation
VMware

This Question is Possibly Answered

1 "correct" answer available (10 pts) 2 "helpful" answers available (6 pts)
1,998 Views 12 Replies Last post: Jan 12, 2010 8:21 AM by dbrashear RSS
dbrashear Novice 16 posts since
Apr 13, 2007
Currently Being Moderated

Jan 3, 2010 7:49 PM

Snowleopard server guest panics randomly since 10.6.2 update?

 

Running Fusion 3.0.1 on a 32 bit booted Macbook Pro 10.6.2, my 10.6 guest now randomly panics in various processes (servermgrd, for example). This didn't happen with 10.6.1.

 

 

Am I alone? It's a throwaway VM... did I get unlucky?

 

 

Guru VMware Employees 11,935 posts since
Oct 15, 2006
Currently Being Moderated
1. Jan 4, 2010 11:53 AM in response to: dbrashear
Re: Snowleopard server guest panics randomly since 10.6.2 update?

panic.log? Are you using multiple vCPUs (if so, try dropping down to 1 - SMP OS X guests are very finicky).

Guru VMware Employees 11,935 posts since
Oct 15, 2006
Currently Being Moderated
3. Jan 4, 2010 3:40 PM in response to: dbrashear
Re: Snowleopard server guest panics randomly since 10.6.2 update?

no panic.log being generated, for reasons I can't ascertain. (you mean PanicReporter log, yes? panic.log died in 10.4?)

 

Yes, whatever it's being called these days (I thought panic.log was through 10.5, and PanicReporter + date sounds right for 10.6). The file with the backtrace thingamajig

 

I enabled two-kernel debugging and I'll get a backtrace from gdb.

 

That's probably good too.

HPReg Expert VMware Employees 470 posts since
Dec 22, 2004
Currently Being Moderated
6. Jan 12, 2010 1:51 AM in response to: dbrashear
Re: Snowleopard server guest panics randomly since 10.6.2 update?

We haven't tested 10.6.2 in a Fusion 3 VM much, as 10.6.2 hadn't been released at the time we released Fusion 3.0.1.

 

Thanks for providing the panic logs, that is very helpful.

 

There is only one piece of code which uses the string "Spinlock acquisition timed out:" in the kernel source code (xnu), and it is present in both the 10.6.1 and the 10.6.2 kernel, so the fact that you are starting to see this seems unrelated to your 10.6.1 -> 10.6.2 upgrade.

 

To workaround this issue, I recommend you add the following option to your kernel boot arguments:

slto_us=0xffffffff

which will effectively bump the spinlock acquisition timeout to about 4300 seconds.

 

Then run for a while and let us know what you observe:

 

1) If the VM seems to run fine, then the issue is related to virtualization, but it is impossible to fix by design. We need to work with Apple so they bump their timeout when they detect that they are virtualized.

 

2) If the VM sometimes now hangs (for 4300 seconds!) instead of panicing, then this is a bug in the Apple kernel and you should report it to them.

 

Now that being said, since you said your VM only has one virtual CPU, then I don't think it can possibly be #1. I think it has to be #2, i.e. I think there is a buggy code path in the Apple kernel which attempts to acquire a spinlock (apparently on the interrupt path) which is already acquired (probably by a non-interrupt path). So I think you should let Apple's Panic Reporter report the issue to Apple engineers, so they can work on it.

HPReg Expert VMware Employees 470 posts since
Dec 22, 2004
Currently Being Moderated
8. Jan 12, 2010 7:23 AM in response to: dbrashear
Re: Snowleopard server guest panics randomly since 10.6.2 update?

PanicReporter doesn't offer

 

It does not pop-up when you restart after the panic, or it pops-up but does not offer to send the data to Apple?

 

I'm just surprised it's not manifesting anywhere else, including the host.

 

Thread scheduling in the guest is very different in a VM than it is on the host. That is because unlike physical CPUs, virtual CPUs can be descheduled on the host.

 

As a result, VMs often exacerbate race conditions in kernels which are real bugs, but which hit much less often on physical machines.

HPReg Expert VMware Employees 470 posts since
Dec 22, 2004
Currently Being Moderated
10. Jan 12, 2010 8:14 AM in response to: dbrashear
Re: Snowleopard server guest panics randomly since 10.6.2 update?

I'm familiar with that panic. Same root cause, different panic message, same way to workaround it.

 

But I'm really confused now. We have only ever seen this panic in SMP VMs (VMs with multiple virtual CPUs). Are you 100% certain that your VM only has one virtual CPU? Can you send the .vmx file?

HPReg Expert VMware Employees 470 posts since
Dec 22, 2004
Currently Being Moderated
12. Jan 12, 2010 9:03 AM in response to: dbrashear
Re: Snowleopard server guest panics randomly since 10.6.2 update?

No problem. It happens.

 

If you have multiple virtual CPUs, the workaround of passing this as a kernel boot argument:

   slto_us=0xffffffff

will solve your problem then. It only works on 10.6 guests though (10.5 kernels do not recognize this option).

Bookmarked By (0)

Share This Page

Communities