Linux Timer Rate

Linux Timer Rate

Introduction

A hardware timer is used by modern systems for a variety of fine-grained  operations at the operating system level. VMware's virtualization  platforms virtualize this timer in the kernel. Because the virtual timer  provided to the VM is actually software, it is subject to the same  resource restrictions as other processes. The busier the system the more  the timer execution must contend with other hypervisor activities.  There are two implications of this:

  1. When the system is very busy, the software timer may not execute as regularly and virtual time may fall behind.
  2. Depending on how frequently the OS wishes to be interrupted by the timer, the hypervisor must do different amount of work.

From

VMware KB article #1420.

Linux guest operating systems keep time by counting timer  interrupts. Unpatched 2.4 and earlier kernels program the virtual system  timer to request clock interrupts at 100Hz (100 interrupts per second).  2.6 kernels, on the other hand, request interrupts at 1000Hz - ten  times as often. Some 2.4 kernels modified by distribution vendors to  contain 2.6 features also request 1000Hz interrupts, or in some cases,  interrupts at other rates, such as 512Hz.
Furthermore,  an SMP-capable Linux kernel requests additional timer interrupts from  the virtual local APIC timer. An SMP-capable kernel running on a one-CPU  system generates twice as many total timer interrupts as the  corresponding UP kernel, while such a kernel running on a two-CPU system  requests three times as many. In general, an SMP-capable kernel running  on <n> CPUs requests <n+1> times as many interrupts per  second as a UP kernel. For example, an unmodified 2.6 Linux kernel  running on a two-CPU virtual machine requests a total of 3000 clock  interrupts per second.
When a guest asks for more than  1000 clock interrupts per second, it can be difficult for the virtual  machine to keep up, especially if other applications are running on the  host at the same time. This can cause the clock in the guest operating  system to fall so far behind real time that it is unable to catch up.  The overhead of delivering so many virtual clock interrupts can also  hurt guest performance and increase host CPU consumption.

The amount of work required to manage the virtual timer is greatest with Red Hat Enterprise Linux

5 (RHEL5) SMP systems, which use a clock frequency of 1000 Hz and  suffer from a multiplicative amount of work due to SMP support. For  instance, the following table of timer interrupts was created on a 1000  Hz RHEL VM:

vCPU CountInterrupts/sec
26,000
420,000
872,000


So, the amount of work that needs to be done by the hypervisor increases  dramatically with the addition of vCPUs.  In addition, decreasing the  timer interrupt rate greatly decreases the work that needs to be done by  the VMkernel to virtualize the timer.  In RHEL 5.1, a Linux kernel that  enables reducing the timer rate was included. By adding the parameter  "divider=10" to the boot parameters, the amount of work required of the  VMkernel to virtualize the timer goes down by an order of magnitude.

Comments

Is this still a problem on ESX 3.5 UP1? The KB Article you listed only lists up to ESX 2.5 but I know we saw a problem like this in 3.0.2.

Well, I wouldn't say that it's ever a "problem". Operating systems need to be interrupted by their timer hardware. Unless we break the abstraction layer and let the guests access the underlying hardware, those interrupts will have to happen in software. It's an unavoidable fact of any virtualization platform.

Of course, when OSes are designed with specific features for virtualization, they can help the hypervisor to reduce the work. Most paravirt guests do precisely this. But again, that comes from the guest and not the hypervisor.

Despite the age of those articles, interrupt rates and timer virtualization for full virtualization have not changed since ESX 2.5.

Thank you for the information!

Scott -

KB #1420 gives me a message that I am not authorized to view it. Has the number been changed perhaps?

Dave Convery

VMware vExpert 2009

http://www.dailyhypervisor.com

http://twitter.com/dconvery

Careful. We don't want to learn from this.

Bill Watterson, "Calvin and Hobbes"

It looks like that KB has been deprecated and replaced by 1008284, 1005861, 1006072, 1006113, 1005953, 1006090, 1006086, 1006088, and 1005802. But I need to wade through these to find what I need to update this article.

Scott

More information on my communities blog and on Twitter:

http://communities.vmware.com/blogs/drummonds

http://twitter.com/drummonds

Thanks!

Dave Convery

VMware vExpert 2009

http://www.dailyhypervisor.com

http://twitter.com/dconvery

Careful. We don't want to learn from this.

Bill Watterson, "Calvin and Hobbes"

Version history
Revision #:
1 of 1
Last update:
‎03-14-2008 09:30 AM
Updated by: