VMware Cloud Community
Qwerty256
Contributor
Contributor
Jump to solution

Performence issue, and high CPU Ready

Hello,

On one (the most used) VM i have performence issue and very big load average, but there is nothing worng about the cpu from ESX side, just one thing i spot is READY% is pretty big, and the problems usually are when READY is close to USAGE, but the machine has a lot left CPU, so could anybody tell me what can be wrong with my configuration ?

esx cpu.png

0 Kudos
1 Solution

Accepted Solutions
rickardnobel
Champion
Champion
Jump to solution

Qwerty256 wrote:

did the same, result in screenshot, pretty better isnt it ?

So it was indeed the NUMA that caused it. Nice that you got your RDY back to normal!

My VMware blog: www.rickardnobel.se

View solution in original post

0 Kudos
15 Replies
Sreec
VMware Employee
VMware Employee
Jump to solution

Hi,

    Overprovisioning of vCPUs will cause high CPU ready time.Please follow the steps in KB:http://kb.vmware.com/kb/1005362and do let me know the result.

Cheers,
Sree | VCIX-5X| VCAP-5X| VExpert 7x|Cisco Certified Specialist
Please KUDO helpful posts and mark the thread as solved if answered
0 Kudos
Qwerty256
Contributor
Contributor
Jump to solution

%CSTP
  0.00
  1.37
  1.33
  0.39
  0.09
  0.40
  1.38
  0.38
  0.00
  0.00

No issue Smiley Happy

0 Kudos
Sreec
VMware Employee
VMware Employee
Jump to solution

Hi,

    When you captured this O/P did you had performance issue?You have to check the esxtop exactly when the issue is happening.

Cheers,
Sree | VCIX-5X| VCAP-5X| VExpert 7x|Cisco Certified Specialist
Please KUDO helpful posts and mark the thread as solved if answered
0 Kudos
Qwerty256
Contributor
Contributor
Jump to solution

Yes i didnt have big issue then, the performence is slower anyway, but anyway will check when the issu with server occures, thanks

0 Kudos
rickardnobel
Champion
Champion
Jump to solution

Could you also check %MLMTD while having high %RDY?

My VMware blog: www.rickardnobel.se
0 Kudos
Qwerty256
Contributor
Contributor
Jump to solution

%MLMTD  is 0 all the time, i will do printscreen when will have issue with performence

0 Kudos
Qwerty256
Contributor
Contributor
Jump to solution

extop when issue

0 Kudos
Qwerty256
Contributor
Contributor
Jump to solution

Also this is problem and i don't know why,

I have plenty of free resources, there is no machine which have reservation, and i cannot give 20 000 Mhz to any VM,

10 000mhz is possible. IMHO should be possible at least 150 000 Mhz ... Am i doing anything wrong ?

0 Kudos
rickardnobel
Champion
Champion
Jump to solution

Could you post a screenshot of the m (memory) screen? Use "V" to clear all other instances than the VMs.

Could you also post another screenshot of the memory, but with this customized:

f - to select fields,

remove: B, K, L and O.

add: G = NUMA STATS

Make sure the N%L is visible and then post the screenshot.

My VMware blog: www.rickardnobel.se
0 Kudos
Qwerty256
Contributor
Contributor
Jump to solution

In attachment

0 Kudos
Sreec
VMware Employee
VMware Employee
Jump to solution

Hi ,

      Looking at the attached esxtop screen i can see rdy% value has exceeded the threshold(Threshold value is 10).Please follow steps in KB::http://kb.vmware.com/kb/1005362

Cheers,
Sree | VCIX-5X| VCAP-5X| VExpert 7x|Cisco Certified Specialist
Please KUDO helpful posts and mark the thread as solved if answered
0 Kudos
rickardnobel
Champion
Champion
Jump to solution

From what I see most of your VMs are 8 vCPU and 32 GB vRAM, which is of course quite large. However, your NUMA nodes are 10 cores and 64 GB so they should fit good, and the only VM with somewhat less good NUMA locality is the CentOS 5.5-LB, but it should not really explain why all VMs experience high RDY time.

My VMware blog: www.rickardnobel.se
0 Kudos
Qwerty256
Contributor
Contributor
Jump to solution

http://kb.vmware.com/kb/1005362
On the CPU screen, check the %CSTP  value. If this number is higher than 100, the performance issues may be  caused by the vCPU count. Try lowering the vCPU count of the virtual  machine by 1.

it's not higher than 100

0 Kudos
Qwerty256
Contributor
Contributor
Jump to solution

I found in that thread:

http://communities.vmware.com/thread/391284

I had the same problem. A temporary fix is to set "Numa.RebalanceEnable" to 0

esxi-host configuration, Software, Advanced Settings, Numa, Numa.RebalanceEnable = 0

Supermicro Opteron 61xx and 62xx hosts.

When I did this the cpu load balanced evenly between the cores and the ready times dropped to 0.XX as was expected.

did the same, result in screenshot, pretty better isnt it ?
0 Kudos
rickardnobel
Champion
Champion
Jump to solution

Qwerty256 wrote:

did the same, result in screenshot, pretty better isnt it ?

So it was indeed the NUMA that caused it. Nice that you got your RDY back to normal!

My VMware blog: www.rickardnobel.se
0 Kudos