VMware Communities > Blogs > VMware Communities Blog > 2007

Blog Posts

VMware Communities Blog : November 2007

Previous Next
0

November 30 Update

Posted by RDellimmagine VMware Moderator Nov 30, 2007

Forum stability and performance have continued again this week at acceptable levels most of the time.

We are still planning to implement clustering on the application, which will allow us to increase the amount of processing power to the application, increase the size of the caches, and make single-node restarts in the cluster transparent to users (since the other node will handle the traffic). All of this is expected to make noticeable improvements for visitors to VMware Communities. We will roll out the clustered implementation next week, i.e. the first week of December.

At the same time, we will also roll out the 2-column layout for VMware Communities that is described in VMware Communities Proposed Homepage Design along with community feedback in Homepage Design Proposal. The 2-column layout is meant to offer better navigation, better readability, and an appropriate emphasis on the community content.

0 Comments Permalink
0

November 26 Update

Posted by Badsah VMware Moderator Nov 26, 2007

Nothing new to report. We're still keeping an eye on performance, which appears to be good. If, however, you are still experiencing problems, please report it to us with specifics of which page(s) are consistently slow, and if there is any pattern (e.g. hours of the day).

Thanks.

Badsah

0 Comments Permalink
5

November 21 Update

Posted by Badsah VMware Moderator Nov 21, 2007


Last update before we Americans go stuff ourselves over the Thanksgiving holiday weekend: performance and stability appear to be fine for the fifth straight day. Given this, and a combination of VMware's scheduled maintenance windows plus holiday vacation schedules for critical team members, we have decided to postpone the deployment of a second node to our cluster until first week of December.

Thank you, and will be back online on Monday, November 26!

Badsah Mukherji, Sr. Community Manager, VMware

5 Comments Permalink
0

NOTICE: Due to the U.S. Thanksgiving holiday, the VMware's U.S. offices will be closed Thursday and Friday, November 22 and 23, 2007.

Well, we are cautiously optimistic that we've reached a period of stability on the site. We have enjoyed four plus days of continuous uptime, and the performance has been consistently reasonable. We are still targeting end of next week (last week of November) for adding the second server to the cluster, to enhance the performance.

If you still encounter sporadic error messages, reload/refresh the page and you will usually see the error disappear. If the problem persists, please bring it to our attention. Details such as timing, page information, and specifics of what you were doing at the time are helpful, as always.

Thank you.

Badsah Mukherji, Sr. Community Manager, VMware

0 Comments Permalink
0

November 15 Update

Posted by RDellimmagine VMware Moderator Nov 15, 2007

After running 27 hours with reasonable performance, we saw a slowdown again last night. We rebooted the application and still saw some issues. After a bit more investigation, we realized that the issues were related to the kernel timer, so we moved the application to different hardware, and voila, no more timing issues. As I write, the system has been up with reasonable performance for another 14 hours.

The timing issues would explain the viewed threads problem and other similar cache-related problems people have reported.

Given that performance is significantly improved, we are taking a more measured approach to implementing the cluster, so that we can be sure the configuration is bullet-proof before it is deployed. We are currently targeting the end of November for the clustered configuration. Also, we are continuing to work on feature fixes, and after we have seen a few more days of reasonable performance without kernel timing issues, then we will start focusing on getting those fixes rolled out to the site.

0 Comments Permalink
0

November 14 Update

Posted by RDellimmagine VMware Moderator Nov 14, 2007

Last night we did an upgrade on the kernel to update the system clock to a newer version. This change is required before we add a second node to the cluster. The result of the update is that we have seen system performance improve significantly in the 18 hours since it was implemented. Users have reported performance improvements as well. If you have observations to share, please do: Performance?.

In addition to (and after) the system clock update, we also added in a second node to the cluster, but rolled it back because of some configuration issues. We are working actively to resolve those configuration issues so that we can get the second node running in the cluster.

Some users have reported errors in read tracking and private message notifications, and we believe this to be related to some work on the second node. We are monitoring it closely, and believe these errors to be in the presentation of the data, not in the data itself.

0 Comments Permalink
0

November 12 Update

Posted by RDellimmagine VMware Moderator Nov 12, 2007

We configured a load balancer this afternoon during the low-traffic window in preparation for implementing a clustered configuration of the community application. Behind the load balancer is still a single node, so visitors won't see any changes in performance or stability, but the next step is to implement the second node in the cluster, which will bring improvements. Meanwhile, there is a known issue that causes an occasional error when viewing threads. The error resolves itself when the page is refreshed, and it will be fixed when the cluster is complete.

0 Comments Permalink
0

November 8 Update

Posted by RDellimmagine VMware Moderator Nov 8, 2007

Performance and stability continue to be much improved.

Meanwhile, we are investigating 3 areas:

  • 1. some compatibility errors we're seeing related to the JDK version and Jive's caching engine;
  • 2. moving to a more accurate system clock to avoid some bottlenecks waiting on a new time;
  • 3. we are accelerating our move to a clustered configuration.

I'll post again as we define the precise action plan around these.

0 Comments Permalink
1

November 7 Update

Posted by RDellimmagine VMware Moderator Nov 7, 2007

Yesterday morning, we found and fixed an error in the way the site handled web crawlers, which was causing threads to block. We also did additional tuning of cache sizes. These changes resulted in a significant increase in stability and performance: the site remained up for 22 hours with much better performance than we had seen.

We continue to monitor the site and are investigating the cause of this morning's restarts, but it appears that we have turned a corner, given site visitors are also reporting improved performance. We continue to investigate performance, as we believe page load times can be further improved. The main area of work, as I wrote earlier, is to cluster the application server, which will allow us to increase overall cache size and take advantage of load balancing. This will result in faster performance and increased flight time of the application.

1 Comments Permalink
0

November 5 Update

Posted by RDellimmagine VMware Moderator Nov 5, 2007

Today we increased the heap size of the application server and then increased cache sizes in the application to increase their effectiveness. We also rolled out a patch to the AuthFactory that fixes a locking problem in the status level calculator and removes the root cause of one class of system slowdowns.

The result of these fixes, along with the switch to JDK 1.5 last Friday, is that we are seeing improved stability and overall lower cpu usage. However, we are still seeing periodic performance slowdowns that we continue to monitor closely.

We are actively working to implement a clustered solution for the application server, which will allow for larger caches and therefore fewer hits against the backend database. Meanwhile, we have decided to hold off for a few days on the feature enhancements we discussed last week, so that we can focus fully on the app server cluster.

0 Comments Permalink
0

November 3 Update

Posted by RDellimmagine VMware Moderator Nov 3, 2007

On November 2, we rolled back the November 1 update because it introduced errors that we are still investigating.

However, we kept or re-implemented the following configuration changes:

  • Changed JDK version: We changed from JDK1.6 to JDK1.5 for better compatibility with the caching engine used by the community platform.

  • Increased Heap & Cache Sizes: We increased the Tomcat heap size and the community platform's message cache size for better message throughput.

  • Database Connections: We increased the number of database connections by 50% because peak traffic occasionally maxed the number of database connections.

  • APIC Setting: We disabled a power management setting that was causing timer-related problems.

Next week, we will work to re-implement the usability features we rolled back, including the AuthFactory fix that will remove one class of issues resulting in system slowdowns.

0 Comments Permalink
0

November 1 Fixes

Posted by RDellimmagine VMware Moderator Nov 1, 2007

Stability


Tonight we rolled out the following stability / performance changes:

  • Database Connections: We increased the number of database connections by 50% because peak traffic occasionally maxed the number of database connections.

  • APIC Setting: We disabled a power management setting that was causing timer-related problems.

  • AuthFactory fix: We implemented a fix that allows us to avoid locking in the status level calculator. This removes the root cause of one relatively infrequent class of system slowdowns.

While these fixes help, we are actively working on some additional stability / performance fixes that we will roll out next week.

Features


We rolled out the feature fixes described in yesterday's blog post:

  • Filter searches by user: There's an additional field in Advanced Search where you can specify an individual user so that search returns matching results from that user only.

  • Search returns individual messages, not threads: Search returns a result for each thread (instead of for each message) matching the search term.

  • Email notification of private messages: There is a new preferences that allows you to specify whether you get sent an email when you receive a private message. This works if your email is displayed or hidden.

  • Email monitor improved: The email monitor now catches (and discards) a wider range of out-of-office replies.

  • Edit Email in Profile: You can now edit your email address in your profile.

And also a couple of additional fixes:

  • Status Level Legend: A status level legend showing the points for each status level now displays on the people page.

  • Documents-only Communities: They now show the documents tab, e.g. Community Sample Code -- before they were erroneously showing a discussions tab even though discussions were disabled.

0 Comments Permalink

VMware Communities Blog

Status updates and the behind-the-scenes story of VMware Communities