Skip navigation
VMware

Virtual thoughts

11 Posts
0

Better than the real thing

Posted by larstr Feb 20, 2010

Virtually anything is impossible

A lot of people seem to have a subconscious opinion regarding the word “virtual”. It is used in many different contexts and it’s often a word with a bit of negative load. Probably because the “real thing” is so much better. We have virtual memory, virtual reality, virtual private networks, virtual tour, virtual particle, virtual world, virtual sex, virtual machine, virtual artifact, virtual circuit, etc… This listing could be much longer, but you get the idea.. Most of these virtual entities are not considered equally good as their physical counterparts.

 

People who are not very familiar with the current state of virtualization and cpu technologies may have heard that virtual machines are not equally good as physical ones and that they may only be used for “low hanging fruit”  or maybe just testing and development. As a consultant I meet people from time to another who have systems that are so special that they will not consider running them inside virtual machines. They are concerned about the performance. Some are concerned about security. Or  availability. Latency. Stability. Yes, there’s an endless list of things that people worry about. An ISV told me that their application suite wouldn’t work at all in a virtualized environment.

 

Lets have a closer look at the current state of these issues with the latest technology:

 

Performance & Latency

Project VRC revealed in their newly released report that by virtualizing a 32 bit XenApp farm on vSphere 4.0 with a vMMU (RVI/EPT) capable cpu you would get almost twice as many users on the system than they did last year (with the software and hardware available back then).

 

VMware has also shown that with an extra powerful storage system (3xCX4-960 with solid state drives) you could achieve over 350 000 random io/s from a single ESX host with only three virtual machines running. Latency was below 2 ms.  Even though such a storage setup is highly unusual amongst today’s data centers it shows that the hypervisor is not a limiting factor.

 

These things still doesn't mean that you should avoid tuning your application load if needed.

 

Security

It’s not a secret that a large part of the technology that VMware is utilizing is derived from research done at Stanford University by VMware fonder Mendel Rosenblum and his associates. A few years ago there was a research paper released titled "A Virtual Machine Introspection Based Architecture for Intrusion Detection”. This technology is present today as an API for third party vendors to integrate into. It allows for monitoring of cpu, memory, disk and network. Yes, you can monitor all of the core four with this API that is known as VMsafe.

 

Several of the established security vendors such as (EMC) RSA, CheckPoint,(IBM) ISS, Trend Micro and others have partnered up with VMware and are delivering security solutions that support parts or all of this stack. This means that you can have all your VMs firewalled even if they live on the same subnet in the same VLAN. It also means that you can detect if any malware is infecting a VM even if it has no antimalware agent installed. To have VMs on the same subnet protected by a switching firewall in the physical world is also possible if you’re using Crossbeam or similar, but these devices are not known to be cheap.

 

A quick diagram from one of these products quickly shows how such a setup can help secure an environment:

 

These kind of security protections are better than the real thing. In the physical world you have no way of looking into the memory or cpu of a computer without installing an agent inside the operating system.

 

Even without a third party product you can get better protection than physical. If you have a DMZ network with several hosts, these can normally access each other through the network (not protected from each other) so the solution is often to have many VLANs separating these services making the setup more complex to manage. With VMware’s distributed switches you can establish private VLANs in  “Isolated mode”. This means that the VMs on the same Isolated PVLAN will only be able to communicate with hosts on the non-local network. The neighboring DMZ server in the same subnet will be invisible and inaccessible. These things make the virtualized networking better than the real thing.

 

Stability & Availability

VMware’s hypervisor has been here for almost 10 years now. ESX is running a relatively small kernel (vmkernel) that is known to be “dead stable”. The only times I’ve seen it have problems is if there’s a bad hw component.

 

In addition to this, VMware will load balance your workload across all of your hosts with DRS. With FT VMware will run a single load on two separate hosts in case of hw fault. With VMware HA it will start the VMs that were running on the dead host at other hosts in your cluster. With VMware DRS it will take care of your whole infrastructure in case of a total datacenter crash. All this without configuring anything special in any of the VMs.  Traditional clustering typically has a quite large administrative overhead, is complex to setup and (almost) impossible to test. With SRM you can even test your DR plan while the rest of your environment is up and running.

 

Conclusion

All of this makes a virtualized environment much better than a physical environment. Does this mean you should virtualize 100% of your workload? No, there are still a few exceptions, but 97% of the systems can typically be virtualized without any issues. Existing customers are not migrating their critical systems to VMware despite of virtualization.  They are running their critical systems on VMware because of the extra benefits virtualization is giving their infrastructure regarding HA, DR, security and management.

343 Views 0 Comments Permalink Tags: performance, esx, drs, ha, srm, checkpoint, vsphere, trend_micro, ibm_iss, emc_rsa, xerox, low_hanging_fruit
0

vSphere troubleshooting class

Posted by larstr Feb 20, 2010

This week I attended the “VMware vSphere: Troubleshooting” class. It’s a four day class that gives you the right to certify for VCP similar to a few other classes. This is a fairly new class and this was the first class in the Nordic region. Our instructor was not dutch, even though his last name was Blom. He was from Sweden and was fairly well informed about the topics of this class and VMware in general.

 

The troubleshooting class sounded like an interesting class. I have never attended a VMware class before, but since I have been using VMware products for quite a while this class sounded like the most reasonable one to take.

This class had the following modules:

-     ESXi Command Line Troubleshooting

-     ESX/ESXi & vCenter Server log files

-     Network troubleshooting

-     Management troubleshooting

-     Storage troubleshooting

-     VMotion troubleshooting

-     VMware Infrastructure troubleshooting

-       vSphere 4 DRS Cluster Troubleshooting

Note that this class is not about performance troubleshooting. It’s only about troubleshooting configuration problems. A performance class is coming up soon from VMware educational services.

 

This class was around 70% labs and 30% lecture time. Most of the labs were performed as two-student teams. Most  labs involved configuring something, either from the command line or from the vSphere client, and later a configuration issue was being introduced to our environment (by the instructor) that we should try to resolve.

 

This class was a good look into managing your infrastructure from vMA and we learned how it differed from the ESX console and the unsupported ESXi console. VMware’s educational labs were used and compared to the labs I attended at VMworld 2009, these actually worked.

 

A few of the labs were a bit too repetitive and the scripts run by the instructor to bring problems into our configuration didn’t always work as intended. Most of the problems introduced did also IMHO not reflect problems you see in most real life environments. Since you were not allowed to watch the “Tasks and events” when troubleshooting a problem it made the troubleshooting a bit strange for some issues (you couldn’t monitor the progress of your VMotion if VMotion was the issue). It was still a very valid reason we couldn’t watch the logs when problems were introduced as it would have made it too easy to solve the issues, but it still made the troubleshooting situation a bit weird compared to real life.

 

The class was nice because of all the labs and for me it was especially useful because it made me get more familiar with vCLI which is required for supported command line management of ESXi. As we all know : ESXi is the future and the future is now.

518 Views 0 Comments Permalink Tags: esxi, vmkernel, esxcfg, vcli, vma, dvs, wireshark, vicfg, unsupported_console
0

To ESXi and beyond

Posted by larstr Jan 22, 2010

Like I also blogged about in a previous blog posting, the Service Console of ESX is being discontinued and that leaves us with ESXi as the only option for the future.  vMA will take the Service Console’s role for command line stuff and you only need one of these for managing many servers.

 

First I’d like to give a shortened history lesson so we can better understand where we are today and where we’re heading. I didn’t include every new feature here, but only those I regard as the most important ones regarding the evolution from tightly bound systems to stateless ones.

 

VMware have provided technologies that for each iteration has had a greater level of statelessness than the previous version.  Version 1.x of ESX could run one VM per physical cpu. When ESX 2.0 was released in 2003 you could no longer count the cpus to get the number of VMs anymore as this limit was removed. Instead of binding each VM to a cpu it would load balance all of the running VMs across all available cpus, and it could now also run multiple VMs per cpu. After the introduction of Virtual Center in 2004, data center admins no longer needed to know which physical server a VM is running on. Virtual Center gave you the ability to administer all your ESX hosts from a single interface and with VMotion you could move your VMs from physical host A to B without any service interruption.

 

ESX 3 was released in 2006 and brought in new important features such as a Distributed Resource Scheduler (DRS) and Distributed Availability Services (VMware HA). These new features would ensure that the performance of your environment was optimal at any time and that all of your VMs was running, even if a host died or if a VM or two would start eating up a lot of resources.

 

ESX 3.5 was released late in 2007 and came with the ability to migrate VMs between different cpu models (EVC), the ability to change datastore for your VMs while running and a software update utility for hosts and guest VMs. ESXi (or 3i as it was called back then) was also released at the same time and could do everything that a full blown ESX host could do, except for the missing Service Console.

 

In 2008 we had the release of Site Recovery Manager (SRM). Having a recovery site (a datacenter at a remote location that will host all your services in case of a disaster) is something that has existed in the physical world for quite some time, but there has not been a unified solution that would work equally for all kinds of services. Virtualization is a game changer here as it uncouples the servers from their physical hardware and with SRM you could also bring up a copy of your main datacenter at the disaster site within minutes by the click of a single button.

 

With vSphere 4.0 they also introduced host profiles and distributed switches for making it easier to deploy the same configuration across multiple ESX hosts. SRM was also updated to support two active data centers.

 

VMware demoed booting of ESXi from the network back at VMworld 2009, and it is surely something that will be coming as an official feature from VMware in the future. By doing this you get the same advantages that you get with your VMs today as you’re unbinding the systems from their hw. It also lowers your deployment time (of a new host) as you don’t have to install anything. We also have a similar technology from VMware already with VMware View where all users boot from the same disk image (linked clones). The users may also already be using stateless thin clients to access these desktops.

 

A couple of years ago I showed some end users from the company I worked for around in the data center and they were wondering what server “their” system was running on. My answer was that I didn’t know. All I knew was that their system was running on one of the ESX servers, but it could change server without anyone knowing it and without anyone noticing it. I also told them that there were around 80 virtual servers running on these 4 physical boxes and that it would have taken two full racks of pizza box servers if we had not been running it like that.

 

As we all know ESX is an acronym for Elastic Sky X. I don’t know if they were having cloud computing in mind when they invented that name, but it surely seems to be a name that fits the future. Now that traditional ESX is going away we’re left with ESXi which gives us Elastic Sky XI. XI in roman numerals equals 11, something I take as a hint to that 2011 is the year ESXi can help use move VMs into the sky.

 

So I guess when I show around end users again in the data center in a year or two, my answer would be “your system may be running on these boxes here, or on one of the similar ones  in our secondary data center, but if it is running some heavy transactions it may also have been temporarily migrated to a more powerful system out somewhere on the internet”.

310 Views 0 Comments Permalink Tags: virtual, esx, center, esxi, vcenter, vsphere, cloud
0

I’ve had feedback from each of the vendors that I’ve blogged about so it's probably time to update the status on these topics

 

NetApp: NFS vs FC

http://communities.vmware.com/blogs/larstr/2009/02/06/nfs-vs-fc

 

The first vendor I  blogged about was NetApp and some material that was part of their ASAP training. I had done some testing using NetApp storage back at VMworld 2006 that gave results that didn’t match those slides from the training. When these slides came up during training I told the trainer that I thought these slides could not be valid and told him about the testing I had done myself a few years back. He couldn’t comment any further than that these slides where part of the official NetApp course.

 

A few months later I received an email from Mike Shea (@NetApp) who had read my blog and informed me that “the slides from NetApp is factual, but it is old and very out of date.”.

 

VMware also released a paper during the summer confirming that FC is still king of the hill on storage performance.

http://www.vmware.com/files/pdf/perf_vsphere_storage_protocols.pdf

 

Even though FC gives best performance I still prefer to use NFS for my customers with NetApp storage due to the extra benefits you get from using it.

 

I don’t know if the ASAP course still is teaching about superior NFS performance. Maybe someone who has taken it more recently would care to share their knowledge?

 

CheckPoint: Riding the virtualization wave

http://communities.vmware.com/blogs/larstr/2009/06/17/riding-the-virtualization-wave

 

The second blog posting was about CheckPoint’s virtual firewall solution for VMware that had insane pricing and licensing. A few days after writing this post I also discussed this issue with a local CheckPoint SE who wasn’t aware of this issue, but he agreed with me that this didn’t look right. He checked my findings and he could also only find the same information about this as I did. All he could offer me was special pricing if I showed him a customer case, but I told him the point here was not special pricing for a single case.

 

1.5 half monts later there was a CheckPoint sales presentation at my office. They where talking about upcoming products and here also the virtual firewall appliance (“Virtual Edition”) was mentioned. The upcoming version was supposed to be much better than the previous one as it was using VMsafe and it would be a specialized solution for VMware environments. I raised my hand and asked if they were planning to fix their broken licensing model too.They were not aware of the licensing model, but since this was a non-released product licensing wasn’t necessarily 100% nailed yet

 

I sent them an email with a link to my blog posting and it reached the people within CheckPoint responsible for this product and they wanted a conference call with us. In the end they came visiting the local CheckPoint office for a meeting about this.

 

At VMworld I also visited the CheckPoint booth for a demo of the upcoming version and it surely looks nice.

 

We still haven’t seen the final results of this, but I’d be very surprised if their licensing model for VE remains unchanged.

 

Trend Micro: My view on the new TrendMicro Smart Scan Server Virtual Appliance

http://communities.vmware.com/blogs/larstr/2009/06/26/my-view-on-the-new-trendmicro-smart-scan-server-virtual-appliance

 

The third vendor I blogged about was TrendMicro who had just released a series of virtual appliances where I did a peek into one of them during a customer install. TrendMicro was the ones I had the quickest feedback from.  Within an hour  I got feedback on twitter from a Trend Micro employee stating: ”Thank you sir, I have already passed your suggestions on to the dev team.”

 

http://twitter.com/rik_ferguson/status/2344954320

 

During VMworld in San Francisco I also talked to a Trend Micro marketing guy (@Solutions Exchange) from whom I also learned that they would now start using .ovf files instead of .iso files for distributing their virtual appliances (great move). If they will also start having VMware Tools preinstalled remains to be seen.

698 Views 0 Comments Permalink Tags: performance, virtual, nfs, netapp, appliance, fc, ovf, checkpoint, trendmicro, asap, vpn1-ve
0

I needed to run a demo for some customers, but our lab environment is going through a relocation process to a new datacenter so I had issues with both VLANs and SAN connectivity that wasn't fully available as needed.

 

Instead of bringing a noisy server to the event I could have installed ESX inside VMware Workstation (or Player).

 

Instead I decided to try to install ESX 4.0U1 on a Lenovo Thinkpad T500 (dual core 2.8 GHz,, 4GB ram). It comes with an Intel based nic and Intel based SATA controller so I thought it should have a fair chance on being able to run ESX.

 

 

I booted the ESX cd and the installation process would only run in text mode, but that's not a big deal. After entering text mode I answered all the questions the same way as if it was a real server and I did not need to load any extra drivers.

 

After finishing the installation steps I successfully started up ESX, configured it, and setup a few VMs. It worked out quite well. The io is quite limited when running your VMs from a single SATA disk, but it can easily run a handful (low load) VMs. As always, the cpu is not the limiting factor: RAM and storage performance are, and I did get way high disk latency while doing installation on multiple VMs simultaneously (40-50ms). Other than that, the VMs were running quite well once installation was finished.

 

The system was running very stable, but I had a few PSODs during shutdown of ESX. I didn't look more deeply into these (during ESX shutdown) PSODs as this system is a 100% unsupported config.

640 Views 0 Comments Permalink Tags: esx, ibm, thinkpad, lenovo, t500, 4.0u1
0

The wonderful Service Console

Posted by larstr Oct 25, 2009

When I was first introduced to the Service Console back in the ESX 2.0 days (in 2003) I was delighted with what VMware had done. The Service Console is a linux (sort of) VM which is what you see when you boot up an ESX server. The environment you see is a linux environment, but the hardware seen from this environment is not the full blown one as it is running as a virtual machine with some special privileges that normal VMs don't have. This means that you can also access and configure the VMware (vmkernel) environment.  The MUI (management user interface) was also nice, but everything you saw of statistics in the MUI was derived from stats that you could find within the proc nodes of the Console OS, which was the name it was known by back then. It made sense that a GUI wasn't always as correct as the “real deal” where the raw numbers were sitting, so if you were in doubt of anything you could always double check it if you knew where to look. If you knew some basic unix scripting you could also in a short amount of time write a script that monitored what you where looking for.

 

 

When ESX 3.0 was introduced things had changed a bit. The Service Console still looked the same at first glance, but when you had a closer look you could tell that things had changed beyond the change of Linux version. These values that you could look at where still there, but they didn't make much sense. It turned out that /proc/vmware wasn't the main source of statistics anymore, as VMware had created their own stats interface called VSI(VMware SysInfo). The proc nodes within 3.0 was either a leftover from 2.0 that wasn't complete for 3.0 or it was a half done conversion interface from VSI. I don't know which one, but surely enough you couldn't rely on the proc nodes for anything anymore even though it was there.

 

The reasons for abandoning the proc nodes may have been valid. The proc nodes is where all linux distros today and many other unix variants (aix, solaris, etc) present their performance numbers through. The resolution of stats within the proc nodes on linux is always 100Hz even though the system timer could be higher. The stats within the kernel could be of a higher resolution, but the numbers presented to applications are always 100ms accurate as per USER_HZ. The real system timer rate is not accessible to user applications because it should not be necessary for them to know the real timer rate (there exists patches that overrides this). The reason for this is compatibility reasons with normal linux tools such as ps, top, uptime, etc. This means that the accuracy of stats were only as +accurate as 100ms while the accuracy in the stats in the VSI interface was 1000ms. Wouldn't normally mean a lot on most stats, but more accurate graphs are of course for the win.

 

 

The main console tool for monitoring ESX performance is esxtop. Esxtop in version 3.x looked quite similar to the one in 2.x, but it had been rewritten from a different project than before. In esx 2.x, esxtop was derived from GNU code, while in 3.x it was now derived from BSD code. VMware had also extended it quite a bit since 2.x with various stats that they retrieved from VSI. The best tools for dumping values from VSI was however esxcfg-info that dumped all available stats in a given category. In 3.5 you could also choose the preferred formatting of the dumped stats if you wanted them in xml or perl table formats.

 

When troubleshooting performance issues on ESX you would normally start with the stats in viclient to get an overview, then look at esxtop, and for further details you could use esxcfg-info and vscsiStats for more detailed information. esxcfg-info and vscsiStats where new tools introduced to the Service Console in ESX 3.0 and 3.5.

 

 

There was never much documentationof the statistics of neither the values in the proc nodes or in esxcfg-info, but it was possible to  interpret what they were by comparing the stats you were looking for to the ones shown in esxtop and in the viclient. There was eventually some documentation provided by vmware on the esxtop stats (by performance guru Scott Drummonds).

 

The announcement of the death of the Service Console @ vmworld 2007 has been received with mixed feelings. In ESX 3 and newer you can do everything you need from the GUI and may never need to know that there's a Service Console available. They also introduced a console less hypervisor (ESXi) we can all see the advantages it brings. As the Service Console is a Redhat based linux install, it needs additional  patching which is in reality unrelated to VMware's core business of virtualization. By having a thin hypervisor of the ESX hosts you get a smaller attack surface and it should in theory also give you slightly better performance as you don't have the additional load of an extra linux VM running (even though it's fairly lightly loaded unless you've filled it with agents).

 

ESXi (or ESX 3i as it was known as upon announcement) was introduced to be the future successor of normal ESX, but without a Service Console. In ESX3 you can do everything that you need to configure and manage your ESX environment from the GUI (viclient), but troubleshooting issues is often easier done from the command line. They couldn't just stop shipping normal ESX with a console right away since many third party vendors have software agents that depend on the Service Console.

 

 

To provide command line access similar to the Service Console, VMware initially introduced the Remote CLI appliance. It was based on Debian 3.1 and wasn't that bad of an idea. It provided you a command line interface where you could run resxtop and similar commands to get statistics from an ESX host and you could also to a fair bit of other management related tasks. It did include a version of esxtop, but both esxcfg-info and vscsiStats were missing.  It also came with all of the developer tools a console oriented linux programmer needed to interface with a VMware environment. VMware also had their own debian repository available for this appliance so you could add the packages you needed similar to a normal debian distro. In the original Service Console, there was no online software repository, but the ESX install CD provided extra packages that you could install.

 

The Remote CLI appliance's software repository was suddenly gone and VMware introduced another appliance that had similar capabilities. The VIMA (Virtual Infrastructure Management Assistant). It was an appliance that included the same remote cli command set, but was still different. It did not include any development tools, it included a method of setting up a trust between the ESX servers and this appliance so that you didn't need to authenticate for each command (vi-fastpass). It also didn't have the ability to add any packages, but was based on RHEL 5.2 so you could take packages from RHEL risking breaking any support from VMware if you should need to call them.

 

In ESX4 this appliance has a new version and also a new name. It's now known as vMA 4.0, vSphere Management Assistant and is basicly the same appliance as VIMA 1.0, but with updated packages and you now also have the ability to setup a trust to the vCenter server in order to run commands on all of your ESX servers instead of setting up trusts to each ESX server in your environment. It still lacks both esxcfg-info and vscsiStats for deep level performance troubleshooting.

 

ESX4 still has the vmware proc nodes present, and whether they are a reliable source of information or not I'm not 100% sure. But I doubt they have done much with it since no software relies on it anymore and the core info lives in the VSI.

 

A new (official) feature that was introduced with ESX4 is thin provisioning. Thin provisioned disks only store the blocks that have data on them and is a good way of saving data while utilizing your storage subsystems better (not always performance wise, but atleast data wise). One should however be aware that when viewing a thin provisioned disk from the local Service Console, it will always show the full size of the disk, even when a fraction of the disk is allocated physically. To get the correct disk usage of this VM you will have to use the viclient (GUI) so things are now beginning to be the other way around from what we were used to in the “old days”. I don't know what other features that comes out wrongly in the console nowadays that are correct in viclient, but wouldn't be surprised if this wasn't the only one.

 

 

The death of the Service Console is indeed coming up. VMware have stated that it will die with the next release of ESX. I'm not sure if that means ESX4.5 or 5.0, but it surely will die soon enough and we already see that the focus is shifting in that direction. I hope that when that day comes, all of the functionality we are using today will be available through supported means on the new platform.

1,744 Views 0 Comments Permalink Tags: rhel, console, redhat, cos, service, esxtop, proc, vscsistats, discontinue, jiffies, nodes, esxcfg-info
0

During the past year we've seen more and more of the old established software vendors starting to support VMware. Either by allowing their software to run on VMware or by offering solutions directly as virtual appliances.

 

I recently attended a presentation by Trend Micro where they told us that they were now offering a virtual OfficeScan appliance for VMware environments and that you would by using this appliance would get very good performance. Much better than you would if you ran it physically on a traditional windows server.

 

I think it's amazing that such products are finally targetting virtual environments. It's something I've been waiting for for many years, and now finally, things are happening!

 

Today I installed this appliance to see what it's good for. It's shipped as an iso file and installation is very quick. I think the initial install took about 10 minutes if not less.

 

 

When you look a bit under the hood of this VM you can tell it's a modified 64 bit RHEL 5.2 VM. Trend Micro is apparently also catching on to the cloud terminology:

bash-3.2# cat /etc/redhat-release

Cloud Scan Service release 3 (Final)

 

RHEL is a great OS and it is the base for many vendors (including the VMware Service Console). One should however be aware that RHEL does not support VMI (paravirt_ops) because it's using a too old kernel (and they haven't backported this functionality like SUSE did).

 

VMware has an old Best Practices document that recommends that a virtual appliance is shipped with easy configuration through graphical or web interfaces. This appliance is following these steps with a graphical installer and a web interface that is available after installation is done. The best practices also states:

 

"It is also important that your virtual appliance includes VMware Tools. VMware Tools provides optimized drivers for VMware virtual hardware and management tools that can monitor and manage the virtual appliance with VMware VirtualCenter."

 

VMware Tools was not installed in this VM per default, but as it was based on RHEL 5.2 it should be fairly easy to install.

 

I did however need to change a few things to get it to mount the cdrom:

1. I added the following line to /etc/fstab

/dev/hda /media/cdrom udf,iso9660 user,noauto 0 0

2. I created the directory /media/cdrom

 

Now I could (After having chosen Install VMware Tools from the menu) mount the virtual cdrom containing the tools with the commandmount /dev/cdrom

 

Installing VMware Tools was now a breeze. I accepted all defaults and they installed successfully.

 

One other thing I found was that this VM wasn't following VMware's best practices on Timekeeping. VMware's kb article recommend you to add the parameters notsc divider=10

 

By adding those parameters to the boot image line in /boot/grub/menu.lst it will not only give you better timekeeping, but it will also give you better performance.

 

By doing these very few and easy steps I suddenly have a VM that is much better performing than it was out of the box. I hope also the software vendors will realize this soon so also the average admin can get optimal performance without having to read this blog posting first.

 

Lars

1,869 Views 0 Comments Permalink
1

In 1996 I attended some basic training and did my first firewall installation. During the following year I did a few more and every time I had a question I called other people in my company or the distributor. In 1998 I discovered that this product had a mailing list. I joined it and it opened a new world for me regarding known problems and solutions. There were a few very knowledgeable people on the list who seemed to know most of the common questions. These users where consultants or end users of the product who out of their own curiosity by the product learned the ins and outs of the product, offering free support for other peer users. There were also members of the list working for the vendor itself, but they would normally post from their personal accounts. Sometimes we would however see official postings or responses from the vendor.

 

After joining this mailing list and being there for a while I understood the product much better and the quality of my work also became much better. I could also more easily discuss different questions with my customers as the questions they had was often related to something I had discussed before on the list.

 

This was my first involvement in a mailing list, but I had used news groups (nntp) for years, and before that I had used similar discussion groups on different BBS networks (over modem/pots). Even though comp.sys.firewalls was nice, it’s user mass wasn’t product specific enough to give you the “little extra” of info that the vendors mailing list provided.

 

 

Having a mailing list where anyone can post their questions is something I regard as a very good thing, but it has a few limitations:

 

 

 

  • Threads are not implemented equally across all email clients

  • Searchability – Searching emails will return single postings, not the full thread

  • History – Hard to refer to a previous thread with a link

  • Formatting – Each email program will display emails differently so text only is often the only common denominator

  • No user profiles or stats per user

  • Spam

  • RSS feed?

 

I was introduced to VMware ESX Server 1.5 in 2002, but the project that I was part of waited for version 2.0 that was shipped in September 2003 before it was implemented. Installation was done by the HW vendor and I was thrilled by this new technology. First of all, the Console OS was based on RedHat 7.2 which was something I knew from before. Secondly, there were active news groups for this product that were fairly active. I think I posted my first posting on the VMware news groups in mid-September 2003.

 

A bit later, VMware announced on the news groups that they had started a web based forum. At first I didn’t really see the point why they had done that since the news groups were working so perfectly. I thought it was something that would just go away so I refused to try it. Why use a web browser when you could use your favorite news reader (tin)? A web based forum was likely to be much slower and you couldn’t use your own preferences of threading/searching etc that you were used to in your news reader.

 

News is quite similar to mailing lists and has the same limitations except that threads are working well across different clients.

 

After a few months I noticed that the traffic on the news groups was decreasing. In March 2004 I decided to check out what the entire buzz was about so I created a bogus account. I didn’t think I would log in to this system twice so why bother using real info? After seeing the quality of the content of these forums I logged out and created a real account that I could use in the future. I didn’t give up on the news groups right away, but I was more and more often using the web forums after that. Why? First of all, there were a few VMware employees on the forum that would have answers to questions nobody else could know. Things that wasn’t documented. Things that would require digging in the source to figure out. Documentation back then wasn’t nearly as good as today and a knowledgebase didn’t exist. Having access to discuss topics directly with those who can figure out such things, just make you want to hang around more to find out what else you have missed lately. It turned out to be highly addictive. And quite a few got addicted.

 

Another thing that didn’t hurt the forum activity was the Forum rewards-program. They announced that points were now to be awarded by the question posters and that VMware would send gifts to those who got enough points based on that. VMware also sponsored trip+hotel+access to VMworld 2005 for the top 5 forum users where they would be a panel for an “Ask the community experts”-session. Having good products is one thing, but when you add things like this you risk that people start loving the whole concept. It attracted many smart users who in turn helped keep the quality of the forums at a very high level. When people start loving a company’s concepts they will help the company build a good reputation. We have during the past couple of years seen the blogosphere around VMware grow quite substantially and it wouldn’t do that if people didn’t feel good about VMware and the products. A large part of the bloggers has also a background as active users on the VMware community forums.

 

If I’m questioned about products/technology I don’t know well I usually check if the product has an active community. If it does, it’s often an indication that the products are good or that the products have great potential. You will also be able to get non-marketing answers to questions about the product line even if you’re not an existing customer. An active community does certainly not replace an official support line, but it can offload it quite a bit for the commonly known issues. An official support line is still needed for resolving the more serious issues.

A community is also normally something that potential customers can join and not only existing customers/partners. I’m still amazed that not all vendors understand the value of such a community and also charge their users extra for being able to join or post to such a community. By limiting their forums like that they are probably loosing potential customers and potential active users. If their product is interesting enough it can attract technical people who are willing to spend their free time on their forums. This is a win-win situation for all parts. Potential customers and existing customers are being helped with their basic questions on the forums and the official support center can concentrate about the more serious issues. Users and consultants are getting to know the most common challenges that people are having and special solutions to things in a way that is often more trustworthy than a marketing brochure.

 

A while ago I was introduced to a product that looked very interesting. It had a very nice design, some smart solutions and they could refer to a handful of major customers. Googling the product did however not give too many results and a user community didn’t exist. I evaluated the product and had a quite close contact with the distributor during this process. If the vendor had community forums I suspect I may have gotten answers to some of my questions quicker and saved myself some time during my testing.

 

While this product wasn’t bad, I still prefer using a competing product even though it’s a more expensive one. Not because the competing product is so much better for the end user, but because it has good community forums, a larger user base, and less complex installation procedures.

 

Personally I find the VMware forums an excellent resource, but I’m still waiting for the next version of the forums to be released as the current version is too slow for my patience and the rich text editor is useless. The current version was introduced in late September 2007 and had major problems shortly after being put into production. The stability problems were resolved, but the performance and formatting issues still isn’t perfect. As I’m not currently as active on the forums as I once was, I’m now blogging using the same system, just to be reminded of how bad or good the situation is. I could have used Wordpress or another known working blog platform, but then I wouldn’t get reminded of the status of the VMware forums.

 

There are still lots of activity on the VMware community forums, so it seems all are not as impatient or formatting oriented as me. I have tried joining back at some occasions, but found myself switching to another tab while waiting for a response from the community. Having the forums available is still something I value highly even if I’m not active there on a daily basis.

 

By subscribing to RSS feeds on Planet v12n (“planet virtualization”) I’ve been able to get a great deal of info without using the forums. The weekly VMTN Roundtable podcast on Talkshoe is also good. It also has chat during the show so it’s much more valuable in a live session than a recording.

 

There is also an irc channel on freenode, #VMware that I’ve been part of for a few years now and that’s also a place where it’s possible to discuss VMware related topics with other peer users.

 

In addition to these arenas I’m now also following the progress in the virtualization field via Twitter. It’s amazing how Twitter has attracted the right users who will point you to blog posts and good community postings on a regular basis while there’s also some direct discussion going on right there.

 

To sum things up, the bottom line is: VMware have some very good products that people like. VMware have supported their users by giving them an arena to discuss their product and share their experiences in an unmoderated fashion. This has lead to users spending a lot of their spare time writing about the products on both the community forums and in other arenas. The users now know more about the products than they would by only reading the documentation. This means that the users are now better at what they do, certified or not certified. Several books have been written by community members. The importance of such a community can never be underrated.

200 Views 1 Comments Permalink
0

Riding the virtualization wave

Posted by larstr Jun 17, 2009

Virtualization is a hot topic and has been that for a while now. While there is only really one big player in this field there are many smaller ones that are making their living out of complementary products. We also see virtualization as a field that is still growing. Only a small percentage of the existing servers in the world run virtualized. Only a tiny fragment of a percentage of the desktops is virtualized. Forecasts by IDC (and others) have predicted that virtualization is Soon™ coming to a system near you.

 

I’m sure vendors in many fields have noticed this trend and want to join in on the ride. The easiest solution would be to adapt an existing software solution into the virtualization field without too much rewriting, while writing software from the scratch requires more effort (and money).

 

I’m sure our good friends at CheckPoint had similar thoughts last year before they launched a “Virtual Edition” ( VE ) of their most known firewalling suites (UTM VE, Power VE & UTM Power VE). CheckPoint have provided a virtual appliance for years that has been available through the VMware Virtual Appliance marketplace, but it hasn’t been supported for production usage.

 

 

I have used their old appliance myself and it has worked fine for for testing out different firewalling concepts. CheckPoint is providing their own linux distro called SecurePlatform (SPLAT) which is based on RHEL (similar to the Service Console), but modified to be a firewall distro and comes with a custom shell (cpshell). It also ships with a custom built kernel. This kernel has until recently been a 2.4-kernel by default. R70 (shipped in march 2009) is the first version to have a 2.6-kernel by default. The old 2.4-kernel was a bit special since they have backported many newer drivers into it. The firewall also consist of kernel modules so you can’t use whatever kernel you want.

 

When this new “Virtual Edition” product line was shipped it was advertised as a product that could protect your virtual machines. It was licensed per ESX host and did not support VMotion. It came in two editions: One for up to 5 virtual machines and one unlimited version. This license is per VMware ESX host (yes, I’ve said that already). When this product was shipped it had broad media coverage and many websites/news sites noted that CheckPoint was now shipping products for VMware.

 

 

Who run 5 VMs or less on an ESX server? Who with such a smallish system would need an enterprise firewalling suite to protect the VMs? If you have a larger environment you can buy an unlimited edition, but as VMotion is not supported it would be useless too. In ESX 3.x you can have a maximum of 4 virtual nics (10 in v4), so if you want to protect each of your VMs with a separate set of rules you can’t put too many VMs on the system if the VMs are to be completely separated (also from each other). That still doesn’t justify that CheckPoint is trying to charge $7500 for the cheapest 5 VM edition (UTM) and $15000 for the unlimited edition. Per ESX host. Yes, to protect your VMs with a VE they will charge you a lot more than they would charge if you were installing their software on a physical box. An unlimited UTM license has a list price of $13000. And that price is independent on your number of ESX hosts.

 

 

So what is special about VE? Nothing. Well, it comes as an ovf file instead of an iso file so you don’t have to do the initial 7 minutes of an install wizard getting the disk partitioned+formatted and files being thrown over from the iso. Other than that, VE is a standard SPLAT (R65) install. Not even VMware Tools is installed, so networking performance is only as good as the emulated vlance (pcnet32) nic (that also puts extra cpu load on the system when in use). Guess they couldn’t get vmxnet working since VE is using a 2.4 kernel and vmxnet on 2.4 hasn’t been supported after ESX3 was released. I’m really sorry CheckPoint, but this is not good enough. The Virtual Edition concept really doesn’t have any benefits compared to the traditional editions. It costs a lot more, doesn’t have optimal performance and requires you to stop using VMotion(!). Can’t think of a single case where that would be a useful solution. Running a virtualized firewall is however something that in the future will be as normal as it is for many today to run a virtual server. The networking layer is already on it’s way into the virtualized datacenter now with Cisco as the first vendor with a native ESX switch. Cisco currently has no plans to port their ASA firewalling software over to a virtualized platform (ref VMTN roundtable a few weeks ago). Hope to see firewall vendors jumping onto the bandwagon in a more serious manner soon too. Who wants to ride the virtualization wave?

434 Views 0 Comments Permalink
0

VMware and the Rhino

Posted by larstr Mar 30, 2009

Back in the mid-late-nineties there were one global market leader in the firewall/vpn market: CheckPoint. There were some other players as well (Gauntlet, Raptor, Watchguard), while the big players, Cisco and Microsoft had no solutions at the time.

 

CheckPoint's solution was a software solution that injected a firewall driver between the ip stack and the nic drivers, and could because of this also protect the ip stack of the OS it was running on. It supported several platforms such as SunOS, Aix, HPUX and Windows NT. CheckPoint's development at the time was mainly targeting SunOS and it was back then the preferred platform.

 

 

 

I did my first CheckPoint training and installs back in '96. These were however on HPUX due to current vendor ties of my employer. In the years coming I was involved in quite a few installs in both small and large businesses. Most of them were in the process of getting an internet connection while some of them needed firewalls internally or for vpn's to business partners. With CheckPoint you had a nice GUI, an inspect language for defining new protocols (based on content of the data stream), application proxies, and a very good log viewer. CheckPoint was king of the hill, functionality wise. But there were annoying issues with licensing and the process for acquiring access to 3des licenses/software was quite awkward. The price model was also of the more expensive type.

 

 

 

Cisco had bought firewalling technology back in '95 that they had been developing further and tried adjusting the command set be similar to IOS. I attended Cisco Pix training in 1999 after Cisco had some campaigns where they had shown off their Pix firewall and declared it ready for business. Cisco's main argument was performance, but they also used other tricks in their marketing. The cisco Pix was also priced cheaper than CheckPoint, and for small customers it didn't really matter so much what product they chose as both would solve their core need of seperating/protecting networks.

 

 

 

We did however get a request from a larger customer in the financial sector who had decided to go for the Cisco Pix. We had installed CheckPoint at several similar environments before, but it's always hard to estimate how much time you're going to use as each environment is unique. In that particular case I think it's safe to say that we did get everything working in the end, but we probably used five times as many hours of setting it up than we would if we had used CheckPoint. One of the comments that came up during troubleshooting these Cisco issues was that you can compare Cisco with a Rhino: "They can have a slow start with products or arrive late to the table, but once they're gaining some speed you can't stop them." And at that point Cisco's product wasn't as mature as we had wanted it to be.

 

 

 

During this same period (late nineties-2001) we also used Axent Raptor Firewall for quite a few customers. Raptor was a product that had a very slick management interface, that would allow relatively complex environments to be configured easily. It was one of the earliest MMC applications that I came in touch with. This technology was later bought and brought to failure by Symantec who replaced the MMC interface with a java based one.

 

 

 

CheckPoint have however done little to their enterprise pricing other than that they have included more and more functionality into their package. For the SOHO market they launched appliance boxes through their daughter company Sofaware that were reasonably priced.

 

 

 

Several other competitors were also joining this field and most of them have taken a slice of the cake. Cisco also launced a new and much improved technology in 2005 with their ASA series to a fraction of the price that CheckPoint demands. Microsoft also launced MS Proxy server (later known as ISA), but it's functionality and performance wasn't covering the same area as CheckPoint and was often used to bring complementary functionality alongside with a "standard" firewall.

 

 

 

There is no doubt that CheckPoint have lost customers during these past 10 years (especially in the non-enterprise market), but they have also got new ones. In the SMB market they have lost a quite huge part of their share. For big environments, CheckPoint still have a large market share due to their still superior management software, even though their pricing haven't changed much (even with the new price model launced 3 weeks ago). According to themself, 100% of fortune 100 and 98% of fortune 500 are using a checkpoint security solution (they now also have many other products than firewalls). Other than having a good product line it has also helped that they have had many partners providing supplementary and/or prepackaged solutions. So it is possible after all to survive a Rhino, but you probably can't do it without some good products, partners, marketing, focus and timing. Having good products still wasn't good enough to keep SMB customers though, where price often is a more important factor than functionality as long as cheaper competing products are "good enough".

 

 

 

So how does VMware fit into this picture? Well, VMware is facing a similar situation now as there are several players that have just entered their core market. Citrix XenServer is the product that currently seems to beat VMware on raw performance, while Microsoft Hyper-V seems to be the most important comptetitor in the long term. In this game, Microsoft has taken the role as the Rhino and they have put some tremendous effort in order to get into the hypervisor game (and they missed their own deadline by a few years). Until now, VMware has shown to have smart roadmap in order to face the competition by being first to the table both with new technology and by bringing free products to the market (first Player, then Server and now last year ESXi was launched). We have seen Microsoft preaching the importance of the functionality that will be available in Hyper-V R2 since 2006, and they are repeating this at every opportunity now.

 

 

 

Even though Citrix have had fairly good technology since 2007, it seems that the mainstream wide adoption of XenServer isn't coming too quickly, even though they also include live migration for free in the version that will be available in two weeks. Citrix have over the years bought many types of products, but there's only one product package (that changes name every leap year) that people associate with Citrix. I still think that we'll see some adoption of XenServer in the product area that Citrix know best: Server based computing for end users (XenApp & XenDesktop).

 

Citrix and Microsoft have been both close partners and competitors for quite some time, and on the virtualization side, they are cooperating to bring functionality to each others hypervisor while competing with VMware.

 

In the months to come we will probably see a lot of marketing and technical sessions from MS showcasing Hyper-V R2 and in the meantime it will be exciting to see VMware launch vSHH the next version of Virtual Infrastructure with lots of new bells and whistles. While no marketing stunts can compete with the stunt of giving something away for free, VMware aren't doing so bad either. Their focus on Fusion for OSX has IMHO been a ground where they have showed excellent marketing skills, making them the current market leader also on OSX. I guess that it has been a good drill for the upcoming era of new products being launched. I'm surely looking forward to an exciting time ahead.

378 Views 0 Comments Permalink
1

NFS vs FC?

Posted by larstr Feb 6, 2009

While attending NetApp training they told us that NFS performance is better than FC when using many VMs on a host because latency is

lower.

 

Here are the slides they showed:

 

 

 

No doubt that by using NFS instead of FC you get better functionality on NetApp than you do on most other solutions such as storage native per VM snapshots without performance degradation.

 

Still, I'm not sure I'm buying these statements just like that... VMware has also done some metering of the different storage protocols and there FC is clearly much faster than the other protocols, even when running many VMs. VMware is measuring throughput instead of latency so the numbers are not directly comparable:

 

 

I did some testing back at VMworld 2006 in a lab where we compared storage performance in  the different protocols and got similar throughput results as those from the VMware paper above (FC 2x as fast as NFS/iSCSI). Latency is of course an important factor here as well, but I would imagine that latency over FC would typically be quite a bit lower than one transported over IP (atleast if you don't count in FCOE with enterprise 10GbE switches). The storage used in that lab was also provided by NetApp.

 

 

Lars

834 Views 1 Comments Permalink


Communities