VMware Cloud Community
vmuniverse
Contributor
Contributor
Jump to solution

Active/Passive SAN and the Storage Processors

We have a CX500 SAN (Active/Passive) with 2 Storage Processors (SPA & SPB)

We have created different LUNS for the needs of different VM Machines.

I have configured so that all the LUNS we're going to use for our VI 3 are set to use the Storage Processor B, I want to know if thats neccessary because we use an Active/Passive SAN?

What I really would like to see is for example:

2 LUNS ---> use SPA (Storage Processor A)

2 LUNS ---> use SPB (Storage Processor B)

This would create some sort of load balancing between the Storage Processors

Can I use the method without risking Path Trashing? I know have to use the MRU on the ESX Servers (and I will), but it puzzles me if I can use SPA & SPB mixed for the LUNS

0 Kudos
1 Solution

Accepted Solutions
mitchellm3
Enthusiast
Enthusiast
Jump to solution

Yes, you can use both SPs without causing path thrashing as long as all your ESX hosts are configured to use the MRU multipathing policy. Remember, even though both SPs are active, they will only be active for the LUNS that you assign to them. So if LUN1 has a preferred controller of SPA and LUN2 has a preferred controller of SPB, and you configure 10 ESX hosts to access both luns and configure them to use a multipathing policy of MRU...of which you should, they will all access LUN1 through SPA and LUN2 through SPB. What makes your array truly active/passive is that when LUN1 has a preferred controller of SPA, it can not be accessed by SPB. SPB can only access LUN1 when it becomes the preferred controller...and that happens in three cases.

1. You manually change the preferred controller(SP) for LUN1 to SPB

2. A failure happens on a hba/switch/path, so the host multipathing software tells the ARRAY to change LUN1s preferred controller to SPB (as MRU will do for you)

3. SPA in the array fails so SPB will automatically make itself the preferred controller. Using MRU, all your ESX hosts will be notified to start accessing LUN1 on SPB.

Here is a good explaination of path thrashing posted by Jhanekom that I'm copying from this thread http://communities.vmware.com/message/874826#874826

Path thrashing is when the active path for a LUN switches rapidly from one storage controller to the next. It generally results in severely degraded performance and, in extreme cases, a number of instability issues. Path thrashing does not occur with only one ESX host or single-controller storage systems.

In ESX, path thrashing most frequently occurs if the ESX multipathing policy is set to "fixed", and the storage controllers are active/passive at the back-end ("esxcfg-mpath -l" lists half the paths as "on", and the other half as "standby".) The other case is when there is a configuration issue on the storage controller, specifically that the controllers are really active/passive, but report themselves as being active/active ("esxcfg-mpath -l" returns all paths' status as "on".)

In the first case, if ESX servers are set up to have different preferred paths (i.e. Server 1 prefers to use controller A, Server 2 prefers to use controller B), they may cause the active/passive controllers to switch a LUN from one controller to the next constantly. This is a configuration problem on the ESX server and can be corrected by setting the path policy back to the default of MRU. (LUNs on active/passive controllers should always use MRU; active/active LUNs can use either MRU or Fixed, but uses fixed by default.)

The second case is a configuration issue on the storage controller and needs to be corrected there. In this case, the ESX servers will have no way of determining which is the correct path and will try to communicate on the first available one. The storage controller will then fail the path over to the controller being target. The feature that needs to be disabled is sometimes called AVT (auto-volume transfer) and you'll see mention to it in /var/log/vmkwarning if the problem occurs. In most cases, setting the host type to "cluster" resolves this, but you need to check your SAN vendor's documentation for this one.

View solution in original post

0 Kudos
7 Replies
Texiwill
Leadership
Leadership
Jump to solution

Hello,

Active/Passive means only 1 SP is available at a time. If you did things the way you have outlined you would cause path thrashing. You can not present through the passive SP. You can its doable but once SPB becomes active all the LUNS on SPA shift to SPB. But since those LUNs want to work on SPA only, then everything shifts back.

2 LUNS ---> use SPA (Storage Processor A)

2 LUNS ---> use SPB (Storage Processor B)

This would create some sort of load balancing between the Storage Processors

Only on an Active/Active SAN.

Can I use the method without risking Path Trashing?

No


Best regards,

Edward L. Haletky

VMware Communities User Moderator

====

Author of the book 'VMWare ESX Server in the Enterprise: Planning and Securing Virtualization Servers', Copyright 2008 Pearson Education.

CIO Virtualization Blog: http://www.cio.com/blog/index/topic/168354

As well as the Virtualization Wiki at http://www.astroarch.com/wiki/index.php/Virtualization

--
Edward L. Haletky
vExpert XIV: 2009-2023,
VMTN Community Moderator
vSphere Upgrade Saga: https://www.astroarch.com/blogs
GitHub Repo: https://github.com/Texiwill
0 Kudos
Sanjana
Hot Shot
Hot Shot
Jump to solution

Texiwill,

Can I use the method without risking Path Trashing?

No

[

|http://www.astroarch.com/wiki/index.php/Virtualization]

I'm not sure if that's completely true for the Clariion. The arrays controllers are Active/Passive on a per-lun basis. There should be no path thrashing. The IOs will be serviced by the controller that is designated for a particular lun.

0 Kudos
patrickds
Expert
Expert
Jump to solution

My idea as well.

Just select the correct owning SP in the controller software (Navisphere was it?) and you can divide LUNs over both to create a Load balancing scheme.

The CX is active/passive for each lun, but both SPs are actually online.

mitchellm3
Enthusiast
Enthusiast
Jump to solution

The way you would like to create your LUNs with 2 preferred by SPA and 2 preferred by SPB will work and is probably the way to go. But as you stated, make sure you use MRU. The definition of Active/Active and Active/Passive changes depending on who you talk to. While looking for a good explaination online, I found one on techtarget that said an active/passive is one controller working while the other sits idle...waiting. I think you will find the following explaination helpful and I believe is what VMware uses as their definition.

http://www.emcstorageinfo.com/2008/06/what-is-difference-between-active.html

0 Kudos
vmuniverse
Contributor
Contributor
Jump to solution

It's a Clariion indeed. We use Navisphere to manage this SAN.

And yes both SP's are active. We have different LUNS that use both active SP's

I know the CX is an Active/Passive SAN, I have had this confirmation from EMC.

It's still confusing me because both SP's are active

So I can use both SP's without causing any Path trashing?

0 Kudos
Texiwill
Leadership
Leadership
Jump to solution

Hello,

If it is an 'active/passive' SAN per LUN then both SPs are active but never for the same LUN. So what you propose will work.

Other SANs, will accept traffic on the passive SP but over its backplane send the data to the active SP (EVA Line). Others will thrash (Low end SANs)

So it really depends on the SAN and how they handle the data. Low end SANs tend to thrash, high end SANs generally do not have this problem. Those in the middle may not thrash but have other ways of mitigating the possibility.


Best regards,

Edward L. Haletky

VMware Communities User Moderator

====

Author of the book 'VMWare ESX Server in the Enterprise: Planning and Securing Virtualization Servers', Copyright 2008 Pearson Education.

CIO Virtualization Blog: http://www.cio.com/blog/index/topic/168354

As well as the Virtualization Wiki at http://www.astroarch.com/wiki/index.php/Virtualization

--
Edward L. Haletky
vExpert XIV: 2009-2023,
VMTN Community Moderator
vSphere Upgrade Saga: https://www.astroarch.com/blogs
GitHub Repo: https://github.com/Texiwill
0 Kudos
mitchellm3
Enthusiast
Enthusiast
Jump to solution

Yes, you can use both SPs without causing path thrashing as long as all your ESX hosts are configured to use the MRU multipathing policy. Remember, even though both SPs are active, they will only be active for the LUNS that you assign to them. So if LUN1 has a preferred controller of SPA and LUN2 has a preferred controller of SPB, and you configure 10 ESX hosts to access both luns and configure them to use a multipathing policy of MRU...of which you should, they will all access LUN1 through SPA and LUN2 through SPB. What makes your array truly active/passive is that when LUN1 has a preferred controller of SPA, it can not be accessed by SPB. SPB can only access LUN1 when it becomes the preferred controller...and that happens in three cases.

1. You manually change the preferred controller(SP) for LUN1 to SPB

2. A failure happens on a hba/switch/path, so the host multipathing software tells the ARRAY to change LUN1s preferred controller to SPB (as MRU will do for you)

3. SPA in the array fails so SPB will automatically make itself the preferred controller. Using MRU, all your ESX hosts will be notified to start accessing LUN1 on SPB.

Here is a good explaination of path thrashing posted by Jhanekom that I'm copying from this thread http://communities.vmware.com/message/874826#874826

Path thrashing is when the active path for a LUN switches rapidly from one storage controller to the next. It generally results in severely degraded performance and, in extreme cases, a number of instability issues. Path thrashing does not occur with only one ESX host or single-controller storage systems.

In ESX, path thrashing most frequently occurs if the ESX multipathing policy is set to "fixed", and the storage controllers are active/passive at the back-end ("esxcfg-mpath -l" lists half the paths as "on", and the other half as "standby".) The other case is when there is a configuration issue on the storage controller, specifically that the controllers are really active/passive, but report themselves as being active/active ("esxcfg-mpath -l" returns all paths' status as "on".)

In the first case, if ESX servers are set up to have different preferred paths (i.e. Server 1 prefers to use controller A, Server 2 prefers to use controller B), they may cause the active/passive controllers to switch a LUN from one controller to the next constantly. This is a configuration problem on the ESX server and can be corrected by setting the path policy back to the default of MRU. (LUNs on active/passive controllers should always use MRU; active/active LUNs can use either MRU or Fixed, but uses fixed by default.)

The second case is a configuration issue on the storage controller and needs to be corrected there. In this case, the ESX servers will have no way of determining which is the correct path and will try to communicate on the first available one. The storage controller will then fail the path over to the controller being target. The feature that needs to be disabled is sometimes called AVT (auto-volume transfer) and you'll see mention to it in /var/log/vmkwarning if the problem occurs. In most cases, setting the host type to "cluster" resolves this, but you need to check your SAN vendor's documentation for this one.

0 Kudos