The Case of the Failed Storage (East/West) Switch fabric with #StorageSpacesDirect

Hey Checkyourlogs fans,

I wanted to write to you today chatting about what happens if you lose your East/West (Storage) Switch Fabric with Storage Spaces Direct. In my design for this customer, they had a Dedicated pair of Mellanox switches for their Storage Network and then different core switches for their North/South (VM & MGMT Networks). We had to do some emergency maintenance on our pair of Mellanox Switches that would require a reboot of both.

I let the customer know that when this happens, the RDMA (RoCE) traffic would just
fail over to non-RDMA (RoCE) during the outage. They couldn’t believe this and wanted to check it out or themselves.

Here is how our Virtual Adapters are configured:

MGMT are configured through a SET Switch on NIC_1 and NIC_2 to the Core Cisco Switches

HB/LM/SMB_1/SMB_2 are configured through a 2^nd SET Switch to the Mellanox Core

So, I reloaded both switches (One had failed, so I only had one left). In essence a complete failure of the Storage Network at this point. (This is one major reason why I like dedicated Storage and Client Networks)

If you check in Failover Manager this is what it looks like:

Before…

After …

Our VMs for this cluster stayed online without any issues.

Pretty cool right. Just shows off a bit of resilience for Storage Spaces Direct for you today.

NOTE: All of the traffic would be a bit more congested but things would be alive and not just die and bluescreen. This obviously isn’t a permanent solution but knowing that you can do this gives some nice options for outage windows with your S2D Clusters.

BTW à This works the same for Windows Server 2016 and 2019.

Hope you enjoy,

Dave

Featured

Duplicate SIDs + KB5065426: Restoring SMB/File & Print Sharing Using Microsoft’s Known Issue Rollback (KIR)

Featured

Calgary Windows Server and Azure Hybrid Meetup - Jan 8, 2026 - Red vs. Blue who wins

Featured

Install Veeam Backup Enterprise Manager on Linux (Veeam Software Appliance) at Hyper-V 2025

Featured

Security Copilot in Intune is GA - Here’s What It Actually Does

Featured

AuditWindows PowerShell and AI to Better Clarity on BitLocker and LAPS

Featured

Introducing the Definitive Installation Script, Install-WinGetV2.ps1

The Case of the Failed Storage (East/West) Switch fabric with #StorageSpacesDirect

Related

About The Author

Kawula Dave

Leave a ReplyCancel reply

Translate our Blog

Subscribe to our videos

Subscribe to our Blog

Our Authors

Cary Sun

Cristal Kawula

Dave Kawula

Émile Cabot

John O'Neill Sr.

Kawula Dave

Kevin Kaminski

Rick Vanover

Steve Labeau

Follow Us

Facebook

Youtube

Twitter

Instagram

Category

Blog Stats

Featured

Duplicate SIDs + KB5065426: Restoring SMB/File & Print Sharing Using Microsoft’s Known Issue Rollback (KIR)

Featured

Calgary Windows Server and Azure Hybrid Meetup - Jan 8, 2026 - Red vs. Blue who wins

Featured

Install Veeam Backup Enterprise Manager on Linux (Veeam Software Appliance) at Hyper-V 2025

Featured

Security Copilot in Intune is GA - Here’s What It Actually Does

Featured

AuditWindows PowerShell and AI to Better Clarity on BitLocker and LAPS

Featured

Introducing the Definitive Installation Script, Install-WinGetV2.ps1

The Case of the Failed Storage (East/West) Switch fabric with #StorageSpacesDirect

Share this:

Related

About The Author

Kawula Dave

Related Posts

Get-SCPortClassification Scripting issue in SCVMM 1711 – Fixed #MVPBuzz #HyperV #StorageSpaceDirect

Deploying Storage Spaces Direct – Part 15 #StorageSpacesDirect #mvphour

The Case of Expanding a Full – Azure Stack HCI Nested Resilient Volume – #AzureStackHCI #S2D

How to fix storage space direct cluster file share witness failed

Leave a ReplyCancel reply

Translate our Blog

Subscribe to our videos

Subscribe to our Blog

Our Authors

Cary Sun

Cristal Kawula

Dave Kawula

Émile Cabot

John O'Neill Sr.

Kawula Dave

Kevin Kaminski

Rick Vanover

Steve Labeau

Follow Us

Facebook

Youtube

Twitter

Instagram

Category

Tags

Blog Stats