The Problem:
Ivan had a customer that set up an Exchange 2010 c7000 BladeSystem cluster and they seemed to have a lot of MPIO errors in the eventlogs and were trying to pinpoint the root cause of these errors. On some of these servers they did have problems with disks disappearing. Most of the systems were multi-node clusters so that most of the time the application survived.
The Infrastructure:
C7000 Blade Enclosures with Virtual Connect 8Gb 24 Port FC Modules
BL460c G7 with Emulex LP1205 FC HBAs
HP Storage 8/40 SAN Switches
HP Storage EVA6400 Arrays
Operating System:
OS: Windows 2008 R2-SP1
MPIO 4.00.00 and 4.01.00
So what did Ivan find?
They did some deeper analysis on this issue and came to the following conclusion. Interesting enough to share I believe.
On our 2010 Exchange DAG cluster we experienced disk problems. Looking to the Event Viewer, we immediately noticed a lot of MPIO errors about failing paths. Our investigation led us to investigating other nodes (of different clusters but same Hardware (HW) ruling out any HW problems. But we also noticed lots of MPIO errors on our Hyper-V clusters? Are these related?
We did some Event Viewer analysis on the Hyper-V clusters and discovered that we needed to categorize all MPIO errors
Next challenge is explain this to the System Center Operations Manager (SCOM) ![]()
EVENTID 302: An unrecoverable path failure occurred on SCSI address xxx. Disk xxx failed due to no redundant paths available
EVENTID 304: An unrecoverable path failure occurred on SCSI address xxx. Disk xxx is still accessible over redundant path
Row Labels | Count of EventID 302 + 304 |
HBICLU01 | |
HBIHYP01.ad-cob.domain | 31 |
HBIHYP02.ad-cob.domain | 28 |
HBIHYP03.ad-cob.domain | 42 |
HBIHYP04.ad-cob.domain | 28 |
HBIHYP07.ad-cob.domain | 109 |
HBIHYP08.ad-cob.domain | 28 |
HBICLU02 | |
HBIHYP05.ad-cob.domain | 8 |
HBIHYP06.ad-cob.domain | 8 |
HBIHYP09.ad-cob.domain | 8 |
HBIHYP10.ad-cob.domain | 8 |
HBIHYP11.ad-cob.domain | 8 |
HBIHYP12.ad-cob.domain | 8 |
Grand Total | 314 |
Just sharing some field info and hope it helps when you are looking at MultiPath I/O issues. Any comments or suggestions on this topic?
We encourage you to share your comments on this post. Comments are moderated and will be reviewed and posted as promptly as possible during regular business hours.
To ensure your comment is published, please follow our community guidelines.