Назад | Перейти на главную страницу

Почему все диски выпадают из RAID при выходе из строя 1? LSI9260-8i / IBM M5014

У меня интересная проблема, и я надеюсь, что вы все сможете мне помочь. У меня есть IBM 5014 (эквивалент LSI 9260-8i) с двумя виртуальными дисками RAID10. Первый - это 4 диска WD RE4, каждый по 2 ТБ, на общий диск 4 ТБ - назовем его VD1. Другой - это 4 WD RE4-GP, каждый по 2 ТБ, для другого накопителя на 4 ТБ - назовем это VD0. В случае, если это имеет значение, карта работает в корпусе Norco с 3 вентиляторами (по 1 на каждом банке из 4 дисков + 1 на гигабайтном МБ, 16 ГБ ОЗУ и карте IBM. Также имеется IBM5015 с 4 твердотельными накопителями по 256 ГБ в массиве RAID10. ). Я виртуализирован с помощью ESXi5.5 с серией виртуальных машин. Карта 5014 работает в режиме сквозной передачи на хост WHS2011, а карта 5015 содержит сами виртуальные машины.

VD0 работает нормально и без проблем. Это мое основное хранилище документов.

Однако VD1, содержащий все мои видео, периодически сбрасывает диск, вызывая ухудшение его работы, а затем почти мгновенно (обычно с той же меткой времени, но иногда с задержкой в ​​1 секунду) сбрасывает и остальные диски, что также вызывает это перейти в автономный режим.

Сам контроллер работает нормально в течение почти 6 месяцев, поэтому, хотя он может быть связан с контроллером, похоже, что это вызовет проблемы с обоими виртуальными дисками, а не только с одним из них.

Проблема в том, что диски не выпадают постоянно (по крайней мере, согласно журналу) в одном и том же порядке, поэтому я не знаю, какой из дисков вызывает проблему. Я включил отрывок из журнала ниже. Как вы увидите, он удаляет диски, а затем снова добавляет их.

Любые советы о том, как устранить неполадки на каком диске, были бы очень кстати - я не могу поверить, что все они вышли из строя вместе, и я не могу поверить, как мало информации содержится в самом журнале MSM.

Спасибо всем заранее!

Дуг

        ID = 248
    SEQUENCE NUMBER = 382617
    TIME = 07-07-2015 08:14:46
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   8

    ID = 112
    SEQUENCE NUMBER = 382616
    TIME = 07-07-2015 08:14:46
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:1

    ID = 248
    SEQUENCE NUMBER = 382615
    TIME = 07-07-2015 08:14:45
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   13

    ID = 112
    SEQUENCE NUMBER = 382614
    TIME = 07-07-2015 08:14:45
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:3

    ID = 248
    SEQUENCE NUMBER = 382613
    TIME = 07-07-2015 08:14:44
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   9

    ID = 112
    SEQUENCE NUMBER = 382612
    TIME = 07-07-2015 08:14:44
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:0

    ID = 248
    SEQUENCE NUMBER = 382611
    TIME = 07-07-2015 08:14:44
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   14

    ID = 112
    SEQUENCE NUMBER = 382610
    TIME = 07-07-2015 08:14:44
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:2

    ID = 247
    SEQUENCE NUMBER = 382609
    TIME = 07-07-2015 07:53:09
    LOCALIZED MESSAGE = Controller ID:  0  Device inserted   Device Type:       Disk  Device Id:   14

    ID = 91
    SEQUENCE NUMBER = 382608
    TIME = 07-07-2015 07:53:09
    LOCALIZED MESSAGE = Controller ID:  0   PD inserted:       -:-:2

    ID = 247
    SEQUENCE NUMBER = 382607
    TIME = 07-07-2015 07:53:09
    LOCALIZED MESSAGE = Controller ID:  0  Device inserted   Device Type:       Disk  Device Id:   9

    ID = 91
    SEQUENCE NUMBER = 382606
    TIME = 07-07-2015 07:53:09
    LOCALIZED MESSAGE = Controller ID:  0   PD inserted:       -:-:0

    ID = 247
    SEQUENCE NUMBER = 382605
    TIME = 07-07-2015 07:53:09
    LOCALIZED MESSAGE = Controller ID:  0  Device inserted   Device Type:       Disk  Device Id:   8

    ID = 91
    SEQUENCE NUMBER = 382604
    TIME = 07-07-2015 07:53:09
    LOCALIZED MESSAGE = Controller ID:  0   PD inserted:       -:-:1

    ID = 247
    SEQUENCE NUMBER = 382603
    TIME = 07-07-2015 07:53:04
    LOCALIZED MESSAGE = Controller ID:  0  Device inserted   Device Type:       Disk  Device Id:   13

    ID = 91
    SEQUENCE NUMBER = 382602
    TIME = 07-07-2015 07:53:04
    LOCALIZED MESSAGE = Controller ID:  0   PD inserted:       -:-:3

    ID = 248
    SEQUENCE NUMBER = 382601
    TIME = 07-07-2015 07:52:44
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   9

    ID = 112
    SEQUENCE NUMBER = 382600
    TIME = 07-07-2015 07:52:44
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:0

    ID = 248
    SEQUENCE NUMBER = 382599
    TIME = 07-07-2015 07:52:42
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   13

    ID = 112
    SEQUENCE NUMBER = 382598
    TIME = 07-07-2015 07:52:42
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:3

    ID = 248
    SEQUENCE NUMBER = 382597
    TIME = 07-07-2015 07:52:41
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   8

    ID = 112
    SEQUENCE NUMBER = 382596
    TIME = 07-07-2015 07:52:41
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:1

    ID = 248
    SEQUENCE NUMBER = 382595
    TIME = 07-07-2015 07:52:40
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   14

    ID = 112
    SEQUENCE NUMBER = 382594
    TIME = 07-07-2015 07:52:40
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:2

    ID = 145
    SEQUENCE NUMBER = 382593
    TIME = 07-07-2015 07:10:59
    LOCALIZED MESSAGE = Controller ID:  0   Battery temperature is high

    ID = 149
    SEQUENCE NUMBER = 382592
    TIME = 07-07-2015 06:56:54
    LOCALIZED MESSAGE = Controller ID:  0   Battery temperature is normal

    ID = 247
    SEQUENCE NUMBER = 382591
    TIME = 07-07-2015 04:08:56
    LOCALIZED MESSAGE = Controller ID:  0  Device inserted   Device Type:       Disk  Device Id:   14

    ID = 91
    SEQUENCE NUMBER = 382590
    TIME = 07-07-2015 04:08:56
    LOCALIZED MESSAGE = Controller ID:  0   PD inserted:       -:-:2

    ID = 247
    SEQUENCE NUMBER = 382589
    TIME = 07-07-2015 04:08:56
    LOCALIZED MESSAGE = Controller ID:  0  Device inserted   Device Type:       Disk  Device Id:   9

    ID = 91
    SEQUENCE NUMBER = 382588
    TIME = 07-07-2015 04:08:56
    LOCALIZED MESSAGE = Controller ID:  0   PD inserted:       -:-:0

    ID = 247
    SEQUENCE NUMBER = 382587
    TIME = 07-07-2015 04:08:55
    LOCALIZED MESSAGE = Controller ID:  0  Device inserted   Device Type:       Disk  Device Id:   8

    ID = 91
    SEQUENCE NUMBER = 382586
    TIME = 07-07-2015 04:08:55
    LOCALIZED MESSAGE = Controller ID:  0   PD inserted:       -:-:1

    ID = 248
    SEQUENCE NUMBER = 382585
    TIME = 07-07-2015 04:08:49
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   8

    ID = 112
    SEQUENCE NUMBER = 382584
    TIME = 07-07-2015 04:08:49
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:1

    ID = 248
    SEQUENCE NUMBER = 382583
    TIME = 07-07-2015 04:08:47
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   9

    ID = 112
    SEQUENCE NUMBER = 382582
    TIME = 07-07-2015 04:08:47
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:0

    ID = 248
    SEQUENCE NUMBER = 382581
    TIME = 07-07-2015 04:08:47
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   14

    ID = 112
    SEQUENCE NUMBER = 382580
    TIME = 07-07-2015 04:08:47
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:2

    ID = 247
    SEQUENCE NUMBER = 382579
    TIME = 07-07-2015 03:24:32
    LOCALIZED MESSAGE = Controller ID:  0  Device inserted   Device Type:       Disk  Device Id:   14

    ID = 91
    SEQUENCE NUMBER = 382578
    TIME = 07-07-2015 03:24:32
    LOCALIZED MESSAGE = Controller ID:  0   PD inserted:       -:-:2

    ID = 247
    SEQUENCE NUMBER = 382577
    TIME = 07-07-2015 03:24:32
    LOCALIZED MESSAGE = Controller ID:  0  Device inserted   Device Type:       Disk  Device Id:   13

    ID = 91
    SEQUENCE NUMBER = 382576
    TIME = 07-07-2015 03:24:32
    LOCALIZED MESSAGE = Controller ID:  0   PD inserted:       -:-:3

    ID = 247
    SEQUENCE NUMBER = 382575
    TIME = 07-07-2015 03:24:32
    LOCALIZED MESSAGE = Controller ID:  0  Device inserted   Device Type:       Disk  Device Id:   8

    ID = 91
    SEQUENCE NUMBER = 382574
    TIME = 07-07-2015 03:24:32
    LOCALIZED MESSAGE = Controller ID:  0   PD inserted:       -:-:1

    ID = 247
    SEQUENCE NUMBER = 382573
    TIME = 07-07-2015 03:24:27
    LOCALIZED MESSAGE = Controller ID:  0  Device inserted   Device Type:       Disk  Device Id:   9

    ID = 91
    SEQUENCE NUMBER = 382572
    TIME = 07-07-2015 03:24:27
    LOCALIZED MESSAGE = Controller ID:  0   PD inserted:       -:-:0

    ID = 248
    SEQUENCE NUMBER = 382571
    TIME = 07-07-2015 03:23:36
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   9

    ID = 112
    SEQUENCE NUMBER = 382570
    TIME = 07-07-2015 03:23:36
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:0

    ID = 248
    SEQUENCE NUMBER = 382569
    TIME = 07-07-2015 03:23:36
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   14

    ID = 112
    SEQUENCE NUMBER = 382568
    TIME = 07-07-2015 03:23:36
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:2

    ID = 248
    SEQUENCE NUMBER = 382567
    TIME = 07-07-2015 03:23:36
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   8

    ID = 112
    SEQUENCE NUMBER = 382566
    TIME = 07-07-2015 03:23:36
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:1

    ID = 248
    SEQUENCE NUMBER = 382565
    TIME = 07-07-2015 03:23:36
    LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   13

    ID = 112
    SEQUENCE NUMBER = 382564
    TIME = 07-07-2015 03:23:36
    LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:3
ID = 139
SEQUENCE NUMBER = 382435
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   Deleted VD:       1

ID = 114
SEQUENCE NUMBER = 382434
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   State change:   PD       =   -:-:0  Previous   =   Failed      Current   =   Unconfigured Bad

ID = 114
SEQUENCE NUMBER = 382433
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   State change:   PD       =   -:-:2  Previous   =   Failed      Current   =   Unconfigured Bad

ID = 114
SEQUENCE NUMBER = 382432
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   State change:   PD       =   -:-:1  Previous   =   Failed      Current   =   Unconfigured Bad

ID = 114
SEQUENCE NUMBER = 382431
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   State change:   PD       =   -:-:3  Previous   =   Failed      Current   =   Unconfigured Bad

ID = 114
SEQUENCE NUMBER = 382430
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   State change:   PD       =   -:-:0  Previous   =   Online      Current   =   Failed

ID = 248
SEQUENCE NUMBER = 382429
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   9

ID = 112
SEQUENCE NUMBER = 382428
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:0

ID = 252
SEQUENCE NUMBER = 382427
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0  VD is now OFFLINE   VD       1

ID = 81
SEQUENCE NUMBER = 382426
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   State change on VD:   1      Previous   =   Degraded  Current   =       Offline

ID = 114
SEQUENCE NUMBER = 382425
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   State change:   PD       =   -:-:2  Previous   =   Online      Current   =   Failed

ID = 248
SEQUENCE NUMBER = 382424
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   14

ID = 112
SEQUENCE NUMBER = 382423
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:2

ID = 114
SEQUENCE NUMBER = 382422
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   State change:   PD       =   -:-:1  Previous   =   Online      Current   =   Failed

ID = 248
SEQUENCE NUMBER = 382421
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   8

ID = 112
SEQUENCE NUMBER = 382420
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:1

ID = 251
SEQUENCE NUMBER = 382419
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0  VD is now DEGRADED   VD       1

ID = 81
SEQUENCE NUMBER = 382418
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   State change on VD:   1      Previous   =   Optimal  Current   =       Degraded

ID = 114
SEQUENCE NUMBER = 382417
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   State change:   PD       =   -:-:3  Previous   =   Online      Current   =   Failed

ID = 248
SEQUENCE NUMBER = 382416
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0  Device removed   Device Type:       Disk  Device Id:   13

ID = 112
SEQUENCE NUMBER = 382415
TIME = 04-07-2015 08:27:32
LOCALIZED MESSAGE = Controller ID:  0   PD removed:       -:-:3

Извините, не испытал того же, но мы запускаем LSI и раньше обновляли прошивку. Убедитесь, что у вас установлена ​​последняя версия прошивки устройства.