使用 storcli 修复 LSI RAID 硬盘 spinup failed 问题
分类:Hardware 标签:Raid
WHAT
最近处理一批 LSI RAID 故障的机器,遇到硬盘 UGood
状态竟无法配置 RAID 的情况
发现 PD 物理盘的 Spin 状态是 D
down 状态:
$ sudo /usr/local/sbin/storcli /c0/e0/s8 show
Controller = 0
Status = Success
Description = Show Drive Information Succeeded.
Drive Information :
=================
--------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp
--------------------------------------------------------------------------
0:8 4 UGood - 7.276 TB SATA HDD N N 512B ST8000NM0016-1U3101 D <-- Spin 状态为 D
--------------------------------------------------------------------------
使用 storcli
设置 PD Spin 状态 spinup
结果报错:
$ sudo /usr/local/sbin/storcli /c0/e0/s8 spinup
Controller = 0
Status = Failure
Description = Spin Up Drive Failed.
Detailed Status :
===============
-----------------------------------------------------------------------
Drive Status ErrCd ErrMsg
-----------------------------------------------------------------------
/c0/e0/s8 Failure 50 device state doesn't support requested command
-----------------------------------------------------------------------
HOW
使用 erase
清除一下 PD 硬盘 状态 :
$ sudo storcli /c0/e0/s8 help|grep -w erase
storcli /cx[/ex]/sx start erase [simple| normal| thorough | standard| threepass | crypto]
storcli /cx[/ex]/sx stop erase
storcli /cx[/ex]/sx show erase
硬盘 Spin 状态从 D
变为 U
:
$ sudo storcli /c0/e0/s8 start erase simple
earse 除了擦除 PD 状态,还会擦除数据,8T 硬盘耗时还是比较久的:
$ sudo storcli /c0/e0/s8 show erase
Controller = 0
Status = Success
Description = Show Drive Erase Status Succeeded.
----------------------------------------------------
Drive-ID Progress% Status Estimated Time Left
----------------------------------------------------
/c0/e0/s8 1 In progress 9 Hours 31 Minutes
----------------------------------------------------
Spin 状态复位后,停止擦除即可:
$ sudo storcli /c0/e0/s8 stop erase
Controller = 0
Status = Success
Description = Stop Drive Erase Succeeded.
清除磁盘状态后,Spin 状态可以顺利 spinup
和 spindown
开关:
$ sudo storcli /c0/e0/s8 spindown
Controller = 0
Status = Success
Description = Spin Down Drive Succeeded.
$ sudo storcli /c0/e0/s8 show
Controller = 0
Status = Success
Description = Show Drive Information Succeeded.
Drive Information :
=================
--------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp
--------------------------------------------------------------------------
0:8 4 UGood - 7.276 TB SATA HDD N N 512B ST8000NM0016-1U3101 D
--------------------------------------------------------------------------
$ sudo storcli /c0/e0/s8 spinup
Controller = 0
Status = Success
Description = Spin Up Drive Succeeded.
$ sudo storcli /c0/e0/s8 show
Controller = 0
Status = Success
Description = Show Drive Information Succeeded.
Drive Information :
=================
--------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp
--------------------------------------------------------------------------
0:8 4 UGood - 7.276 TB SATA HDD N N 512B ST8000NM0016-1U3101 U
--------------------------------------------------------------------------
故障盘设置 offline
、missing
下线后,需要设置 PD spindown
:
storcli /cx/ey/sz set offline
storcli /cx/ey/sz set missing
storcli /cx/ey/sz spindown
硬盘在 spindown
状态后是无法创建 RAID 的:
$ sudo storcli /c0/e0/s8 spindown
Controller = 0
Status = Success
Description = Spin Down Drive Succeeded.
$ sudo storcli /c0 add vd type=r0 drives=0:8
Controller = 0
Status = Failure
Description = physical disk does not have appropriate attributes
重新设为 spinup
后,UGood
硬盘就可以正常创建 RAID 了:
$ sudo storcli /c0/e0/s8 spinup
Controller = 0
Status = Success
Description = Spin Up Drive Succeeded.
$ sudo storcli /c0/e0/s8 show
Controller = 0
Status = Success
Description = Show Drive Information Succeeded.
Drive Information :
=================
--------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp
--------------------------------------------------------------------------
0:8 4 UGood - 7.276 TB SATA HDD N N 512B ST8000NM0016-1U3101 U
--------------------------------------------------------------------------
$ sudo storcli /c0 add vd type=r0 drives=0:8
Controller = 0
Status = Success
Description = Add VD Succeeded
CASE 2
更换故障盘无法创建 VD :
# storcli /c0 add vd r0 drives=9:7
Controller = 0
Status = Failure
Description = physical disk does not have appropriate attributes
没有 Foreign 盘:
# storcli /c0/fall show
Controller = 0
Status = Success
Description = Couldn't find any foreign Configuration
重新 上下线 硬盘还是不行:
# storcli /c0/e9/s7 spindown
Controller = 0
Status = Success
Description = Spin Down Drive Succeeded.
# storcli /c0/e9/s7 show
Controller = 0
Status = Success
Description = Show Drive Information Succeeded.
Drive Information :
=================
--------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp
--------------------------------------------------------------------------
9:7 14 UGood - 7.276 TB SATA HDD N N 512B ST8000NC0002-1XX112 D <--
--------------------------------------------------------------------------
# storcli /c0/e9/s7 spinup
Controller = 0
Status = Success
Description = Spin Up Drive Succeeded.
# storcli /c0 add vd r0 drives=9:7
Controller = 0
Status = Failure
Description = physical disk does not have appropriate attributes
下线 PD 使用 earse
清除,重新 初始化 一下:
# storcli /c0/e9/s7 spindown
Controller = 0
Status = Success
Description = Spin Down Drive Succeeded.
# storcli /c0/e9/s7 show
Controller = 0
Status = Success
Description = Show Drive Information Succeeded.
Drive Information :
=================
--------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp
--------------------------------------------------------------------------
9:7 14 UGood - 7.276 TB SATA HDD N N 512B ST8000NC0002-1XX112 D <--
--------------------------------------------------------------------------
# storcli /c0/e9/s7 start erase simple
Controller = 0
Status = Success
Description = Start Drive Erase Succeeded.
# storcli /c0/e9/s7 show erase
Controller = 0
Status = Success
Description = Show Drive Erase Status Succeeded.
----------------------------------------------------
Drive-ID Progress% Status Estimated Time Left
----------------------------------------------------
/c0/e9/s7 0 In progress 0 Seconds
----------------------------------------------------
PD 的 Spin
状态变为 T
:
# storcli /c0/e9/s7 show
Controller = 0
Status = Success
Description = Show Drive Information Succeeded.
Drive Information :
=================
--------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp
--------------------------------------------------------------------------
9:7 14 UGood - 7.276 TB SATA HDD N N 512B ST8000NC0002-1XX112 T <--
--------------------------------------------------------------------------
过一会儿 Spin 状态变为 U
状态:
# storcli /c0/e9/s7 show
Controller = 0
Status = Success
Description = Show Drive Information Succeeded.
Drive Information :
=================
--------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp
--------------------------------------------------------------------------
9:7 14 UGood - 7.276 TB SATA HDD N N 512B ST8000NC0002-1XX112 U
--------------------------------------------------------------------------
# storcli /c0/e9/s7 show erase
Controller = 0
Status = Success
Description = Show Drive Erase Status Succeeded.
----------------------------------------------------
Drive-ID Progress% Status Estimated Time Left
----------------------------------------------------
/c0/e9/s7 0 In progress 0 Seconds
----------------------------------------------------
停掉 erase
擦除任务:
# storcli /c0/e9/s7 stop erase
Controller = 0
Status = Success
Description = Stop Drive Erase Succeeded.
# storcli /c0/e9/s7 show erase
Controller = 0
Status = Success
Description = Show Drive Erase Status Succeeded.
--------------------------------------------------------
Drive-ID Progress% Status Estimated Time Left
--------------------------------------------------------
/c0/e9/s7 - Not in progress -
--------------------------------------------------------
# storcli /c0/e9/s7 show
Controller = 0
Status = Success
Description = Show Drive Information Succeeded.
Drive Information :
=================
--------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp
--------------------------------------------------------------------------
9:7 14 UGood - 7.276 TB SATA HDD N N 512B ST8000NC0002-1XX112 U
--------------------------------------------------------------------------
成功重新创建 VD :
# storcli /c0 add vd r0 drives=9:7
Controller = 0
Status = Success
Description = Add VD Succeeded
对比清除前后 PD 的详情配置,发现 Sequence Number
字段不一样:
$ diff -y pd-failed pd-online|egrep ' [|<>]'
9:7 14 UGood - 7.276 TB SATA HDD N N 512B ST8000NC00 | 9:7 14 Onln 12 7.276 TB SATA HDD N N 512B ST8000NC00
Drive Temperature = 25C (77.00 F) | Drive Temperature = 32C (89.60 F)
> Drive position = DriveGroup:12, Span:0, Row:0
Sequence Number = 5 | Sequence Number = 10
^ ^^
reference
- storcli
- LSI storcli64 examples
- How do I replace a failed drive with LSI 9280 cards?
- How do I remove a failing disk from a LSI MegaRAID disk group?
本文标题 | 使用 storcli 修复 LSI RAID 硬盘 spinup failed 问题 |
---|---|
原始链接 | https://lvii.github.io/hardware/2019-10-28-fix-LSI-raid-pd-disk-storcli-spinup-failed-problem/ |