升级Intel 545s SSD firmware

我的二手 HPE ProLiant DL360 Gen9服务器 服务器使用了一块我很久以前购买的Intel 545s Series SSD磁盘,不过这块SSD时不时在系统日志中留下触目惊心的Err记录:

dmesg 中SSD磁盘错误日志
[Sun Aug  6 11:05:54 2023] ata5.00: exception Emask 0x0 SAct 0x80080000 SErr 0x0 action 0x6 frozen
[Sun Aug  6 11:05:54 2023] ata5.00: failed command: READ FPDMA QUEUED
[Sun Aug  6 11:05:54 2023] ata5.00: cmd 60/08:98:98:20:9c/00:00:02:00:00/40 tag 19 ncq dma 4096 in
                                    res 40/00:00:00:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
[Sun Aug  6 11:05:54 2023] ata5.00: status: { DRDY }
[Sun Aug  6 11:05:54 2023] ata5.00: failed command: READ FPDMA QUEUED
[Sun Aug  6 11:05:54 2023] ata5.00: cmd 60/08:f8:e8:e4:8c/00:00:00:00:00/40 tag 31 ncq dma 4096 in
                                    res 40/00:01:00:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
[Sun Aug  6 11:05:54 2023] ata5.00: status: { DRDY }
[Sun Aug  6 11:05:54 2023] ata5: hard resetting link
[Sun Aug  6 11:05:54 2023] ata5: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[Sun Aug  6 11:05:54 2023] ata5.00: configured for UDMA/133
[Sun Aug  6 11:05:54 2023] sd 4:0:0:0: [sdb] tag#31 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=30s
[Sun Aug  6 11:05:54 2023] sd 4:0:0:0: [sdb] tag#31 Sense Key : Illegal Request [current]
[Sun Aug  6 11:05:54 2023] sd 4:0:0:0: [sdb] tag#31 Add. Sense: Unaligned write command
[Sun Aug  6 11:05:54 2023] sd 4:0:0:0: [sdb] tag#31 CDB: Read(10) 28 00 00 8c e4 e8 00 00 08 00
[Sun Aug  6 11:05:54 2023] blk_update_request: I/O error, dev sdb, sector 9233640 op 0x0:(READ) flags 0x80700 phys_seg 1 prio class 0
[Sun Aug  6 11:05:54 2023] ata5: EH complete
[Sun Aug  6 11:05:54 2023] ata5.00: Enabling discard_zeroes_data
[Sun Aug  6 11:06:24 2023] ata5.00: exception Emask 0x0 SAct 0x1000000 SErr 0x0 action 0x6 frozen
[Sun Aug  6 11:06:24 2023] ata5.00: failed command: READ FPDMA QUEUED
[Sun Aug  6 11:06:24 2023] ata5.00: cmd 60/08:c0:70:1f:ce/00:00:00:00:00/40 tag 24 ncq dma 4096 in
                                    res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[Sun Aug  6 11:06:24 2023] ata5.00: status: { DRDY }
[Sun Aug  6 11:06:24 2023] ata5: hard resetting link
[Sun Aug  6 11:06:24 2023] ata5: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[Sun Aug  6 11:06:24 2023] ata5.00: configured for UDMA/133
[Sun Aug  6 11:06:24 2023] ata5.00: device reported invalid CHS sector 0
[Sun Aug  6 11:06:24 2023] sd 4:0:0:0: [sdb] tag#24 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=30s
[Sun Aug  6 11:06:24 2023] sd 4:0:0:0: [sdb] tag#24 Sense Key : Illegal Request [current]
[Sun Aug  6 11:06:24 2023] sd 4:0:0:0: [sdb] tag#24 Add. Sense: Unaligned write command
[Sun Aug  6 11:06:24 2023] sd 4:0:0:0: [sdb] tag#24 CDB: Read(10) 28 00 00 ce 1f 70 00 00 08 00
[Sun Aug  6 11:06:24 2023] blk_update_request: I/O error, dev sdb, sector 13508464 op 0x0:(READ) flags 0x80700 phys_seg 1 prio class 0
[Sun Aug  6 11:06:24 2023] ata5: EH complete
[Sun Aug  6 11:06:24 2023] ata5.00: Enabling discard_zeroes_data

找了一下资料,感觉有两种可能:

  • SATA连接线存在电气问题

  • Firmware存在bug

我感觉Firmware存在bug可能性较大

  • 通过 smartctl info 检查磁盘设备可以看到SN以及firmware版本:

smartctl -i 检查磁盘info信息
sudo smartctl -i /dev/sda
smartctl -i 检查Intel SSD磁盘info信息
=== START OF INFORMATION SECTION ===
Model Family:     Intel 545s Series SSDs
Device Model:     INTEL SSDSC2KW512G8
Serial Number:    BTLA7513037S512DGN
LU WWN Device Id: 5 5cd2e4 14eea7536
Firmware Version: LHF002C
User Capacity:    512,110,190,592 bytes [512 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Form Factor:      2.5 inches
TRIM Command:     Available, deterministic, zeroed
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-3 (minor revision not indicated)
SATA Version is:  SATA 3.2, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Wed Aug 23 11:42:31 2023 CST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

根据 Latest Firmware For Solidigm™ (Formerly Intel®) Solid State Drives 545s系列 最新版本是 004C ,比我目前的 002C 要高2个版本

关于更新,我有点疑惑,主要是Intel的SSD业务已经卖给了 SK海力士 ( Solidigm正式宣布成立,成为NAND闪存技术市场领导者 目前在Solidigm网站也能查询到Intel SSD产品信息。Intel官方网站文档 How to Update the Firmware of an Intel® SSD with the Intel® Memory and Storage Tool 是针对内存和SSD的混合管理工具。

Intel DC系列SSD升级

Upgrading the firmware of Intel DC series SSDs in Linux (Debian)How to update Intel SSD firmware - CentOS 分别介绍了在Ubuntu和CentOS上如何升级 Intel DC系列SSD (数据中心SSD存储):

  • 安装 Intel SSD Data Center Tool (Intel SSD DCT),也就是 isdct 工具

  • 检查是否有新版本firmware:

使用 isdct 检查是否有新版Intel SSD firmware
isdct show -intelssd
  • 执行以下命令升级:

使用 isdct 升级新版Intel SSD firmware
isdct load -intelssd {serial number}
# or
isdct load -intelssd {index}

Intel MAS升级SSD

Do I need to update firmware with Intel® SSD Firmware Update Tool for intel 535 SSD when using Ubuntu 16.04? [closed] 提到使用 Intel® SSD Firmware Update Tool 是OS无关的工具 ,应该可以升级个人版SSD

下载

  • 下载 Intel SSD Firmware Update Tool 4.1:

下载 Intel SSD Firmware Update Tool 4.1
wget https://downloadmirror.intel.com/763597/Intel_SSD_FUT_4.1.zip

unzip Intel_SSD_FUT_4.1.zip

解压缩以后可以看到包含了手册和 iso 文件:

-rw-r--r-- 1 huatai dialout 1.3M Dec 12  2022  322570_Intel_SSD_Firmware_Update_Tool_User_Guide_Rev016US.pdf
-rw-r--r-- 1 huatai dialout 236K Dec  7  2022  328292_Intel_SSD_Firmware_Update_Tool_Release_Notes_Rev037US.pdf
-rw-r--r-- 1 huatai dialout  79M Dec 14  2022  issdfut_64_4.1.17.iso
-rw-r--r-- 1 huatai dialout  14K Dec 14  2022  SHA512__Hash.docx
-rw-r--r-- 1 huatai dialout  60K Oct 21  2021 'SoftwareLicenseAgreement_Commercial Use.pdf'

我准备周末将硬盘拆机到线下笔记本电脑上,通过笔记本U盘启动来进行更新(服务器重启实在太麻烦了,不利于升级)

参考