From: Sergey Matveev Date: Mon, 29 Sep 2025 17:27:34 +0000 (+0300) Subject: ZFS показал что у меня начал отваливаться диск X-Git-Url: http://www.git.stargrave.org/?a=commitdiff_plain;h=2638db8098a64f8f870bd026f8a4884f6f10fd83;p=stargrave-blog.git ZFS показал что у меня начал отваливаться диск На рабочем компьютере, после очередного scrub, мне показали что с одним из дисков не всё в порядке: # zpool status pool: zroot state: ONLINE status: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected. action: Determine if the device needs to be replaced, and clear the errors using 'zpool clear' or replace the device with 'zpool replace'. see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-9P scan: scrub repaired 2M in 00:59:21 with 0 errors on Mon Sep 1 22:54:16 2025 config: NAME STATE READ WRITE CKSUM zroot ONLINE 0 0 0 mirror-0 ONLINE 0 0 0 gpt/SGWD0-ROOT ONLINE 16 0 2 gpt/SGWD1-ROOT ONLINE 0 0 0 dmesg, действительно, показывает ошибки: [2463378] (ada0:ahcich0:0:0:0): RES: 41 40 17 ff 2d 40 07 00 00 00 00 [2463378] (ada0:ahcich0:0:0:0): Retrying command, 2 more tries remain [2463380] (ada0:ahcich0:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 78 f7 2d 40 07 00 00 08 00 00 [2463380] (ada0:ahcich0:0:0:0): CAM status: ATA Status Error [2463380] (ada0:ahcich0:0:0:0): ATA status: 41 (DRDY ERR), error: 40 (UNC ) [2463380] (ada0:ahcich0:0:0:0): RES: 41 40 17 ff 2d 40 07 00 00 00 00 [2463380] (ada0:ahcich0:0:0:0): Retrying command, 1 more tries remain [2463381] (ada0:ahcich0:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 78 f7 2d 40 07 00 00 08 00 00 [2463381] (ada0:ahcich0:0:0:0): CAM status: ATA Status Error [2463381] (ada0:ahcich0:0:0:0): ATA status: 41 (DRDY ERR), error: 40 (UNC ) [2463381] (ada0:ahcich0:0:0:0): RES: 41 40 17 ff 2d 40 07 00 00 00 00 [2463381] (ada0:ahcich0:0:0:0): Retrying command, 0 more tries remain [2463383] (ada0:ahcich0:0:0:0): READ_FPDMA_QUEUED. ACB: 60 00 78 f7 2d 40 07 00 00 08 00 00 [2463383] (ada0:ahcich0:0:0:0): CAM status: ATA Status Error [2463383] (ada0:ahcich0:0:0:0): ATA status: 41 (DRDY ERR), error: 40 (UNC ) [2463383] (ada0:ahcich0:0:0:0): RES: 41 40 17 ff 2d 40 07 00 00 00 00 [2463383] (ada0:ahcich0:0:0:0): Error 5, Retries exhausted SMART здорового диска: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0 3 Spin_Up_Time 0x0027 141 139 021 Pre-fail Always - 3908 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 165 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0 9 Power_On_Hours 0x0032 010 010 000 Old_age Always - 66297 10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 165 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 70 193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 110 194 Temperature_Celsius 0x0022 099 085 000 Old_age Always - 44 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0 SMART на том, где ошибки: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 10 3 Spin_Up_Time 0x0027 139 138 021 Pre-fail Always - 4008 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 405 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0 9 Power_On_Hours 0x0032 001 001 000 Old_age Always - 88255 10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 403 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 146 193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 258 194 Temperature_Celsius 0x0022 097 083 000 Old_age Always - 46 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 10 Работал более десяти лет. Пора на покой. zpool detach, вставляем новый диск, zpool attach, через какое-то время здоровенькое зеркало снова в строю. ---