Ext4
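Random SSD shutdowns - ext4_find_entry, reading directory lblock 0
For the past two weeks I have had problems with the SSD in my GNU/Linux machine. I don't think it is a device fault, but I'm not sure. Occasionally (every 1-2 days lately) I lose physical access to the disk, as if it had been disconnected or had lost power.
Error: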
EXT4-fs error (device: sda2): ext4_find_entry:1465: inode #1308161: comm NetworkManager: reading directory lblock 0
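I typed this error in from a photo, so it may not be completely accurate.
Notes:
The device is always the same, "sda2"; I have not noticed errors on the other (large home) partition. I will try to check that next time. The inode and process name vary, but NetworkManager is common. The lblock is always 0.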
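To check whether the pattern in these notes really holds (same device, varying inode and process, lblock always 0), the error lines could be tallied from a saved kernel log. This is only a minimal sketch: the function names `parse_ext4_errors` and `summarize` are made up for illustration, and the regular expression is based on the single line quoted above, which was typed from a photo. The colon after "device" is treated as optional for that reason.

```python
import re
from collections import Counter

# Pattern modelled on the one error line quoted in this post (an assumption,
# not a complete description of every EXT4-fs message the kernel can emit).
EXT4_ERROR = re.compile(
    r"EXT4-fs error \(device:?\s*(?P<device>[^)]+)\): "
    r"(?P<function>\w+):(?P<line>\d+): "
    r"inode #(?P<inode>\d+): "
    r"comm (?P<comm>.+?): "
    r"(?P<message>.*)"
)


def parse_ext4_errors(log_text):
    """Return one dict per EXT4-fs error line found in log_text."""
    errors = []
    for match in EXT4_ERROR.finditer(log_text):
        fields = match.groupdict()
        fields["message"] = fields["message"].strip()
        errors.append(fields)
    return errors


def summarize(errors):
    """Count how often each device, process name and message occurs."""
    return {
        "devices": Counter(e["device"] for e in errors),
        "comms": Counter(e["comm"] for e in errors),
        "messages": Counter(e["message"] for e in errors),
    }
```

Feeding it the text of a saved log (for example, one captured from the rescue system) and looking at `summarize(...)["devices"]` would show whether any partition other than sda2 ever appears.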
Hardware:
Dell E7270 with an SSD: LITEON CV3-8D512-11 SATA 512GB
Software:
Debian testing, kernel 4.11.
Brief smartctl output:
Device Model: LITEON CV3-8D512-11 SATA 512GB
Serial Number: TW0956WWLOH006CU022Z
LU WWN Device Id: 5 002303 100ce15e0
Firmware Version: T89110D
User Capacity: 512,110,190,592 bytes [512 GB]
Sector Size: 512 bytes logical/physical
Rotation Rate: Solid State Device
Form Factor: M.2
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: ATA8-ACS, ATA/ATAPI-7 T13/1532D revision 4a
SATA Version is: SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Wed Jul 5 12:32:39 2017 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
...
SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG   VALUE WORST THRESH TYPE     UPDATED WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0003 100   100   000    Pre-fail Always  -           0
  9 Power_On_Hours          0x0002 100   100   000    Old_age  Always  -           327
 12 Power_Cycle_Count       0x0003 100   100   000    Pre-fail Always  -           335
175 Program_Fail_Count_Chip 0x0003 100   100   000    Pre-fail Always  -           0
176 Erase_Fail_Count_Chip   0x0003 100   100   000    Pre-fail Always  -           0
177 Wear_Leveling_Count     0x0003 100   100   000    Pre-fail Always  -           59
178 Used_Rsvd_Blk_Cnt_Chip  0x0003 100   100   000    Pre-fail Always  -           0
179 Used_Rsvd_Blk_Cnt_Tot   0x0003 100   100   000    Pre-fail Always  -           0
180 Unused_Rsvd_Blk_Cnt_Tot 0x0033 100   100   005    Pre-fail Always  -           2688
181 Program_Fail_Cnt_Total  0x0003 100   100   000    Pre-fail Always  -           0
182 Erase_Fail_Count_Total  0x0003 100   100   000    Pre-fail Always  -           0
187 Reported_Uncorrect      0x0003 100   100   000    Pre-fail Always  -           0
194 Temperature_Celsius     0x0003 100   100   000    Pre-fail Always  -           76
195 Hardware_ECC_Recovered  0x0003 100   100   000    Pre-fail Always  -           0
199 UDMA_CRC_Error_Count    0x0003 100   100   000    Pre-fail Always  -           0
238 Unknown_Attribute       0x0003 097   100   000    Pre-fail Always  -           3
241 Total_LBAs_Written      0x0003 100   100   000    Pre-fail Always  -           4293005286
242 Total_LBAs_Read         0x0003 100   100   000    Pre-fail Always  -           3510503294
SMART Error Log Version: 0
No Errors Logged
SMART Self-test log structure revision number 1
Num  Test_Description  Status                    Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline  Completed without error   00%        298              -
# 2  Short offline     Completed without error   00%        294              -
# 3  Offline           Interrupted (host reset)  80%        294              -
# 4  Offline           Interrupted (host reset)  10%        294              -
# 5  Short offline     Completed without error   00%        294              -
# 6  Short offline     Completed without error   00%        1                -
# 7  Short offline     Aborted by host           90%        1                -
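For comparing attribute values between runs (for example before and after reseating the drive), the attribute table could be read into a dictionary instead of being compared by eye. The sketch below assumes the input is the original one-row-per-line text as printed by `smartctl -A`, not the flattened form; `parse_smart_attributes` and `raw_int` are illustrative names, not part of smartmontools.

```python
import re

# One attribute row: ID, name, flag, value, worst, threshold, type,
# updated, when_failed, raw value (column layout as in the output above).
ATTRIBUTE_ROW = re.compile(
    r"^\s*(?P<id>\d+)\s+(?P<name>\S+)\s+(?P<flag>0x[0-9a-fA-F]+)\s+"
    r"(?P<value>\d+)\s+(?P<worst>\d+)\s+(?P<thresh>\d+)\s+"
    r"(?P<type>\S+)\s+(?P<updated>\S+)\s+(?P<when_failed>\S+)\s+"
    r"(?P<raw>.+?)\s*$"
)


def parse_smart_attributes(text):
    """Map attribute name to a dict of its columns; non-matching lines are skipped."""
    attributes = {}
    for line in text.splitlines():
        match = ATTRIBUTE_ROW.match(line)
        if match:
            row = match.groupdict()
            attributes[row["name"]] = row
    return attributes


def raw_int(attributes, name):
    """Return the leading integer of an attribute's raw value, or None."""
    row = attributes.get(name)
    if row is None:
        return None
    digits = re.match(r"\d+", row["raw"])
    return int(digits.group()) if digits else None
```

With two saved outputs, comparing `raw_int(before, "UDMA_CRC_Error_Count")` against `raw_int(after, "UDMA_CRC_Error_Count")` would show whether that counter moved between the two runs.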
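Ideas:
- Run a bad-block check
- Check the connection
I think I have solved this by removing the SSD, blowing air into the M.2 connector and reinserting the drive.
When I booted from USB into a rescue Debian, I noticed more detailed kernel debug messages. While searching, I found that most suggested solutions involve replacing the SATA cable. A laptop M.2 connection has no cable.
Some of the most important log text: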
exception Emask 0x10 SAct ... SErr ... action 0xe frozen
interface fatal error, PHY RDY changed
SError: { PHYRdyChg LinkSeq }
failed command: WRITE FPDMA QUEUED
Emask 0x10 (ATA bus error)
hard resetting link
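To see whether reseating the drive actually made these link errors go away, a saved kernel log could be scanned for the same phrases. This is a rough sketch: the marker strings are simply the ones quoted above, and `count_link_errors` is a hypothetical helper name. It does not try to interpret the messages, only to count the lines that contain them.

```python
from collections import Counter

# Phrases copied from the log excerpt in this post; the list is not exhaustive.
LINK_ERROR_MARKERS = (
    "interface fatal error",
    "PHY RDY changed",
    "PHYRdyChg",
    "ATA bus error",
    "hard resetting link",
)


def count_link_errors(log_text):
    """Count, per marker, how many log lines contain it."""
    counts = Counter()
    for line in log_text.splitlines():
        for marker in LINK_ERROR_MARKERS:
            if marker in line:
                counts[marker] += 1
    return counts
```

An empty result on a log covering a few days of uptime after the reseat would be consistent with a contact problem at the M.2 connector, although it would not prove it.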