Hy again!
Have a TW install on an Intel SSD
sudo smartctl -a /dev/sda
smartctl 6.6 2017-11-05 r4594 [x86_64-linux-5.1.7-1-default] (SUSE RPM)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Family: Intel 53x and Pro 2500 Series SSDs
Device Model: INTEL SSDSC2BW120H6
Serial Number: aaaaaaaaaaaaaaaaaaaaa
LU WWN Device Id: 5 5cd2e4 14cabbee6
Firmware Version: RG21
User Capacity: 120,034,123,776 bytes [120 GB]
Sector Size: 512 bytes logical/physical
Rotation Rate: Solid State Device
Device is: In smartctl database [for details use: -P show]
ATA Version is: ACS-3 (minor revision not indicated)
SATA Version is: SATA 3.2, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is: Fri Jun 14 07:45:27 2019 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection: Disabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 2930) seconds.
Offline data collection
capabilities: (0x7f) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Abort Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 58) minutes.
Conveyance self-test routine
recommended polling time: ( 4) minutes.
SCT capabilities: (0x0025) SCT Status supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
5 Reallocated_Sector_Ct 0x0032 100 100 000 Old_age Always - 0
9 Power_On_Hours_and_Msec 0x0032 100 100 000 Old_age Always - 14697h+00m+00.000s
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 350
170 Available_Reservd_Space 0x0033 081 100 010 Pre-fail Always - 0
171 Program_Fail_Count 0x0032 100 100 000 Old_age Always - 0
172 Erase_Fail_Count 0x0032 100 100 000 Old_age Always - 0
174 Unexpect_Power_Loss_Ct 0x0032 100 100 000 Old_age Always - 162
183 SATA_Downshift_Count 0x0032 100 100 000 Old_age Always - 11
184 End-to-End_Error 0x0033 100 100 090 Pre-fail Always - 0
187 Uncorrectable_Error_Cnt 0x0032 100 100 000 Old_age Always - 0
190 Airflow_Temperature_Cel 0x0032 026 100 000 Old_age Always - 26 (Min/Max 13/35)
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 162
199 UDMA_CRC_Error_Count 0x0032 100 100 000 Old_age Always - 0
225 Host_Writes_32MiB 0x0032 100 100 000 Old_age Always - 47016
226 Workld_Media_Wear_Indic 0x0032 100 100 000 Old_age Always - 65535
227 Workld_Host_Reads_Perc 0x0032 100 100 000 Old_age Always - 40
228 Workload_Minutes 0x0032 100 100 000 Old_age Always - 65535
232 Available_Reservd_Space 0x0033 081 100 010 Pre-fail Always - 0
233 Media_Wearout_Indicator 0x0032 083 100 000 Old_age Always - 0
241 Host_Writes_32MiB 0x0032 100 100 000 Old_age Always - 47016
242 Host_Reads_32MiB 0x0032 100 100 000 Old_age Always - 30561
249 NAND_Writes_1GiB 0x0032 100 100 000 Old_age Always - 32383
…which is not tooooooooooooooo old. But for the last 3-4 days once daily the file system went read-only and the machine became unresponsive.
When the system is in trouble I see in the console something like
systemd-journal ... : Failed to write entry (24 items...), ignoring: Read-only file system
print_req_error: I/O error, dev sda, sectro ... flags 01
Buffer I/O error on dev sda3, logical block 0, lost sync page write
EXT4-fs (sda3): I/O error while writing super block
EXT4-fs error (device sda3): ext4_journal_check_start:61: Detected aborted journal
EXT4-fs (sda3): Remounting filesystem read-only
.....
After rebooting the system is fine for some time. I see nothing in dmesg after the reboot.
Is the SSD end-of-life? Or another problem with the EXT4 file system I use on / and /home (sda3) (third partition is a 2.01 GB swap partition).
Any help highly appreciated, in the meantime I will prepare for a fresh install on an new SSD.