Well, I’ve had two freezes earlier (I’m currently booted into the other TW instance, on the other drive). After the last freeze I rebooted and captured the journal logs. I think I’ve found something wrong with the NVMe drive that holds that other TW instance.
However, I downloaded a tool called HDSentinel to check that NVMe drive.
It’s odd, but it reports the drive is “perfect”.
The first code block is the output of HDSentinel. The second code block is the last few lines of the journal log just before the freeze; you will see the BTRFS errors at the end, before the next boot-up sequence begins.
Any thoughts? Maybe the BTRFS filesystem for / is going bad? (My /home is separate.)
# ./HDSentinel
Hard Disk Sentinel for LINUX console 0.20.10851 (c) 2023 info@hdsentinel.com
Start with -r [reportfile] to save data to report, -h for help
Examining hard disk configuration ...
HDD Device 0: /dev/nvme0
HDD Model ID : Samsung SSD 970 EVO 500GB
HDD Serial No: xxxxxxxxxxxxxxx
HDD Revision : 1B2QEXE7
HDD Size : 476940 MB
Interface : NVMe
Temperature : 41 °C
Highest Temp.: 41 °C
Health : 100 %
Performance : 100 %
Power on time: 105 days, 0 hours
Est. lifetime: more than 1000 days
Total written: 9.46 TB
The status of the solid state disk is PERFECT. Problematic or weak sectors were not found.
The health is determined by SSD specific S.M.A.R.T. attribute(s): Available Spare (Percent), Percentage Used
No actions needed.
journal logs
Sep 16 09:04:40 systemd[1]: Finished Backup /etc/sysconfig directory.
Sep 16 09:10:02 smartd[1068]: Device: /dev/sda [SAT], old test of type S not run at
Sep 16 09:10:02 smartd[1068]: Device: /dev/sda [SAT], starting scheduled Short Self-Test.
Sep 16 09:26:08 systemd[1]: Starting Backup RPM database...
Sep 16 09:26:08 kernel: BTRFS warning (device nvme0n1p2): csum failed root 533 ino 9541247 off 202940416 csum 0xc4672adb expected csum 0x1284964b mirror 1
Sep 16 09:26:08 kernel: BTRFS error (device nvme0n1p2): bdev /dev/nvme0n1p2 errs: wr 0, rd 0, flush 0, corrupt 1722, gen 0
Sep 16 09:26:08 backup-rpmdb[6015]: cat: /usr/lib/sysimage/rpm/Packages.db: Input/output error
Sep 16 09:26:08 kernel: BTRFS warning (device nvme0n1p2): csum failed root 533 ino 9541247 off 202940416 csum 0xc4672adb expected csum 0x1284964b mirror 1
Sep 16 09:26:08 kernel: BTRFS error (device nvme0n1p2): bdev /dev/nvme0n1p2 errs: wr 0, rd 0, flush 0, corrupt 1723, gen 0
Sep 16 09:26:08 kernel: BTRFS warning (device nvme0n1p2): csum failed root 533 ino 9541247 off 202940416 csum 0xc4672adb expected csum 0x1284964b mirror 1
Sep 16 09:26:08 kernel: BTRFS error (device nvme0n1p2): bdev /dev/nvme0n1p2 errs: wr 0, rd 0, flush 0, corrupt 1724, gen 0
Sep 16 09:26:09 kernel: BTRFS warning (device nvme0n1p2): csum failed root 533 ino 9541247 off 202940416 csum 0xc4672adb expected csum 0x1284964b mirror 1
Sep 16 09:26:09 kernel: BTRFS error (device nvme0n1p2): bdev /dev/nvme0n1p2 errs: wr 0, rd 0, flush 0, corrupt 1725, gen 0
Sep 16 09:26:09 kernel: BTRFS warning (device nvme0n1p2): csum failed root 533 ino 9541247 off 202940416 csum 0xc4672adb expected csum 0x1284964b mirror 1
Sep 16 09:26:09 kernel: BTRFS error (device nvme0n1p2): bdev /dev/nvme0n1p2 errs: wr 0, rd 0, flush 0, corrupt 1726, gen 0
Sep 16 09:26:18 backup-rpmdb[6021]: gzip: stdin: Input/output error
Sep 16 09:26:18 backup-rpmdb[6011]: ERROR!! can not backup RPM Database to /var/adm/backup/rpmdb.
Sep 16 09:26:18 backup-rpmdb[6011]: Maybe there is not enough disk space.
Sep 16 09:26:18 kernel: BTRFS warning (device nvme0n1p2): csum failed root 533 ino 9541247 off 202940416 csum 0xc4672adb expected csum 0x1284964b mirror 1
Sep 16 09:26:18 kernel: BTRFS error (device nvme0n1p2): bdev /dev/nvme0n1p2 errs: wr 0, rd 0, flush 0, corrupt 1727, gen 0
Sep 16 09:26:18 kernel: BTRFS warning (device nvme0n1p2): csum failed root 533 ino 9541247 off 202940416 csum 0xc4672adb expected csum 0x1284964b mirror 1
Sep 16 09:26:18 kernel: BTRFS error (device nvme0n1p2): bdev /dev/nvme0n1p2 errs: wr 0, rd 0, flush 0, corrupt 1728, gen 0
Sep 16 09:26:18 kernel: BTRFS warning (device nvme0n1p2): csum failed root 533 ino 9541247 off 202940416 csum 0xc4672adb expected csum 0x1284964b mirror 1
Sep 16 09:26:18 kernel: BTRFS error (device nvme0n1p2): bdev /dev/nvme0n1p2 errs: wr 0, rd 0, flush 0, corrupt 1729, gen 0
Sep 16 09:26:18 systemd[1]: backup-rpmdb.service: Deactivated successfully.
Sep 16 09:26:18 systemd[1]: Finished Backup RPM database.
Sep 16 09:26:18 systemd[1]: backup-rpmdb.service: Consumed 10.065s CPU time.
Sep 16 09:30:14 wickedd-dhcp6[1180]: enp6s0: Committing DHCPv6 lease with:
Sep 16 09:30:14 wickedd-dhcp6[1180]: enp6s0 +ia-na.address xxxx:xxxx:9e0:xxxx::23/0, pref-lft 3600, valid-lft 3600
---------- logging stopped, then froze some minutes later ---------------
-- Boot xxxxxxxxxxxd04e6fa9725ba44f1416ac --
Sep 16 09:59:29 kernel: Linux version 6.5.2-1-default (geeko@buildhost) (gcc (SUSE Linux)
Sep 16 09:59:29 kernel: Command line: BOOT_IMAGE=/boot/vmlinuz-6.5.2-1-default root=UUID=78
Sep 16 09:59:29 kernel: BIOS-provided physical RAM map:
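To make the repeated BTRFS lines easier to read, here is a minimal sketch that groups the "csum failed" warnings by device, root, inode and offset, and keeps the most recent error counters. The choice of Python, the function name `summarize`, and the regular expressions are my own assumptions based only on the line format visible in the journal excerpt above; this is not an official btrfs tool.

```python
import re
from collections import Counter

# Matches the kernel's "csum failed" warning as it appears in the excerpt.
CSUM_RE = re.compile(
    r"BTRFS warning \(device (?P<dev>[^)]+)\): csum failed "
    r"root (?P<root>\d+) ino (?P<ino>\d+) off (?P<off>\d+)"
)
# Matches the per-device error counter line that follows each warning.
ERRS_RE = re.compile(
    r"BTRFS error \(device [^)]+\): bdev (?P<bdev>\S+) errs: "
    r"wr (?P<wr>\d+), rd (?P<rd>\d+), flush (?P<flush>\d+), "
    r"corrupt (?P<corrupt>\d+), gen (?P<gen>\d+)"
)


def summarize(lines):
    """Return (failure counts per location, last seen error counters)."""
    failures = Counter()
    last_counters = None
    for line in lines:
        m = CSUM_RE.search(line)
        if m:
            key = (m["dev"], int(m["root"]), int(m["ino"]), int(m["off"]))
            failures[key] += 1
            continue
        m = ERRS_RE.search(line)
        if m:
            last_counters = {
                name: int(value)
                for name, value in m.groupdict().items()
                if name != "bdev"
            }
    return failures, last_counters


# Four lines copied verbatim from the journal excerpt in this post.
SAMPLE = [
    "Sep 16 09:26:08 kernel: BTRFS warning (device nvme0n1p2): csum failed root 533 ino 9541247 off 202940416 csum 0xc4672adb expected csum 0x1284964b mirror 1",
    "Sep 16 09:26:08 kernel: BTRFS error (device nvme0n1p2): bdev /dev/nvme0n1p2 errs: wr 0, rd 0, flush 0, corrupt 1722, gen 0",
    "Sep 16 09:26:08 kernel: BTRFS warning (device nvme0n1p2): csum failed root 533 ino 9541247 off 202940416 csum 0xc4672adb expected csum 0x1284964b mirror 1",
    "Sep 16 09:26:08 kernel: BTRFS error (device nvme0n1p2): bdev /dev/nvme0n1p2 errs: wr 0, rd 0, flush 0, corrupt 1723, gen 0",
]

failures, counters = summarize(SAMPLE)
for (dev, root, ino, off), count in failures.items():
    print(f"{dev}: root {root} ino {ino} off {off} failed {count} time(s)")
print("latest counters:", counters)
```

In the excerpt above, every csum failure points at the same root, inode and offset, and only the `corrupt` counter climbs while `wr`, `rd` and `flush` stay at 0. To run the sketch on the real logs, one option (assuming the previous boot is still in the journal) is to pipe `journalctl -k -b -1` into a script that passes `sys.stdin` to `summarize`.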