System Crash

I experienced a system crash this evening. I received a couple of pop-up messages similar to ‘CPU#3 stuck for 26s’.
Shortly after these messages my PC locked up, and even REISUB didn’t work. I had to shut down using the PC’s power button. The PC rebooted successfully and has been working OK since.

There are many messages in journalctl at 18:58:11 and 18:58:12, none of which make sense to me.

journalctl starts with:


Feb 02 18:58:11 WHA-PC kernel: WARNING: CPU: 1 PID: 4935 at mm/truncate.c:405 truncate_inode_pages_range+0x2d5/0x710
Feb 02 18:58:11 WHA-PC kernel: Modules linked in: ntfs3 uas usb_storage snd_seq_dummy snd_seq cmac nls_utf8 cifs cifs_arc4 cifs_md4 dns_resolver fscache netfs af_packet nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_tables ebtable_nat ebtable_broute ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c iptable_mangle iptable_raw iptable_security ip_set nfnetlink ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter bpfilter vboxnetadp(O) vboxnetflt(O) vboxdrv(O) nct6775 hwmon_vid dmi_sysfs intel_rapl_msr intel_rapl_common intel_tcc_cooling snd_hda_codec_hdmi x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel mei_pxp snd_hda_codec_realtek mei_hdcp iTCO_wdt snd_hda_codec_generic intel_pmc_bxt iTCO_vendor_support ee1004 ledtrig_audio kvm snd_usb_audio snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi snd_usbmidi_lib
Feb 02 18:58:11 WHA-PC kernel:  snd_hda_codec irqbypass snd_rawmidi eeepc_wmi asus_wmi snd_hda_core snd_seq_device mc snd_hwdep battery sparse_keymap snd_pcm platform_profile pcspkr rfkill snd_timer wmi_bmof efi_pstore snd i2c_i801 soundcore mxm_wmi r8169 i2c_smbus realtek mdio_devres mei_me libphy mei thermal fan tiny_power_button intel_pmc_core button acpi_pad nls_iso8859_1 nls_cp437 vfat fat fuse configfs ip_tables x_tables ext4 mbcache jbd2 hid_jabra hid_generic usbhid crct10dif_pclmul crc32_pclmul crc32c_intel i915 ghash_clmulni_intel i2c_algo_bit ttm drm_kms_helper aesni_intel crypto_simd syscopyarea cryptd sysfillrect sysimgblt fb_sys_fops cec xhci_pci xhci_pci_renesas rc_core serio_raw xhci_hcd drm usbcore video wmi sg dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua msr efivarfs
Feb 02 18:58:11 WHA-PC kernel: CPU: 1 PID: 4935 Comm: Cache2 I/O Tainted: G           O      5.16.2-1-default #1 openSUSE Tumbleweed b40a195b7ff0f3399a616c3290f963c4ad189e84
Feb 02 18:58:11 WHA-PC kernel: Hardware name: System manufacturer System Product Name/PRIME B250-PRO, BIOS 1006 02/22/2018
Feb 02 18:58:11 WHA-PC kernel: RIP: 0010:truncate_inode_pages_range+0x2d5/0x710
Feb 02 18:58:11 WHA-PC kernel: Code: 82 49 8b 47 08 a8 01 48 8d 48 ff 49 0f 44 cf 48 8b 41 20 48 c1 e0 06 4c 01 f8 48 29 c8 48 c1 f8 06 49 39 c4 0f 84 66 ff ff ff <0f> 0b e9 5f ff ff ff 48 81 bd 98 00 00 00 00 c2 27 b3 74 16 4c 89
Feb 02 18:58:11 WHA-PC kernel: RSP: 0018:ffffb44041b5bc90 EFLAGS: 00010283
Feb 02 18:58:11 WHA-PC kernel: RAX: 00000007f1c933a6 RBX: ffffb44041b5bd20 RCX: 0000000000000006
Feb 02 18:58:11 WHA-PC kernel: RDX: 0000000080000000 RSI: 0000000000000000 RDI: ffffd4428b8437c0
Feb 02 18:58:11 WHA-PC kernel: RBP: ffff9968fb3e22f0 R08: fffffffffffffffe R09: ffffffffffffffc0
Feb 02 18:58:11 WHA-PC kernel: R10: 0000000000001000 R11: 0000000000000003 R12: 0000000000000689
Feb 02 18:58:11 WHA-PC kernel: R13: ffffb44041b5bca8 R14: 0000000000000000 R15: ffffd4428b8437c0
Feb 02 18:58:11 WHA-PC kernel: FS:  00007fb92c16e640(0000) GS:ffff996fd6c80000(0000) knlGS:0000000000000000
Feb 02 18:58:11 WHA-PC kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 02 18:58:11 WHA-PC kernel: CR2: 000009a100269f00 CR3: 0000000196f6c004 CR4: 00000000003706e0
Feb 02 18:58:11 WHA-PC kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Feb 02 18:58:11 WHA-PC kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Feb 02 18:58:11 WHA-PC kernel: Call Trace:
Feb 02 18:58:11 WHA-PC kernel:  <TASK>
Feb 02 18:58:11 WHA-PC kernel:  ext4_evict_inode+0x175/0x6c0 [ext4 645d83c608192e8e1d3ec86230d5ca5a465aafb0]
Feb 02 18:58:11 WHA-PC kernel:  evict+0xc3/0x1c0
Feb 02 18:58:11 WHA-PC kernel:  do_unlinkat+0x1d8/0x2d0
Feb 02 18:58:11 WHA-PC kernel:  __x64_sys_unlink+0x3e/0x60
Feb 02 18:58:11 WHA-PC kernel:  ? __ia32_sys_unlink+0x60/0x60
Feb 02 18:58:11 WHA-PC kernel:  do_syscall_64+0x5c/0x80
Feb 02 18:58:11 WHA-PC kernel:  ? syscall_exit_to_user_mode+0x18/0x40
Feb 02 18:58:11 WHA-PC kernel:  ? do_syscall_64+0x69/0x80
Feb 02 18:58:11 WHA-PC kernel:  ? do_syscall_64+0x69/0x80
Feb 02 18:58:11 WHA-PC kernel:  ? do_syscall_64+0x69/0x80
Feb 02 18:58:11 WHA-PC kernel:  ? do_syscall_64+0x69/0x80
Feb 02 18:58:11 WHA-PC kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xae
Feb 02 18:58:11 WHA-PC kernel: RIP: 0033:0x7fb946e3f1cb
Feb 02 18:58:11 WHA-PC kernel: Code: f0 ff ff 73 01 c3 48 8b 0d 4a bc 0f 00 f7 d8 64 89 01 48 83 c8 ff c3 0f 1f 84 00 00 00 00 00 f3 0f 1e fa b8 57 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1d bc 0f 00 f7 d8 64 89 01 48
Feb 02 18:58:11 WHA-PC kernel: RSP: 002b:00007fb92c16d958 EFLAGS: 00000206 ORIG_RAX: 0000000000000057
Feb 02 18:58:11 WHA-PC kernel: RAX: ffffffffffffffda RBX: 00007fb8e755d7c0 RCX: 00007fb946e3f1cb
Feb 02 18:58:11 WHA-PC kernel: RDX: 0000000000000000 RSI: 00007fb8e7394308 RDI: 00007fb8e7394308
Feb 02 18:58:11 WHA-PC kernel: RBP: 0000000000000000 R08: 0000000000000001 R09: 0000000000000011
Feb 02 18:58:11 WHA-PC kernel: R10: 0000000000000100 R11: 0000000000000206 R12: 0000000000000000
Feb 02 18:58:11 WHA-PC kernel: R13: 00007fb9441ab010 R14: 00007fb930c91dc0 R15: 00007fb930c91dc0
Feb 02 18:58:11 WHA-PC kernel:  </TASK>
Feb 02 18:58:11 WHA-PC kernel: ---[ end trace c8d81ab87508bdeb ]---

The above repeats many times

and finishes with:


Feb 02 18:58:12 WHA-PC kernel: BUG: unable to handle page fault for address: 0000000000001034
Feb 02 18:58:12 WHA-PC kernel: #PF: supervisor read access in kernel mode
Feb 02 18:58:12 WHA-PC kernel: #PF: error_code(0x0000) - not-present page
Feb 02 18:58:12 WHA-PC kernel: PGD 0 P4D 0 
Feb 02 18:58:12 WHA-PC kernel: Oops: 0000 [#1] PREEMPT SMP PTI
Feb 02 18:58:12 WHA-PC kernel: CPU: 0 PID: 4935 Comm: Cache2 I/O Tainted: G        W  O      5.16.2-1-default #1 openSUSE Tumbleweed b40a195b7ff0f3399a616c3290f963c4ad189e84
Feb 02 18:58:12 WHA-PC kernel: Hardware name: System manufacturer System Product Name/PRIME B250-PRO, BIOS 1006 02/22/2018
Feb 02 18:58:12 WHA-PC kernel: RIP: 0010:find_get_entries+0x110/0x260
Feb 02 18:58:12 WHA-PC kernel: Code: 48 8d 7c 24 08 e8 c0 10 32 00 48 89 c5 48 3d 06 04 00 00 74 e8 48 3d 02 04 00 00 74 58 48 85 c0 0f 84 95 00 00 00 a8 01 75 ad <8b> 40 34 85 c0 74 44 8d 50 01 f0 0f b1 55 34 75 f2 48 8b 54 24 20
Feb 02 18:58:12 WHA-PC kernel: RSP: 0018:ffffb44041b5bc18 EFLAGS: 00010246
Feb 02 18:58:12 WHA-PC kernel: RAX: 0000000000001000 RBX: fffffffffffffffe RCX: 0000000000000000
Feb 02 18:58:12 WHA-PC kernel: RDX: 0000000000000008 RSI: ffff996866b6fd98 RDI: ffffb44041b5bc20
Feb 02 18:58:12 WHA-PC kernel: RBP: 0000000000001000 R08: fffffffffffffffe R09: ffffffffffffffc0
Feb 02 18:58:12 WHA-PC kernel: R10: 0000000000000688 R11: 00000000000002f5 R12: 0000000000000000
Feb 02 18:58:12 WHA-PC kernel: R13: ffffb44041b5bd20 R14: ffffb44041b5bca8 R15: 0000000000000001
Feb 02 18:58:12 WHA-PC kernel: FS:  00007fb92c16e640(0000) GS:ffff996fd6c00000(0000) knlGS:0000000000000000
Feb 02 18:58:12 WHA-PC kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 02 18:58:12 WHA-PC kernel: CR2: 0000000000001034 CR3: 0000000196f6c005 CR4: 00000000003706f0
Feb 02 18:58:12 WHA-PC kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Feb 02 18:58:12 WHA-PC kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Feb 02 18:58:12 WHA-PC kernel: Call Trace:
Feb 02 18:58:12 WHA-PC kernel:  <TASK>
Feb 02 18:58:12 WHA-PC kernel:  truncate_inode_pages_range+0x1be/0x710
Feb 02 18:58:12 WHA-PC kernel:  ext4_evict_inode+0x175/0x6c0 [ext4 645d83c608192e8e1d3ec86230d5ca5a465aafb0]
Feb 02 18:58:12 WHA-PC kernel:  evict+0xc3/0x1c0
Feb 02 18:58:12 WHA-PC kernel:  do_unlinkat+0x1d8/0x2d0
Feb 02 18:58:12 WHA-PC kernel:  __x64_sys_unlink+0x3e/0x60
Feb 02 18:58:12 WHA-PC kernel:  ? __ia32_sys_unlink+0x60/0x60
Feb 02 18:58:12 WHA-PC kernel:  do_syscall_64+0x5c/0x80
Feb 02 18:58:12 WHA-PC kernel:  ? syscall_exit_to_user_mode+0x18/0x40
Feb 02 18:58:12 WHA-PC kernel:  ? do_syscall_64+0x69/0x80
Feb 02 18:58:12 WHA-PC kernel:  ? do_syscall_64+0x69/0x80
Feb 02 18:58:12 WHA-PC kernel:  ? do_syscall_64+0x69/0x80
Feb 02 18:58:12 WHA-PC kernel:  ? do_syscall_64+0x69/0x80
Feb 02 18:58:12 WHA-PC kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xae
Feb 02 18:58:12 WHA-PC kernel: RIP: 0033:0x7fb946e3f1cb
Feb 02 18:58:12 WHA-PC kernel: Code: f0 ff ff 73 01 c3 48 8b 0d 4a bc 0f 00 f7 d8 64 89 01 48 83 c8 ff c3 0f 1f 84 00 00 00 00 00 f3 0f 1e fa b8 57 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1d bc 0f 00 f7 d8 64 89 01 48
Feb 02 18:58:12 WHA-PC kernel: RSP: 002b:00007fb92c16d958 EFLAGS: 00000206 ORIG_RAX: 0000000000000057
Feb 02 18:58:12 WHA-PC kernel: RAX: ffffffffffffffda RBX: 00007fb8e755d7c0 RCX: 00007fb946e3f1cb
Feb 02 18:58:12 WHA-PC kernel: RDX: 0000000000000000 RSI: 00007fb8e7394308 RDI: 00007fb8e7394308
Feb 02 18:58:12 WHA-PC kernel: RBP: 0000000000000000 R08: 0000000000000001 R09: 0000000000000011
Feb 02 18:58:12 WHA-PC kernel: R10: 0000000000000100 R11: 0000000000000206 R12: 0000000000000000
Feb 02 18:58:12 WHA-PC kernel: R13: 00007fb9441ab010 R14: 00007fb930c91dc0 R15: 00007fb930c91dc0
Feb 02 18:58:12 WHA-PC kernel:  </TASK>
Feb 02 18:58:12 WHA-PC kernel: Modules linked in: ntfs3 uas usb_storage snd_seq_dummy snd_seq cmac nls_utf8 cifs cifs_arc4 cifs_md4 dns_resolver fscache netfs af_packet nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_tables ebtable_nat ebtable_broute ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c iptable_mangle iptable_raw iptable_security ip_set nfnetlink ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter bpfilter vboxnetadp(O) vboxnetflt(O) vboxdrv(O) nct6775 hwmon_vid dmi_sysfs intel_rapl_msr intel_rapl_common intel_tcc_cooling snd_hda_codec_hdmi x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel mei_pxp snd_hda_codec_realtek mei_hdcp iTCO_wdt snd_hda_codec_generic intel_pmc_bxt iTCO_vendor_support ee1004 ledtrig_audio kvm snd_usb_audio snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi snd_usbmidi_lib
Feb 02 18:58:12 WHA-PC kernel:  snd_hda_codec irqbypass snd_rawmidi eeepc_wmi asus_wmi snd_hda_core snd_seq_device mc snd_hwdep battery sparse_keymap snd_pcm platform_profile pcspkr rfkill snd_timer wmi_bmof efi_pstore snd i2c_i801 soundcore mxm_wmi r8169 i2c_smbus realtek mdio_devres mei_me libphy mei thermal fan tiny_power_button intel_pmc_core button acpi_pad nls_iso8859_1 nls_cp437 vfat fat fuse configfs ip_tables x_tables ext4 mbcache jbd2 hid_jabra hid_generic usbhid crct10dif_pclmul crc32_pclmul crc32c_intel i915 ghash_clmulni_intel i2c_algo_bit ttm drm_kms_helper aesni_intel crypto_simd syscopyarea cryptd sysfillrect sysimgblt fb_sys_fops cec xhci_pci xhci_pci_renesas rc_core serio_raw xhci_hcd drm usbcore video wmi sg dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua msr efivarfs
Feb 02 18:58:12 WHA-PC kernel: CR2: 0000000000001034
Feb 02 18:58:12 WHA-PC kernel: ---[ end trace c8d81ab87508d343 ]---
Feb 02 18:58:12 WHA-PC kernel: RIP: 0010:find_get_entries+0x110/0x260
Feb 02 18:58:12 WHA-PC kernel: Code: 48 8d 7c 24 08 e8 c0 10 32 00 48 89 c5 48 3d 06 04 00 00 74 e8 48 3d 02 04 00 00 74 58 48 85 c0 0f 84 95 00 00 00 a8 01 75 ad <8b> 40 34 85 c0 74 44 8d 50 01 f0 0f b1 55 34 75 f2 48 8b 54 24 20
Feb 02 18:58:12 WHA-PC kernel: RSP: 0018:ffffb44041b5bc18 EFLAGS: 00010246
Feb 02 18:58:12 WHA-PC kernel: RAX: 0000000000001000 RBX: fffffffffffffffe RCX: 0000000000000000
Feb 02 18:58:12 WHA-PC kernel: RDX: 0000000000000008 RSI: ffff996866b6fd98 RDI: ffffb44041b5bc20
Feb 02 18:58:12 WHA-PC kernel: RBP: 0000000000001000 R08: fffffffffffffffe R09: ffffffffffffffc0
Feb 02 18:58:12 WHA-PC kernel: R10: 0000000000000688 R11: 00000000000002f5 R12: 0000000000000000
Feb 02 18:58:12 WHA-PC kernel: R13: ffffb44041b5bd20 R14: ffffb44041b5bca8 R15: 0000000000000001
Feb 02 18:58:12 WHA-PC kernel: FS:  00007fb92c16e640(0000) GS:ffff996fd6c00000(0000) knlGS:0000000000000000
Feb 02 18:58:12 WHA-PC kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 02 18:58:12 WHA-PC kernel: CR2: 0000000000001034 CR3: 0000000196f6c005 CR4: 00000000003706f0
Feb 02 18:58:12 WHA-PC kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Feb 02 18:58:12 WHA-PC kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Feb 02 18:58:11 WHA-PC rsyslogd[3413]:  message repeated 10 times: [-- MARK --]
Feb 02 18:58:11 WHA-PC rsyslogd[3413]: action 'action-1-builtin:ompipe' suspended (module 'builtin:ompipe'), retry 0. There should be messages before this one giving the reason for suspension. [v8.2110.0 try https://www.rsyslog.com/e/2007 ]
Feb 02 18:58:11 WHA-PC rsyslogd[3413]: action 'action-1-builtin:ompipe' suspended (module 'builtin:ompipe'), next retry is Wed Feb  2 18:58:41 2022, retry nbr 0. There should be messages before this one giving the reason for suspension. [v8.2110.0 try https://www.rsyslog.com/e/2007 ]
Feb 02 18:58:11 WHA-PC rsyslogd[3413]: main Q:Reg: high activity - starting 1 additional worker thread(s), currently 1 active worker threads. [v8.2110.0 try https://www.rsyslog.com/e/2439 ]
Feb 02 18:58:11 WHA-PC rsyslogd[3413]: action 'action-1-builtin:ompipe' suspended (module 'builtin:ompipe'), retry 0. There should be messages before this one giving the reason for suspension. [v8.2110.0 try https://www.rsyslog.com/e/2007 ]
Feb 02 18:58:11 WHA-PC rsyslogd[3413]: action 'action-1-builtin:ompipe' suspended (module 'builtin:ompipe'), next retry is Wed Feb  2 18:58:41 2022, retry nbr 0. There should be messages before this one giving the reason for suspension. [v8.2110.0 try https://www.rsyslog.com/e/2007 ]
Feb 02 18:58:17 WHA-PC kernel: watchdog: BUG: soft lockup - CPU#3 stuck for 26s! [kioslave5:11378]

Can anyone help with this please?
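
For anyone sifting through a journal like this one, it can help to reduce the flood of trace lines to just the headline events (the WARNING, the page fault, and the soft lockup). The following is only a minimal sketch, assuming the log has been saved as plain text; the function names and the set of patterns are my own and cover only the message types quoted in this thread.

```python
import re
from collections import Counter

# Patterns for the headline lines quoted in this thread. This is an
# illustrative, partial set, not a complete list of kernel messages.
PATTERNS = {
    "warning": re.compile(r"WARNING: CPU: \d+ PID: \d+ at (\S+)"),
    "page_fault": re.compile(
        r"BUG: unable to handle page fault for address: (\S+)"
    ),
    "soft_lockup": re.compile(r"soft lockup - (CPU#\d+) stuck for \d+s"),
}


def summarize(lines):
    """Count headline kernel events, keyed by (kind, detail)."""
    counts = Counter()
    for line in lines:
        for kind, pattern in PATTERNS.items():
            match = pattern.search(line)
            if match:
                counts[(kind, match.group(1))] += 1
                break  # one headline per log line is enough
    return counts


def print_summary(lines):
    """Print the events, most frequent first."""
    for (kind, detail), n in summarize(lines).most_common():
        print(f"{n:4d}  {kind:12s} {detail}")
```

Calling `print_summary(sys.stdin)` from a small script and piping in something like `journalctl -k -b -1` (kernel messages from the previous boot) should show how many times each warning location repeated before the final page fault.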

Hmm, I experienced something similar with kernel-default 5.16.2-1-default Tumbleweed 20220128. The first hint was also:

watchdog: BUG: soft lockup - CPU#0 stuck for 26s! [chrome:6272]

But after that the cause appears quite different:

[14580.680825] Modules linked in: rpcsec_gss_krb5 af_packet nft_objref nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_tables ebtable_nat ebtable_broute ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c iptable_mangle iptable_raw iptable_security vboxnetadp(O) vboxnetflt(O) ip_set nfnetlink ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter bpfilter vboxdrv(O) dmi_sysfs it87 hwmon_vid msr nvidia_drm(POE) nvidia_modeset(POE) nvidia_uvm(POE) edac_mce_amd kvm_amd ccp joydev kvm nvidia(POE) irqbypass snd_hda_codec_realtek eeepc_wmi asus_wmi battery sparse_keymap platform_profile rfkill snd_hda_codec_generic pcspkr video ledtrig_audio wmi_bmof snd_hda_codec_hdmi efi_pstore drm_kms_helper k10temp fam15h_power snd_hda_intel cec snd_intel_dspcfg
[14580.680888]  snd_intel_sdw_acpi r8169 rc_core i2c_piix4 snd_hda_codec syscopyarea sysfillrect realtek mdio_devres sysimgblt fb_sys_fops snd_hda_core libphy tiny_power_button acpi_cpufreq nls_iso8859_1 nls_cp437 vfat fat nfsd auth_rpcgss nfs_acl lockd grace sunrpc drm fuse configfs ip_tables x_tables ext4 mbcache jbd2 hid_generic usbhid uas usb_storage crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel ohci_pci aesni_intel crypto_simd cryptd firewire_ohci firewire_core sp5100_tco ohci_hcd ehci_pci crc_itu_t ehci_hcd xhci_pci xhci_pci_renesas xhci_hcd wmi button snd_usb_audio snd_hwdep snd_usbmidi_lib snd_pcm snd_timer mc snd_rawmidi snd_seq_device snd soundcore usbcore sg dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua ledtrig_timer i2c_dev efivarfs
[14580.680944] CPU: 0 PID: 6272 Comm: chrome Tainted: P           OE     5.16.2-1-default #1 openSUSE Tumbleweed b40a195b7ff0f3399a616c3290f963c4ad189e84
[14580.680949] Hardware name: To be filled by O.E.M. To be filled by O.E.M./M5A97 EVO, BIOS 1604 10/16/2012
[14580.680950] RIP: 0010:_nv035891rm+0xaa/0xe0 [nvidia]
[14580.681514] Code: 49 8b 4c 24 20 48 89 c2 48 89 ef 48 8d b1 48 01 00 00 4c 89 e9 e8 86 5b ff ff 66 0f 1f 44 00 00 48 89 ef e8 e8 5b ff ff 84 c0 <74> 8a 48 8b 75 00 48 39 5e 08 75 ea 4c 39 26 75 e5 49 8b 44 24 20
[14580.681516] RSP: 0018:ffffa1ed4565fb40 EFLAGS: 00000202
[14580.681518] RAX: 0000000000000001 RBX: ffff96764f288430 RCX: ffff96753be01178
[14580.681520] RDX: ffffddd3c6b356c8 RSI: ffffddd3c6b378f5 RDI: ffff96739e7f2d20
[14580.681521] RBP: ffff96739e7f2d20 R08: 0000000000000020 R09: ffff96739e7f2d28
[14580.681522] R10: 0000000000000000 R11: 0000000000000000 R12: ffff96739e794438
[14580.681523] R13: ffffddd3caefa66d R14: ffff96739e7f2d98 R15: ffff96764f288430
[14580.681525] FS:  00007f6aefa15e00(0000) GS:ffff9676aec00000(0000) knlGS:0000000000000000
[14580.681526] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[14580.681528] CR2: 0000561b4c386f20 CR3: 0000000239610000 CR4: 00000000000406f0
[14580.681529] Call Trace:
[14580.681531]  <TASK>
[14580.681534]  ? _nv014660rm+0x2ee/0x770 [nvidia 34b1cf319600eafc7c9f2d117983f98ef8ab404f]
[14580.682009]  ? _nv037748rm+0xb3/0x150 [nvidia 34b1cf319600eafc7c9f2d117983f98ef8ab404f]
[14580.682463]  ? _nv037747rm+0x297/0x4e0 [nvidia 34b1cf319600eafc7c9f2d117983f98ef8ab404f]
[14580.682937]  ? _nv037742rm+0x60/0x70 [nvidia 34b1cf319600eafc7c9f2d117983f98ef8ab404f]
[14580.683409]  ? _nv037743rm+0x7b/0xb0 [nvidia 34b1cf319600eafc7c9f2d117983f98ef8ab404f]
[14580.683886]  ? _nv036103rm+0x40/0xe0 [nvidia 34b1cf319600eafc7c9f2d117983f98ef8ab404f]
[14580.684247]  ? _nv000699rm+0x68/0x80 [nvidia 34b1cf319600eafc7c9f2d117983f98ef8ab404f]
[14580.684652]  ? rm_cleanup_file_private+0xea/0x160 [nvidia 34b1cf319600eafc7c9f2d117983f98ef8ab404f]
[14580.685085]  ? nvidia_close+0x150/0x310 [nvidia 34b1cf319600eafc7c9f2d117983f98ef8ab404f]
[14580.685457]  ? nvidia_frontend_close+0x2b/0x50 [nvidia 34b1cf319600eafc7c9f2d117983f98ef8ab404f]
[14580.685833]  ? __fput+0x94/0x250
[14580.685838]  ? task_work_run+0x5c/0x90
[14580.685841]  ? do_exit+0x374/0xa90
[14580.685844]  ? do_group_exit+0x33/0xa0
[14580.685847]  ? __x64_sys_exit_group+0x14/0x20


As in your case, the error repeats. It was possible to connect from another desktop via ssh, but it was not possible to shut the machine down cleanly.

I had blamed the nvidia driver and/or chrome, and asked for help in the nvidia forums. Perhaps I’m wrong in laying the blame at their door.

I used YaST to change my default boot back to 5.16.1-1-default, and so far the error has not recurred. If that doesn’t pan out I can fall back to 5.15.8-1-default, which I’ve preserved via multiversion.kernels in /etc/zypp/zypp.conf.

I’m not using nvidia drivers and although I do have chrome installed, it’s rarely used, so I don’t think that relates to my crash.
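
The "Tainted:" field in the two traces backs this up: the first system reports `G ... O` (later `G W O`), while the second reports `P ... OE`. As a rough aid, here is a minimal sketch that decodes those letters. The meanings are taken from the kernel's tainted-kernels documentation, the table is deliberately partial (only letters relevant to this thread plus a couple of neighbours), and the function names are hypothetical.

```python
import re

# Partial table of taint letters, per the kernel's tainted-kernels
# documentation. Letters not listed here are reported as unknown.
TAINT_FLAGS = {
    "G": "only GPL-compatible modules loaded (no proprietary module)",
    "P": "proprietary module loaded",
    "O": "out-of-tree module loaded",
    "E": "unsigned module loaded",
    "W": "kernel issued a warning earlier",
    "D": "kernel oopsed or hit a BUG earlier",
    "L": "soft lockup occurred earlier",
}


def decode_taint(flags):
    """Return (letter, meaning) pairs for a taint string such as 'P  OE'."""
    return [
        (ch, TAINT_FLAGS.get(ch, "not in this partial table"))
        for ch in flags
        if not ch.isspace()
    ]


def taint_from_line(line):
    """Extract and decode the taint letters from a 'CPU: ... Tainted:' line."""
    match = re.search(r"Tainted: ([A-Z ]+)", line)
    if not match:
        return []
    return decode_taint(match.group(1))
```

Read this way, the first trace points at an out-of-tree but GPL-compatible module (consistent with the `vboxdrv(O)` entries in the module list), whereas the second shows a proprietary, unsigned module (the `nvidia(POE)` entries). That fits the two crashes having different triggers, though it does not prove it.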

I’m also using kernel-default-5.16.2-1.1, which was installed on 28 Jan, so it had been running fine for 5 days, and nothing has fundamentally changed since then.
I haven’t changed kernels or anything else, and so far there has been no recurrence.

Our errors may be unrelated, but as I understand it, they both started after moving to 5.16.2. In my case 5.16.2 had been working OK for a while as well. So my suggestion is: if the problem reoccurs, consider falling back to some earlier kernel in case something in 5.16.2 is the actual cause of both issues.

That’s a fair suggestion. I’ll keep an eye on it.

Your system was hung. Here’s a SUSE doc that explains the message you received: “What are all these ‘Bug: soft lockup’ messages about?” It may be of some help. I can’t vouch for the recommended amelioration, though.