Leap 15.5 won't boot with any of the kernel-default from update repository

I have an Acer laptop that has been running Leap 15.4 for quite some time without any major issues.
In the past couple of weeks I have tried to upgrade to Leap 15.5 but after all the attempts the system has failed to boot due to:
" Starting Dracut Emergency Shell…
Warning: /dev/disk/by-uuid/B2C3-C97D does not exist
Warning: /dev/disk/by-uuid/d733130e-1ad1-4ee3-be20-8f74d7763d88 does not exist"

(B2C3-C97D refers to /boot/efi in a working system.)
(d733130e-1ad1-4ee3-be20-8f74d7763d88 refers to / in a working system.)
(Different UUIDs in reality. These are just from the latest attempt.)

During the weekend I decided to do a scratch install. By accident I did not enable the online repositories. To my joy the laptop booted up after installation. I logged in, went to yast and updated all packages. Rebooted and ended up in the same situation as above. I rolled back to a working snapshot and in an attempt to bisect I only updated the kernel and again it did not boot.
Currently I have these kernel versions installed:
5.14.21-150500.53.2 (This kernel is working.)
5.14.21-150500.55.36.1 (This kernel is NOT working.)

I am not familiar with troubleshooting things like this so help would be very much appreciated.
I guess that the information above is not enough but I can try to provide what is needed.

Boot kernel that boots successfully, collect output of dmesg command (as root), upload to https://paste.opensuse.org/. Also run

lsinitrd /boot/initrd-5.14.21-150500.53-default
lsinitrd /boot/initrd-5.14.21-150500.55.36-default

for good and bad kernels and also upload to the same location. Post links here.

Thank you for you quick reply @arvidjaar!
I hope this is what you are looking for.

dmesg
lsinitrd /boot/initrd-5.14.21-150500.53-default
lsinitrd /boot/initrd-5.14.21-150500.55.36-default

Nothing stands out when comparing two initrds. Both include the same modules and kernel drivers AFAICT. The dracut rdsosreport.txt would be really helpful.

I will try to save that one. I failed to mount my USB stick but I will give it another go.

There are two problems.

  1. initrd does not include usb-storage driver. You may fix it by force-including it in intrd
echo 'add_drivers+=" usb-storage "' > /etc/dracut.conf.d/usb-storage.conf
mkinitrd
  1. vfat filesystem needs NLS modules which are also missing but I always forget which ones (IIRC should be at least nls_iso8859-1 and may be nls_cp437). You may check with lsmod on the running system after mounting USB stick and add these drivers too.

Will mkinitrd run for both kernels?

Yes, it builds initrd for all installed kernels.

1 Like

rdsosreport.txt

Kernel failed to initialize NVMe controller, also there are hardware errors

[    2.013925] kronblom kernel: pcieport 10000:e0:1c.4: DPC: containment event, status:0x1f11 source:0x0000
[    2.013926] kronblom kernel: pcieport 10000:e0:1c.4: DPC: unmasked uncorrectable error detected
[    2.013934] kronblom kernel: pcieport 10000:e0:1c.4: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
[    2.013938] kronblom kernel: pcieport 10000:e0:1c.4:   device [8086:a0bc] error status/mask=00100000/00000000
[    2.013940] kronblom kernel: pcieport 10000:e0:1c.4:    [20] UnsupReq               (First)
[    2.013941] kronblom kernel: pcieport 10000:e0:1c.4: AER:   TLP Header: 34000000 e1000010 00000000 88468846
[    2.013956] kronblom kernel: nvme nvme0: frozen state error detected, reset controller
[    2.036996] kronblom kernel: nvme nvme0: Identify Controller failed (-4)
[    2.037008] kronblom kernel: nvme nvme0: Removing after probe failure status: -5

At this point you need to open bug report at https://bugzilla.opensuse.org/. Use the same user/password as here. Select Leap and correct version, use kernel as component (to expedite routing to the correct people). Attach dmesg from the good kernel and rdsosreport.txt.

Thank you very much for you support @arvidjaar!
Creating a bug report was easier said than done though. It seems that I can’t login. I get a message:
“There is already an account with the login name <my email address>

Usual first step - clear cookies for *.suse.com and *.opensuse.org (or use different clean browser). If it still does not work - send e-mail to bugzilla-admins@suse.com as instructed on the start page.

Ye, I have tested in new, temporary containers in Firefox so no cookies should interfere. I followed the advice on the bugzilla landing page and sent an email to bugzilla-admins@suse.com. Will give it some time and see if I get any response.

While you are waiting, you could try whether adding

nvme_core.default_ps_max_latency_us=0 pcie_aspm=off

kernel parameters has any effect.

Indeed that had effect. Latest kernel booted up directly. Any drawbacks to have these parameters as default?

|Good. Do not forget to mention it in bug report.

Worse power saving at the very least.

1 Like

New bug opened:
https://bugzilla.opensuse.org/show_bug.cgi?id=1218005

I( created a new Suse account.)

Install inxi utility and post

inxi -aFz

Update BIOS.
Try to change kernel parameter “preempt=full” to “preempt=none”.
CPU and iGPU are rather new for Leap. Try to use kernel from kernel:stable:backport repo.

Acer Aspire AV15-51/Jade_TL, BIOS V1.03 11/08/2021

New BIOS is available: BIOS V1.08 2023/09/18.

Oh my, it took much longer to respond than I first thought…

I have tested with latest kernel from backport and it boot up without any changed kernel parameters.

inxi output when running kernel from backport.

I have not updated bios just yet as backport kernel do work.