This has been plaguing me for years. I used to think it’s somehow related to secure boot that I had on my old thinkpad laptop, but it also happens on my new dell latitude laptop with secure boot off.
What happens is that I zypper dup, and when there’s a kernel update it’s a gamble whether it’s going to boot or not. When it doesn’t boot, I get a black screen until I hard reset. When that happens I revert to the older kernel version in the advanced boot options.
Does this happen for anyone else? Is there anything I can do? Is this a known issue? Thoughts?
Currently I’m on kernel 6.9.9-1-default, if I try to boot 6.10.2-1-default I get the black screen I described.
It happens for me too - on a system using an NVIDIA GPU;
It drops me without a GUI on a raw terminal.
However I usually just log in, do a sudo reboot now and the next boot it works.
@eshoe unusual for Intel GPU to not play nice… your running Xorg there, not Wayland? Have you logged out and selected Wayland? If using Xorg, then maybe you created an xorg.conf file?
As soon as Grub menu is exited, strike ESC. The resulting unhidden messages during boot may provide error messages constituting clues to failure. If you remove the splash=silent and quiet entries from your GRUB_CMDLINE_LINUX_DEFAULT= line in /etc/default/grub, and regenerate grub.cfg, you’ll see those messages on every boot.
@dotdotdot333 - different for me. In my case linux doesn’t boot at all. I don’t even get to a TTY. @malcolmlewis - I am using wayland…? @mrmazda - I’ve done that. Nothing. When I select 6.10.2-1-default. Black screen, no messages at all. When I select 6.9.9-1-default I can boot and I see the messages as expected.
Could it be some weird interaction with Dell’s UEFI? But I don’t see why it would happen on every other version or so of the kernel.
I wonder if xe.force_probe=4c8b and i915.force_probe=!4c8b, as you can see I used above on my kernel cmdline, added to yours would be of any benefit. My GPU is a year or more older than yours, so requires at least the first two in order to use the Xe drivers instead of older.
Have you tried a remote login after a 6.10.2 boot was given plenty of time to complete?
@mrmazda - Given around 1 minute of waiting (I did not time it), without seeing any printed message still, laptop is reset. Is auto-reset a known behavior when boot is stuck?
I wonder if xe.force_probe=4c8b and i915.force_probe=!4c8b, as you can see I used above on my kernel cmdline, added to yours would be of any benefit. My GPU is a year or more older than yours, so requires at least the first two in order to use the Xe drivers instead of older.
No change, other than getting this scary looking log message now: [ 4.239381] [ T519] Setting dangerous option force_probe - tainting kernel
@malcolmlewis - what’s the difference between the magic bytes 46d1 and 4c8b?
Hmm raptor-Lake is that not the Intel CPU that has known instabilities???
@gogalthorp - yes it’s the code name. I’m crossing my fingers that I’m not affected by oxidation. I have 3 year warranty from Dell in case I am. Regarding the instability, as far as I know it’s been observed mostly on Desktop CPUs, not mobile. I hope that’s the case.
Oh , thanks. Well using my correct chip id changed nothing for the not-working 6.10.2-1 boot process.
I just read today that Intel is in process of extending generation 13 & 14 CPU warranties from 3 years to 5.
That would be nice. I’m not sure how it works with Dell’s warranty though. It’s not like I’d get full warranty for all hardware and Dell support for extra 2 years (though it would be cool!)
“You” may not be the one doing the changing. Intel could have designated facilities for having a change made by a professional whether laptop or otherwise, possibly to include a Dell service center if one is nearby, and others are not.
Tired the --recommends thing at the bottom of that output: maybe this section could be relevant amongst all the other stuff? Test: recommended kernel modules:
GPU modules are only needed if applicable. NVMe drives do not need drivetemp
but other types do.
To load a module: modprobe - To permanently load add to
/etc/modules or /etc/modules-load.d/modules.conf (check your system paths for
exact file/directory names).
amdgpu: -s, -G AMD GPU sensor data (newer GPUs)… Missing
drivetemp: -Dx drive temperature (kernel >= 5.6)… Missing
nouveau: -s, -G Nvidia GPU sensor data (if using free driver)… Missing
radeon: -s, -G AMD GPU sensor data (older GPUs)… Missing
The following recommended kernel modules are missing:
amdgpu
drivetemp
nouveau
radeon
Not sure how I’d install those. Guess I could also try booting with the on board graphics connected instead of the Nvidia or try it with Wayland?