13.2 crashes on attempt to resume after hibernation

[New thread with appropriate title]

I checked *suspend to DISK with fatal results.
BAD MOVE: after my success at suspending and resuming from Sleep I decided to test Hibernate. It appeared to work but won’t recover from hibernation. Now I can’t even get the GRUB screen. Just a load of messages which stop at “Reached target Graphical Interface”. If I touch the On/Off button, it moves a bit further **and then powers off."

******I mean when I press the On-button my laptop keeps trying to recover the Hibernate image and there is no way to boot 13.2. The messages I mentioned just scroll up the screen and I can’t stop them. I created a new Knoppix CD and I can see the content of the SSD/HD and found the partition with the system.

In the meantime I appear to have zapped the MBR because when I try to boot from the HD I get the Lenovo screen and then nothing but a blank screen with a flashing underline cursor. This cursor does respond to the keyboard.

grubenv contain:

#GRUB Environment Block
saved_entry=0
next_entry=Advanced options for openSUSE>openSUSE, with Linux 3.16.6-2-desktop(recovery mode)
########################

Any guidance will be appreciated.


No, it doesn’t.
It tries to boot the set entry over and over again.
Remove /boot/grub2/grubenv to get the boot menu back.

In the meantime I appear to have zapped the MBR because when I try to boot from the HD I get the Lenovo screen and then nothing but a blank screen with a flashing underline cursor. This cursor does respond to the keyboard.

That’s bad.
Try to reinstall GRUB2 like instructed here:
https://forums.opensuse.org/content.php/128-Re-install-Grub2-from-DVD-Rescue

Please note that “mount /proc” and “mount /sys” will not work on a default 13.2 installation any more, use this instead:

mount -t proc proc /proc
mount -t sysfs sysfs /sys

grubenv contain:

#GRUB Environment Block
saved_entry=0
next_entry=Advanced options for openSUSE>openSUSE, with Linux 3.16.6-2-desktop(recovery mode)
########################

Apparently you stumbled over the same bug we are discussing here at the moment:
https://forums.opensuse.org/showthread.php/503756-OS-13-2-Hibernate-(aka-suspend-to-disk)-doesn-t-work-when-booting-with-quot-Advanced-options-quot

I.e. You booted an older kernel via “Advanced Options” and then hibernated. The hibernate script set the wrong boot entry, namely “recovery mode” which doesn’t resume at all, because it has the “noresume” parameter.

Sounds good to me. I’ll do this first, before I try to reinstall GRUB2. Right?

That’s bad.
Try to reinstall GRUB2 like instructed here:
https://forums.opensuse.org/content.php/128-Re-install-Grub2-from-DVD-Rescue

Please note that “mount /proc” and “mount /sys” will not work on a default 13.2 installation any more, use this instead:

mount -t proc proc /proc
mount -t sysfs sysfs /sys

I’ll get onto this in about three hours (I want to check the original 13.2 DVD and it’s with someone else).

Apparently you stumbled over the same bug we are discussing here at the moment:
OS 13.2: Hibernate (aka suspend-to-disk) doesn't work when booting with "Advanced options" - Install/Boot/Login - openSUSE Forums
Haven’t read this yet.

.e. You booted an older kernel via “Advanced Options” and then hibernated. The hibernate script set the wrong boot entry, namely “recovery mode” which doesn’t resume at all, because it has the “noresume” parameter.
Yes. This is what probably happened:

  • after checking that resume from Sleep was working, an update arrived via apper
  • I installed the update and it asked me to reboot
  • I was about to do that, got distracted and thought of testing hibernation first. [can’t be sure because my memory is sieve]
  • In retrospect that was stupid.
  • While I was trying to get out of that problem I saw a advice on the web for removing a “hibernation snapshot” by holding down the Shift key while trying to boot. That’s probably what zapped the MBR.

Doesn’t matter.
This file is loaded dynamically during boot. It’s not used at all when (re)installing GRUB2.
But with that grubenv present, grub2 will not show any menu at all because that entry tells it to directly boot Linux 3.16.6(recovery mode).

Normally you shouldn’t even have to do anything else than to remove that file, but as your system doesn’t boot/load grub at all now, a reinstallation of grub seems necessary.
You could try to just delete grubenv as a test first though.

Note that even with that entry present in that file your system should boot, just directly into “Linux 3.16.6(recovery mode)”.
That booting to recovery mode doesn’t work apparently is a different problem, unrelated to GRUB2.
Could be caused by having a proprietary video driver installed (nvidia, fglrx).

Yes. This is what probably happened:

  • after checking that resume from Sleep was working, an update arrived via apper
  • I installed the update and it asked me to reboot
  • I was about to do that, got distracted and thought of testing hibernation first. [can’t be sure because my memory is sieve]
  • In retrospect that was stupid.

Ok, so apparently you didn’t explicitely boot an older kernel, but you hibernated after you installed the kernel update.
The same problem in the end though, the currently booted kernel was one of the “Advanced Options” sub menu (the menu gets updated when a kernel package is installed e.g.), and the hibernate script set the wrong one in grubenv, i.e. the “(recovery mode)” flavour which does no resume at all.

  • While I was trying to get out of that problem I saw a advice on the web for removing a “hibernation snapshot” by holding down the Shift key while trying to boot. That’s probably what zapped the MBR.

Holding Shift doesn’t have any effect at all AFAIK (with the default settings in openSUSE at least).
At least it should not “zap” the MBR in any case (unless your BIOS maybe has such a function…).

Maybe you broke the boot loader in another way while trying to “fix” your problem?

I really appreciate your help. And I have good news. The Lenovo has recovered after deleting grubenv and after opening the BIOS to switch from “Legacy Mode” to EFI. After that Grub appeared and loaded. I don’t want to try “suspend to disk” again. Nonetheless I want to check if everything is OK.

  • I received and installed a couple of updates and looked at the Software Updater" messages. They mentioned “A transaction failed to complete successfully”.
  • Furthermore dmesg has a line in bold that ends with "
    **Please run fsck" **referring to sda1, which is FAT-fs - fsck doesn’t exist so I ran
xfs_repair -V -n sda1

That failed for reasons I don’t understand. I didn’t set up the partitions and their formats.

  • However the Updater now says my system is up to date.
  • Is there anything else I can check? If not, this issue is:
  • SOLVED
    .

[Next: get Sleep to work with the lid; install Bumblebee (had it working on this laptop under 13.1) because I need the nVidia GPU very little and without it the tiny battery will last longer; etc.]

Good to hear!
Did you maybe install in EFI mode and switch to “Legacy Mode” later?

I don’t want to try “suspend to disk” again.

Unless you boot the older kernel it should work.
If hibernate/resume itself fails for some reason and you’re stuck without a boot menu, you’ll know what to do now anyway: Remove /boot/grub2/grubenv.

Nonetheless I want to check if everything is OK.

  • I received and installed a couple of updates and looked at the Software Updater" messages. They mentioned “A transaction failed to complete successfully”.

Well, PackageKit is not too good in solving problems/conflicts.
Maybe try to run “zypper up” to see what’s wrong.

Furthermore dmesg has a line in bold that ends with **“****Please run fsck” **referring to sda1, which is FAT-fs

  • fsck doesn’t exist so I ran
xfs_repair -V -n sda1

That failed for reasons I don’t understand. I didn’t set up the partitions and their formats.

???
You say sda1 is FAT-fs, why are you running xfs_repair? That’s for XFS filesystems.
Use dosfsck instead.
But fsck should exist as well. You’ll have to be root to be able to run it though.

Is there anything else I can check?

If the system boots up fine, everything should be ok.
The worst I think that could have happened would be an inconsistent filesystem. But fsck should be run automatically on boot to fix that (for the / and /home partitions at least).

Yes, I did. I switched to Legacy because my old Knoppix wouldn’t boot.

???
You say sda1 is FAT-fs, why are you running xfs_repair? That’s for XFS filesystems.
Use dosfsck instead.
But fsck should exist as well. You’ll have to be root to be able to run it though.
I became root and issued the fsck command as requested by dmesg. The Konsole said it didn’t exist and to use xfs_repair. I thought it pretty weird. I’ll make a note of dosfsck.

Thank again. I’ll be around for a while. Season Greetings! I’ll be moving accounting and my development servers on to the Lenovo.

Well, EFI boots completely different than legacy mode (MBR), so no wonder that grub2 couldn’t be loaded afterwards.

I became root and issued the fsck command as requested by dmesg. The Konsole said it didn’t exist and to use xfs_repair.

Sorry, I don’t really understand that.
The Konsole should not suggest xfs_repair in any case, even if fsck is missing.

Can you please try to run fsck again, and post the exact message you get?

If fsck is really missing, you should probably re-install the package util-linux. You might run into problems sooner or later without fsck, as it is run automatically on boot.

On 2014-12-23 18:26, pacolaser wrote:

> - after checking that resume from Sleep was working, an update arrived
> via apper
> - I installed the update and it asked me to reboot
> - I was about to do that, got distracted and thought of testing
> hibernation first. [can’t be sure because my memory is sieve]

NEVER! When it tells you to reboot, after a kernel update, just reboot.
Don’t do anything else, specially not any testing, and never, under
death penalty, hibernate! Just reboot and don’t delay it.

> - In retrospect that was stupid.
> - While I was trying to get out of that problem I saw a advice on the
> web for removing a “hibernation snapshot” by holding down the Shift
> key while trying to boot. That’s probably what zapped the MBR.

I doubt it. Anyway, I have not seen that shift thing anywhere.


Cheers / Saludos,

Carlos E. R.
(from 13.1 x86_64 “Bottle” at Telcontar)

On 2014-12-24 00:56, pacolaser wrote:

> - I received and installed a couple of updates and looked at the
> Software Updater" messages. They mentioned “A transaction failed to
> complete successfully”.

I don’t like apper/packagekit. I prefer yast/zypper.

> - Furthermore dmesg has a line in bold that ends with *"**Please run
> fsck" *referring to sda1, which is FAT-fs
> - fsck doesn’t exist so I ran

Impossible. fsck must exist, it is crucial. Your system would not boot.

> Code:
> --------------------
> xfs_repair -V -n sda1
> --------------------
> That failed for reasons I don’t understand.

Fortunately it failed. :expressionless:

Why do you attempt and XFS repair on a FAT partition? If it
attempted to work on it, it would destroy your partition filesystem
beyond repair!

Please, please, go slow and don’t go rushing. Don’t try two things on a
single step, like update a kernel then hibernate, or repair a partition
with the wrong tool because you don’t find the right one. Slow down, and
if you had doubts, ask.


Cheers / Saludos,

Carlos E. R.
(from 13.1 x86_64 “Bottle” at Telcontar)

Yes, so I’ve been told. Here is the Konsole output - last part of the latest dmesg:

 
    8.405316] **FAT-fs (sda1): Volume was not properly unmounted. Some data may be corrupt. Please run fsck.**
    8.405931] XFS (sda4): Mounting V5 Filesystem
    9.101122] XFS (sda4): Starting recovery (logdev: internal)
    9.631453] XFS (sda4): Ending recovery (logdev: internal)
   10.139224] audit: type=1305 audit(1419447328.133:2): audit_pid=749 old=0 auid=4294967295 ses=4294967295 res=1
   11.383717] systemd-journald[447]: File /var/log/journal/b4ea4eb15ab7c29b6cc20a47544e5eb7/system.journal corrupted or uncleanly shut down, renaming and replacing.
   11.506971] Bluetooth: BNEP (Ethernet Emulation) ver 1.3
   11.506973] Bluetooth: BNEP filters: protocol multicast
   11.506978] Bluetooth: BNEP socket layer initialized
   11.616011] alx 0000:07:00.0: irq 51 for MSI/MSI-X
   11.616154] IPv6: ADDRCONF(NETDEV_UP): enp7s0: link is not ready
   11.619122] iwlwifi 0000:08:00.0: L1 Disabled; Enabling L0S
   11.626506] iwlwifi 0000:08:00.0: Radio type=0x2-0x0-0x0
   11.864351] iwlwifi 0000:08:00.0: L1 Disabled; Enabling L0S
   11.871735] iwlwifi 0000:08:00.0: Radio type=0x2-0x0-0x0
   11.934182] IPv6: ADDRCONF(NETDEV_UP): wlp8s0: link is not ready
   11.938143] NET: Registered protocol family 17
   16.888143] ACPI Warning: \_SB_.PCI0.PEG0.PEGP._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20140424/nsarguments-95)
   16.888538] ACPI: \_SB_.PCI0.PEG0.PEGP: failed to evaluate _DSM
   16.888552] ACPI Warning: \_SB_.PCI0.PEG0.PEGP._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20140424/nsarguments-95)
   34.860920] wlp8s0: authenticate with c8:60:00:71:d1:0c
   34.863862] wlp8s0: send auth to c8:60:00:71:d1:0c (try 1/3)
   34.868821] wlp8s0: authenticated
   34.868902] iwlwifi 0000:08:00.0 wlp8s0: disabling HT/VHT due to WEP/TKIP use
   34.869529] wlp8s0: associate with c8:60:00:71:d1:0c (try 1/3)
   34.872710] wlp8s0: RX AssocResp from c8:60:00:71:d1:0c (capab=0x411 status=0 aid=3)
   34.881634] wlp8s0: associated
   34.881657] IPv6: ADDRCONF(NETDEV_CHANGE): wlp8s0: link becomes ready
   42.115746] fuse init (API version 7.23)
  300.999259] mce: [Hardware Error]: Machine check events logged


As you can see, it still says to run fsck. All I did was to issue

fsck --help

to find out what it did. The output (which I cannot recreate) said “You probably want xsf_repair”.

I cannot recreate that either, “fsck --help” just says:

# fsck --help
fsck from util-linux 2.25.1

Well, not every command shows extensive help when you specify “–help”.
Try “man fsck” instead… :wink:

In your case it should be enough to run “fsck /dev/sda1”. fsck should be able to determine itself which fsck subtool to use.

On 2014-12-24 20:36, pacolaser wrote:

> Code:
> --------------------
>
> 8.405316] FAT-fs (sda1): Volume was not properly unmounted. Some data may be corrupt. Please run fsck.
> 8.405931] XFS (sda4): Mounting V5 Filesystem
> 9.101122] XFS (sda4): Starting recovery (logdev: internal)

> --------------------
>
> As you can see, it still says to run fsck. All I did was to issue
> Code:
> --------------------
> fsck --help
> --------------------
> to find out what it did. The output (which I cannot recreate) said “You
> probably want xsf_repair”.

Please post here the output of “fsck --help”. If you get output, then
the “fsck” command is installed.

fsck can be called as that, fsck, or fsck.something:

fsck fsck.cramfs fsck.ext3 fsck.fat
fsck.minix fsck.reiserfs fsck.xfs fsck.btrfs
fsck.ext2 fsck.ext4 fsck.jfs fsck.msdos
fsck.vfat

Called as “fsck”, it automatically determines what variant to call for
the actual job.

You probably called fsck on sda4, which is an xfs partition. And the
mkfs.xfs variant does tell you “You probably want xsf_repair”.

You were told to verify sda1, not sda4.
Or you called directly fsck.xfs
Both actions would produce that result.


Cheers / Saludos,

Carlos E. R.
(from 13.1 x86_64 “Bottle” at Telcontar)

Sorry for delay in responding (blame Xmas). Yes. I am sure you’re right about what happened. fsck does exist, of course. Anyway the important thing is that Sleep is fixed and working fine. I’ll ignore the dmesg report for now. Thanks to everyone for their help and Happy New Year from a frozen part of Spain.