Boot fail, when is Ext4 partition mounting - openSUSE 11.2

Hi all,
sometimes openSUSE boot ends with these errors:
http://www.djscore.org/tmp/boot_fail_1.jpg

(I do not know why it prints an error on Ext2 filesystem when the disk is formatted Ext4…)

http://www.djscore.org/tmp/boot_fail_2.jpg

Here is my Fstab:

/dev/disk/by-id/ata-OCZ_VERTEX-TURBO_202B960A7RM94280ELSX-part1	/	ext4	noatime,nobh,commit=100 1 1 
/dev/disk/by-id/ata-OCZ_VERTEX-TURBO_202B960A7RM94280ELSX-part2	/home	ext4	noatime,nobh,data=writeback,commit=100 1 2 
proc	/proc	proc	defaults 0 0 
sysfs	/sys	sysfs	noauto 0 0 
debugfs	/sys/kernel/debug	debugfs	noauto 0 0 
usbfs	/proc/bus/usb	usbfs	noauto 0 0 
devpts	/dev/pts	devpts	mode=0620,gid=5 0 0 
/dev/disk/by-id/ata-WDC_WD3200KS-00PFB0_WD-WCAPD1646323-part1 /home/djs_core/data  ext4       defaults                   1 2
tmpfs	/tmp	tmpfs	noatime 0 0 

Any idea ? :wink:

Here is my fstab

/dev/disk/by-id/ata-ST9160827AS_5RF16D1B-part5 swap                 swap       defaults              0 0
/dev/disk/by-id/ata-ST9160827AS_5RF16D1B-part2 /                    ext4       acl,user_xattr        1 1
/dev/disk/by-id/ata-ST9160827AS_5RF16D1B-part6 /home                ext4       acl,user_xattr        1 2
/dev/disk/by-id/ata-ST9160827AS_5RF16D1B-part1 /windows/C           ntfs-3g    defaults 0 0
proc                 /proc                proc       defaults              0 0
sysfs                /sys                 sysfs      noauto                0 0
debugfs              /sys/kernel/debug    debugfs    noauto                0 0
usbfs                /proc/bus/usb        usbfs      noauto                0 0
devpts               /dev/pts             devpts     mode=0620,gid=5       0 0

Maybe it will help you make the adjustments

On 01/04/2010 05:16 AM, djs core wrote:
>
> Hi all,
> sometimes openSUSE boot ends with these errors:
> http://www.djscore.org/tmp/boot_fail_1.jpg

Your root file system has an error and needs to have fsck run on it. When you
get that prompt for the root password that is shown in the picture, enter it and
then run the command

fsck -y /dev/sda1

These errors do not happen very often, but I have had a couple of them since I
switched to ext4.

> (I do not know why it prints an error on Ext2 filesystem when the disk
> is formatted Ext4…)

The underlying file system for ext2, ext3, and ext4 are so similar that the
fixing is all handled by the ext2 program. That is why it was mentioned here.

do as lwfinger says…

and data=writeback ???

Boy, you surely like increasing the risk of losing your data :wink:

Thanks guys,

To: caf4926
ok. i try the adjustments :wink:

To: lwfinger

Ok, when I try:

fsck -y /dev/sda1

fsck write:

/dev/sda1 is mounted. Warning!!! Running e2fsck on a mounted filesystem may cause SEVERE filesystem damage.

ok, canceled and write:

umount /dev/sda1 and again fsck -y /dev/sda1

in one second terminal write:

fsck from util-linux-ng 2.16
e2fsck 1.41.9 (22-Aug-2009)
/dev/sda1:clean, 122290/1281120 files, 950433/5120710 blocks

It’s alright ?

To: microchip8
I use on the SSD disk (data = writeback) since ext3 without problems, except this :wink:

And new adjustment fstab:


/dev/disk/by-id/ata-OCZ_VERTEX-TURBO_202B960A7RM94280ELSX-part1	/			ext4	noatime,nobh,commit=100	1 1 
/dev/disk/by-id/ata-OCZ_VERTEX-TURBO_202B960A7RM94280ELSX-part2	/home			ext4	noatime,nobh,commit=100	1 2 
/dev/disk/by-id/ata-WDC_WD3200KS-00PFB0_WD-WCAPD1646323-part1	/home/djs_core/data	ext4	acl,user_xattr		1 2
proc		/proc				proc		defaults 			0 0 
sysfs		/sys				sysfs		noauto 				0 0 
debugfs		/sys/kernel/debug		debugfs		noauto 				0 0 
usbfs		/proc/bus/usb			usbfs		noauto 				0 0 
devpts		/dev/pts			devpts		mode=0620,gid=5 		0 0 
tmpfs		/tmp				tmpfs		noatime 			0 0 
tmpfs		/var/log			tmpfs		noatime				0 0
tmpfs		/var/tmp			tmpfs		noatime				0 0

@djs_core

try forcing a full check with fsck

Also, you may use since Ext3 data=writeback without any problems, but this does not imply that writeback is a safe/good option, even most file systems developers will advise you not to use it, unless you’re a speed freak and don’t value much what’s on your partition. If you want to end up with garbage during unsafe shutdowns, sudden power outages, etc, then please do. Heck, throw away Ext4/Ext3 and go use Ext2 instead, which will give you better performance than data=writeback since there’s no journal overhead involved as Ext2 doesn’t have one :wink:

To microchip8:

I try full check with fsck:

root@PartedMagic:~# e2fsck -f /dev/sda1
e2fsck 1.41.9 (22-Aug-2009)
Pass 1: Checking inodes, blocks, and sizes
Pass 2: Checking directory structure
Pass 3: Checking directory connectivity
Pass 4: Checking reference counts
Pass 5: Checking group summary information
/dev/sda1: 122290/1281120 files (0.2% non-contiguous), 950441/5120710 blocks
root@PartedMagic:~# e2fsck -f /dev/sda2
e2fsck 1.41.9 (22-Aug-2009)
Pass 1: Checking inodes, blocks, and sizes
Pass 2: Checking directory structure
Pass 3: Checking directory connectivity
Pass 4: Checking reference counts
Pass 5: Checking group summary information
/dev/sda2: 6084/674624 files (1.0% non-contiguous), 333565/2694903 blocks
root@PartedMagic:~# e2fsck -f /dev/sdb1
e2fsck 1.41.9 (22-Aug-2009)
Pass 1: Checking inodes, blocks, and sizes
Pass 2: Checking directory structure
Pass 3: Checking directory connectivity
Pass 4: Checking reference counts
Pass 5: Checking group summary information
Data: 59661/19537920 files (3.6% non-contiguous), 12449904/78142160 blocks

but nothing change :frowning:

I’m not certain what can cause such things… You may have better luck by using the good old device names instead of device IDs/paths…

This will involve changing all in /etc/fstab to /dev/sdXX (where XX corresponds to the correct disk and partition on it) and it will also involve changing to device names in /boot/grub/menu.lst and /etc/sysconfig/bootloader (the resume= part)

After all this, you must run mkinitrd

To microchip8:
I have changed device names in fstab and menu.lst, but in /etc/sysconfig/bootloader is no record of device names:


## Path:	System/Bootloader
## Description:	Bootloader configuration
## Type:	list(grub,lilo,none)
## Default:	grub
#
# Type of bootloader in use.
# For making the change effect run bootloader configuration tool
# and configure newly selected bootloader
#
#
LOADER_TYPE="grub"


## Path:        System/Bootloader
## Description: Bootloader configuration
## Type:        list(floppy,mbr,root,boot,custom,none)
## Default:     mbr
#
# Location of boot loader.
# For making the change effect run bootloader configuration tool
#
#
LOADER_LOCATION=""
DEFAULT_APPEND="splash=silent quiet showopts"
DEFAULT_VGA="0x346"
FAILSAFE_APPEND="showopts apm=off noresume edd=off powersaved=off nohz=off highres=off processor.max_cstate=1 x11failsafe"
FAILSAFE_VGA="0x346"
XEN_KERNEL_APPEND="splash=silent quiet showopts"
XEN_APPEND=""
XEN_VGA="0x346"
## Path:	System/Bootloader
## Description: Bootloader configuration
## Type:	yesno
## Default:	no
#
# Should the boot cycle detection be used to
# avoid unconditional reboot cycles of not
# supervised system.
#
CYCLE_DETECTION="no"

## Type:	integer(0:)
## Default:	1
#
# The number of the entry in grub's menu.lst
# which should be used on the next reboot cycle.
# Note that the first entry has the number 0.
#
CYCLE_NEXT_ENTRY="1"

I think what is being suggested is to replace the disk ID info in fstab with /dev info
If you do fdisk -l from a terminal in Parted Magic it will list them that way then replace like this:

This is just showing one example which I think is correct:

/dev/disk/by-id/ata-ST9160827AS_5RF16D1B-part5 swap swap defaults 0 0
/dev/sda2 / ext4 acl,user_xattr 1 1
/dev/disk/by-id/ata-ST9160827AS_5RF16D1B-part6 /home ext4 acl,user_xattr 1 2
/dev/disk/by-id/ata-ST9160827AS_5RF16D1B-part1 /windows/C ntfs-3g defaults 0 0
proc /proc proc defaults 0 0
sysfs /sys sysfs noauto 0 0
debugfs /sys/kernel/debug debugfs noauto 0 0
usbfs /proc/bus/usb usbfs noauto 0 0
devpts /dev/pts devpts mode=0620,gid=5 0 0

To caf4926:
No ! Pls read post again! I have successfully changed fstab and menu.lst, but not this part:

and /etc/sysconfig/bootloader (the resume= part)

What exactly is the problem. Is something not working?

Looking at your image it looks like your root fs is ok
the problem is with your data disk first the obvious if its an external disk is it plugged in.
run fsck -f on that disk no -y
expect to spend some time because when it happened to me it took a while.
It looked like my disk was defraging

Here is another thing you might try
remove the indicated line from fstab and reboot then mount it by hand

To caf4926:
Problem is described in the first post. OpenSUSE boot sometimes end with these errors:

http://www.djscore.org/tmp/boot_fail_1.jpg

  • boot fail when system mounted sdb1 partition, but when i do restart, boot run fine (or not)…

http://www.djscore.org/tmp/boot_fail_2.jpg

  • boot fail when system mounted sda1 partition, but when i do restart, boot run also fine (or not)…

These errors do not occur each time, but only sometimes…

To dale14846:
Yes, problem is sometimes with data disc (sdb device - classic sata HDD), but sometimes also with root partition (sda1) screen two.
fsck -f ended good on all partition…(see post 8 - Boot fail, when is Ext4 partition mounting - openSUSE 11.2 - openSUSE Forums)
If I delete from the fstab data disc mount point, error number 1 is gone, bun no 2. It is logical :wink:
These errors do not occur each time, but only sometimes!

On Thu, 07 Jan 2010 20:46:04 GMT, djs core <djs_core@no-mx.forums.opensuse.org> wrote:

>
>To caf4926:
>Problem is described in the first post. OpenSUSE boot sometimes end
>with these errors:
>
>http://www.djscore.org/tmp/boot_fail_1.jpg
>- boot fail when system mounted sdb1 partition, but when i do restart,
>boot run fine (or not)…
>
>http://www.djscore.org/tmp/boot_fail_2.jpg
>- boot fail when system mounted sda1 partition, but when i do restart,
>boot run also fine (or not)…
>
>These errors do not occur each time, but only sometimes…
>
>To dale14846:
>Yes, problem is sometimes with data disc (sdb device - classic sata
>HDD), but sometimes also with root partition (sda1) screen two.
>fsck -f ended good on all partition…(see post 8 - ‘Boot fail, when is
>Ext4 partition mounting - openSUSE 11.2 - openSUSE Forums’
>(http://tinyurl.com/y8qg3pb))
>If I delete from the fstab data disc mount point, error number 1 is
>gone, bun no 2. It is logical :wink:
>These errors do not occur each time, but only sometimes!

An intermittent error strongly suggests marginal HW or cabling in the system.

Try replacing the disk cables (or even just remove&reinstall the ones youhave).

If no help run memtest for at least 8 hours.

Here is my /etc/sysconfig/bootloader part on 11.1

LOADER_TYPE="grub"
DEFAULT_NAME="openSUSE 11.1"
DEFAULT_APPEND="resume=/dev/sda2 splash=silent showopts"
DEFAULT_VGA="0x31A"
FAILSAFE_APPEND="showopts ide=nodma apm=off noresume edd=off powersaved=off nohz=off highres=off processor.max_cstate=1 x11failsafe"
FAILSAFE_VGA="0x31A"

I don’t run 11.2 so some things may have changed

To JosephKK:
Cables and Hardware work fine, It is not the problem…
To microchip8:
Bootloader is ok.

I have updated to new kernel version (2.6.31.8) and it seems that the problems are gone…We will see :wink:

It says that it could mount the root partition, but that it couldn’t mount another partition, namely the following:

/dev/disk/by-id/ata-WDC_WD3200KS-00PFB0_WD-WCAPD1646323-part1 /home/djs_core/data ext4 defaults 1 2

The cause could be that the partition in question is not avaiable (e.g. it is on an external HD that is not connected), the partition has wrong parameters or the wrong file system listed in /etc/fstab, or that there’s something wrong with the partition.
The /etc/fstab entry looks ok (if it wasn’t, it would always give the error), so if the partition is always avaiable, that can only mean there’s something wrong with the partition.
Run the following from a root console:

fsck -y /dev/sda2

That one means it couldn’t mount your root partition:

/dev/disk/by-id/ata-OCZ_VERTEX-TURBO_202B960A7RM94280ELSX-part1 / ext4 noatime,nobh,commit=100 1 1

Reasons for this are the same as above. Also here, there’s probably something wrong with the partition.
Run the following from a root console:

fsck -y /dev/sda1