Hi,
I have an OpenSuse 11.0 file server with software RAID. The RAID configuration is two mirrored discs plus a hot spare. Below is the configuration showing the partitions md0, md1 and md2 (sdb was the spare):
/boot 200Mb sda1, sdb1, sdc1 md0
/ 20Gb sda2, sdb2, sdc2 md1
/data 900Gb sda3, sdb3, sdc3 md2
Yesterday morning one of the active discs (sda) started to give read errors and was failed by RAID. The spare disc was activated automatically and re-syncing started. This is all great so far but the re-syncing keeps on restarting. What happens is md1 re-syncs to 100% (see number 1. below) which takes a few minutes but md1 stays degraded. Next md2 starts to re-sync briefly (see 2. below) but never gets very far because the re-syncing stops and re-starts again on md1 (3. below). This cycles continuously and has been going on for over 26 hours.
admin@sherbet:~> cat /proc/mdstat
Personalities : [raid1] [raid0] [raid6] [raid5] [raid4]
md2 : active raid1 sda30 sdb3[2] sdc3[1]
953160408 blocks super 1.0 [2/1] [_U]
resync=DELAYED
bitmap: 1/455 pages [4KB], 1024KB chunk
md0 : active raid1 sda10 sdb1[2] sdc1[1]
216832 blocks super 1.0 [2/2] [UU]
bitmap: 0/7 pages [0KB], 16KB chunk
md1 : active raid1 sda20 sdb2[2] sdc2[1]
20972784 blocks super 1.0 [2/1] [_U]
===================>.] recovery = 98.1% (20587904/20972784) finish=0.1min speed=39262K/sec
bitmap: 1/161 pages [4KB], 64KB chunk
unused devices: <none>
admin@sherbet:~> cat /proc/mdstat
Personalities : [raid1] [raid0] [raid6] [raid5] [raid4]
md2 : active raid1 sda30 sdb3[2] sdc3[1]
953160408 blocks super 1.0 [2/1] [_U]
>…] recovery = 0.0% (768/953160408) finish=39714.9min speed=384K/sec
bitmap: 1/455 pages [4KB], 1024KB chunk
md0 : active raid1 sda10 sdb1[2] sdc1[1]
216832 blocks super 1.0 [2/2] [UU]
bitmap: 0/7 pages [0KB], 16KB chunk
md1 : active raid1 sda20 sdb2[2] sdc2[1]
20972784 blocks super 1.0 [2/1] [_U]
bitmap: 1/161 pages [4KB], 64KB chunk
unused devices: <none>
admin@sherbet:~> cat /proc/mdstat
Personalities : [raid1] [raid0] [raid6] [raid5] [raid4]
md2 : active raid1 sda30 sdb3[2] sdc3[1]
953160408 blocks super 1.0 [2/1] [_U]
resync=DELAYED
bitmap: 1/455 pages [4KB], 1024KB chunk
md0 : active raid1 sda10 sdb1[2] sdc1[1]
216832 blocks super 1.0 [2/2] [UU]
bitmap: 0/7 pages [0KB], 16KB chunk
md1 : active raid1 sda20 sdb2[2] sdc2[1]
20972784 blocks super 1.0 [2/1] [_U]
>…] recovery = 1.3% (283328/20972784) finish=8.5min speed=40475K/sec
bitmap: 0/161 pages [0KB], 64KB chunk
unused devices: <none>
My question is, is this normal? Am I being impatient or is there something wrong?
Thanks in advance for any advice.