The system is sending me this email:
A DegradedArray event had been detected on md device /dev/md/Volume1_0.
Faithfully yours, etc.
P.S. The /proc/mdstat file currently contains the following:
Personalities : [raid1]
md126 : active raid1 sda[0]
1953511424 blocks super external:/md127/0 [2/1] [U_]
md127 : inactive sda0
3160 blocks super external:imsm
unused devices: <none>
I assume this means that one of the disks in the RAID is failing. If this is correct, I would love some help in knowing how to proceed.
Background: this is a new system (about 4 months in service), set up with four disks: two SSDs with Windows and OpenSuse 15.1 respectively (the operating systems), and two hard drive for the /home directory, set up with mdadm RAID1. (Patches on the OS are up to date.) From what I read in the man page of mdadm, this is where the issue lies. It sounds like I need to “fail” the defective disk, physically remove it, get the warranty replacement, then reinstall the disk and rebuild the RAID.
If this is all correct, then my first question is how do I know which physical disk is the one that is bad? What would the exact command look like to “fail” the disk? Is there anything else I need to do before removing the disk from the computer?
Here is the output of “ll /dev/md*”:
brw-rw---- 1 root disk 9, 126 Apr 16 18:43 /dev/md126
brw-rw---- 1 root disk 259, 9 Apr 16 18:43 /dev/md126p1
brw-rw---- 1 root disk 259, 10 Apr 16 18:43 /dev/md126p2
brw-rw---- 1 root disk 9, 127 Apr 16 18:43 /dev/md127
/dev/md:
total 0
lrwxrwxrwx 1 root root 8 Apr 16 18:43 imsm0 → …/md127
lrwxrwxrwx 1 root root 8 Apr 16 18:43 Volume1_0 → …/md126
lrwxrwxrwx 1 root root 10 Apr 16 18:43 Volume1_0p1 → …/md126p1
lrwxrwxrwx 1 root root 10 Apr 16 18:43 Volume1_0p2 → …/md126p2
Here is output of “mdadm --detail”
/dev/md126:
Container : /dev/md/imsm0, member 0
Raid Level : raid1
Array Size : 1953511424 (1863.01 GiB 2000.40 GB)
Used Dev Size : 1953511424 (1863.01 GiB 2000.40 GB)
Raid Devices : 2
Total Devices : 1
State : active, degraded
Active Devices : 1
Working Devices : 1
Failed Devices : 0
Spare Devices : 0
Consistency Policy : resync
UUID : 24021f90:94ce1c7b:2e66a0b1:371dad4c
Number Major Minor RaidDevice State
0 8 0 0 active sync /dev/sda
- 0 0 1 removed
Probably there is something else needed: please let me know. FWIW, I am a long-time OpenSuse end user, not afraid of CLI, but also maintaining only my own machines, so I don’t do a lot of under the hood stuff. Very little experience with software RAID (previous machine was hardware RAID). Help is appreciated!
(Fr) David Ousley