[ale] RAID mirror boot nightmare
Erik Mathis
erik at mathists.com
Tue Jul 10 08:41:30 EDT 2012
Can you also send the output of /proc/mdstat?
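
In case it saves some typing, here is a rough sketch of a script that would
gather that in one go, along with what "mdadm --examine" says about the member
partitions. It is only a sketch under a few assumptions: md_report is a helper
name I made up (it is not part of mdadm), the partition names are the ones from
your mail, and it needs to run as root from whatever environment you can
actually boot, such as the rescue CD.

```shell
#!/bin/sh
# Sketch only: md_report is a hypothetical helper, not an mdadm command.
# Usage: md_report [MDSTAT_FILE] [PARTITION...]
md_report() {
    # First argument: the mdstat file to read (defaults to /proc/mdstat).
    mdstat=${1:-/proc/mdstat}
    if [ "$#" -gt 0 ]; then
        shift
    fi

    echo "== $mdstat =="
    if [ -r "$mdstat" ]; then
        cat "$mdstat"
    else
        echo "(not readable)"
    fi

    # Remaining arguments: member partitions. --examine prints the md
    # superblock stored on each one, including its metadata version and
    # the array UUID. Errors are printed rather than aborting the report.
    for part in "$@"; do
        echo "== mdadm --examine $part =="
        if command -v mdadm >/dev/null 2>&1; then
            mdadm --examine "$part" 2>&1 || true
        else
            echo "(mdadm not found)"
        fi
    done
}
```

After sourcing that, something like
"md_report /proc/mdstat /dev/sda2 /dev/sda6 > md-report.txt" should produce a
single file you can paste into a reply.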
On Tue, Jul 10, 2012 at 1:33 AM, Bob Toxen <transam at verysecurelinux.com> wrote:
> Additional details on this miserable problem:
>
> On Boot the kernel complains of:
>
> Creating root device
> Mounting root filesystem
> Mount: Could not find filesystem '/dev/root'
>
> after reporting that md0 was apparently created successfully; finally, it
> panics.
>
> /boot/grub/grub.conf entry being booted:
> title CentOS-single-md4
> root (hd0,0)
> kernel /vmlinuz-2.6.18-308.4.1.el5 ro root=/dev/md4 md=4,/dev/sda6,/dev/sdb6 md=1,/dev/sda2,/dev/sdb2 md-mod.start_dirty_degraded=1 rhgb single noresume
> initrd /initrd-2.6.18-308.4.1.el5.img
>
> /etc/mdadm.conf (heavily edited by me including switching from uuid to
> devices; I don't presently list swap as that is not critical and it
> fails before even thinking about swap):
> # mdadm.conf written out by anaconda
> DEVICE /dev/sda[26] /dev/sdb[26]
> MAILADDR root
> ARRAY /dev/md4 level=raid1 num-devices=2 devices=/dev/sda6,/dev/sdb6 auto=yes
> ARRAY /dev/md1 level=raid1 num-devices=2 devices=/dev/sda2,/dev/sdb2 auto=yes
>
> fdisk output:
> Disk /dev/sda: 1000.2 GB, 1000204886016 bytes
> 255 heads, 63 sectors/track, 121601 cylinders
> Units = cylinders of 16065 * 512 = 8225280 bytes
>
>    Device Boot      Start         End      Blocks   Id  System
> /dev/sda1   *           1          13      104391   83  Linux
> /dev/sda2   *          14          26      104422+  fd  Linux raid autodetect
> /dev/sda3              27        4200    33527655   82  Linux swap / Solaris
> /dev/sda4            4201      121601   943023532+   f  W95 Ext'd (LBA)
> /dev/sda5            4201       62900   471507718+  83  Linux
> /dev/sda6           62901      121600   471507718+  fd  Linux raid autodetect
>
> /etc/fstab:
> /dev/md4 / ext3 defaults 1 2
> /dev/md1 /boot ext3 defaults 1 2
>
> #normal /dev/md3 / ext3 defaults 1 1
> #normal /dev/md0 /boot ext3 defaults 1 2
> #normal /dev/md4 /root2 ext3 defaults 1 2
> #normal /dev/md1 /boot2 ext3 defaults 1 2
> tmpfs /dev/shm tmpfs defaults 0 0
> devpts /dev/pts devpts gid=5,mode=620 0 0
> sysfs /sys sysfs defaults 0 0
> proc /proc proc defaults 0 0
> /dev/md2 swap swap defaults 0 0
>
>
> What magic am I missing? Please help!!!
>
> Bob
>
> On Mon, Jul 09, 2012 at 06:21:43PM -0400, Bob Toxen wrote:
>> I'm having problems getting my RAID mirror booting again in
>> CentOS 5.8 64-bit after GNU's gparted trashed my partition table when
>> one of the new Seagate 1 TB SATA disks failed completely.
>>
>> Booting causes the kernel to complain of an invalid root partition.
>> I can't figure out what is invalid about it. FSCK is happy.
>>
>> I rebuilt the partition table with "CD Rescue" v2.7.1 bootable CD
>> (wonderful stuff) and reinstalled my existing GRUB to the MBR.
>>
>>
>> Unfortunately, CentOS 5.8 will no longer recognize my md3 root partition
>> (still running in degraded mode since I've not yet replaced the failed
>> disk). The CentOS kernel does see the RAID partitions but just doesn't
>> want to use one for /!!!
>>
>> I CAN boot the "CD Rescue" disk and tell it to find and boot an
>> existing OS on the hard disk and it will bring up my CentOS 5.8 on
>> md3 (before I did mdadm -S /dev/md3 and used fdisk to change its
>> partition type from RAID to Linux)
>> or will boot up md4 (though apparently with a newer 3.x kernel from the CD).
>>
>> However, no matter what I do I cannot get my CentOS 5.8 to boot the RAID
>> mirror. I also have the root and /boot file systems duplicated to
>> md4 and md1 respectively.
>>
>> I tried
>>
>> mdadm -S /dev/md3
>> mdadm -S /dev/md1
>> mdadm --create -n 2 /dev/md3 --level=mirror /dev/sda6 missing
>> "/dev/sda6 appears to be part of an existing RAID; do anyway" y
>> mdadm --create -n 2 /dev/md1 --level=mirror /dev/sda2 missing
>> "/dev/sda2 appears to be part of an existing RAID; do anyway" y
>>
>> but when I try to boot CentOS 5.8 with "root-md3" it again fails.
>>
>> However, booting the CD Rescue and selecting "find and boot from hard
>> disk" happily boots either RAID partition.
>>
>>
>> Please PLEASE give me some ideas to fix this. $50 to the first right
>> answer, REALLY! I'm desperate.
>>
>> Nothing on the file systems themselves, including the /boot/initrd*
>> file and kernels, appears to be damaged.
>>
>>
>> Since FSCK is happy all I can think of is damage to the md superblock at
>> the end of the physical partitions. I wonder if the CD Rescue kernel
>> (being newer than the CentOS kernel) created too new a version of the
>> md superblock. However, my recreating the RAID mirror under CentOS 5.8
>> should have fixed it, shouldn't it?
>>
>> HELP! $50 prize. Really!
>>
>> Bob Toxen
>> bob at VerySecureLinux.com
>> transam at VerySecureLinux.com
>> 404-216-5100 Cell (except for the next half hour)
>> _______________________________________________
>> Ale mailing list
>> Ale at ale.org
>> http://mail.ale.org/mailman/listinfo/ale
>> See JOBS, ANNOUNCE and SCHOOLS lists at
>> http://mail.ale.org/mailman/listinfo