[DRBD-user] DRBD 8.0.13 SyncTarget crashing with alloc_ee: Allocation of a page failed
Peter Luciak
Peter.Luciak at iblsoft.com
Fri Feb 20 12:29:32 CET 2009
Lars Ellenberg wrote:
> On Tue, Feb 03, 2009 at 09:47:37AM +0100, Peter Luciak wrote:
>> Hello all,
>>
>> I'm experiencing weird crashes with drbd 8.0.13 when trying to
>> resynchronize the secondary node. The secondary crashes (without any
>> oops-es or other information in /var/log/messages) after some random
>> period of resynchronization (around 20-30%).
>>
>> On the primary there is a 2.6.15.6 kernel and on the secondary I tried
>> upgrading to 2.6.26.8. Now the resync went OK, but when I tested it
>> again, it crashed again. This is a 64b kernel and the machine has
>> Adaptec AIC7902 Ultra320 SCSI adapter with 4 disks in software RAID1
>> configuration. Interestingly, this problem started to appear when we
>> replaced one disk in the RAID array.
>>
>> Another drbd-user thread which I had found suggests that this could be
>> related to Supermicro motherboards. Indeed, there is SuperMicro X6DA8
>> G2 i7525 on the primary, but TYAN Thunder i7525 on the secondary (ie.
>> the one which crashes). I've tried to load default settings on the Tyan
>> board, but to no avail.
>
> it would be nice to capture the actual reason of the "crash"...
> serial console?
Hello,
I've managed to hook up a serial console and capture the output on
another computer, but I don't get any stack trace.
The last messages I see on the serial console are:
Feb 20 15:05:51 vwsrv2 kernel: drbd2: Resumed IO
Feb 20 15:05:51 vwsrv2 kernel: drbd2: Becoming sync target due to disk
states.
Feb 20 15:05:51 vwsrv2 kernel: drbd2: peer( Unknown -> Primary ) conn(
WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate )
Feb 20 15:05:51 vwsrv2 kernel: drbd2: Writing meta data super block now.
Feb 20 15:05:51 vwsrv2 kernel: drbd2: conn( WFBitMapT -> WFSyncUUID )
Feb 20 15:05:51 vwsrv2 kernel: drbd2: conn( WFSyncUUID -> SyncTarget )
Feb 20 15:05:51 vwsrv2 kernel: drbd2: Writing meta data super block now.
Feb 20 15:05:51 vwsrv2 kernel: drbd2: Began resync as SyncTarget (will
sync 75431400 KB [18857850 bits set]).
drbd2: local disk flush failed with status -95
Feb 20 15:05:52 vwsrv2 kernel: drbd2: local disk flush failed with
status -95
Linux version 2.6.26.8 (root at vwsrv2.met.gov.om) (gcc version 4.0.2
20051125 (Red Hat 4.0.2-8)) #2 SMP Thu Feb 19 11:57:53 GST 2009
Command line: ro root=/dev/md0 console=ttyS0,115200 console=tty0
The machine is then self-rebooted via iTCO-wdt watchdog.
I have done a "echo 8 > /proc/sys/kernel/printk" and also enabled the
"early printk" in kernel config, but still no usable output. In
/etc/syslog.conf I have
kern.* /dev/ttyS0
Any suggestions are welcome.
Thanks,
Peter
--
Peter LUCIAK (Peter.Luciak at iblsoft.com)
IBL Software Engineering, http://www.iblsoft.com/
Mierová 103, 82105 Bratislava, Slovakia
Phone: +421-2-32662111, Fax: +421-2-32662110
Direct: +421-2-32662175
More information about the drbd-user
mailing list