[DRBD-user] DRBD 8.0.13 SyncTarget crashing with alloc_ee: Allocation of a page failed

Peter Luciak Peter.Luciak at iblsoft.com
Fri Feb 20 12:29:32 CET 2009

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


Lars Ellenberg wrote:
> On Tue, Feb 03, 2009 at 09:47:37AM +0100, Peter Luciak wrote:
>> Hello all,
>>
>> I'm experiencing weird crashes with drbd 8.0.13 when trying to  
>> resynchronize the secondary node. The secondary crashes (without any  
>> oops-es or other information in /var/log/messages) after some random  
>> period of resynchronization (around 20-30%).
>>
>> On the primary there is a 2.6.15.6 kernel and on the secondary I tried  
>> upgrading to 2.6.26.8. Now the resync went OK, but when I tested it  
>> again, it crashed again. This is a 64b kernel and the machine has  
>> Adaptec AIC7902 Ultra320 SCSI adapter with 4 disks in software RAID1  
>> configuration. Interestingly, this problem started to appear when we  
>> replaced one disk in the RAID array.
>>
>> Another drbd-user thread which I had found suggests that this could be  
>> related to Supermicro motherboards. Indeed, there is  SuperMicro X6DA8  
>> G2 i7525 on the primary, but TYAN Thunder i7525 on the secondary (ie.  
>> the one which crashes). I've tried to load default settings on the Tyan  
>> board, but to no avail.
> 
> it would be nice to capture the actual reason of the "crash"...
> serial console?

Hello,
I've managed to hook up a serial console and capture the output on 
another computer, but I don't get any stack trace.

The last messages I see on the serial console are:
Feb 20 15:05:51 vwsrv2 kernel: drbd2: Resumed IO
Feb 20 15:05:51 vwsrv2 kernel: drbd2: Becoming sync target due to disk 
states.
Feb 20 15:05:51 vwsrv2 kernel: drbd2: peer( Unknown -> Primary ) conn( 
WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate )
Feb 20 15:05:51 vwsrv2 kernel: drbd2: Writing meta data super block now.
Feb 20 15:05:51 vwsrv2 kernel: drbd2: conn( WFBitMapT -> WFSyncUUID )
Feb 20 15:05:51 vwsrv2 kernel: drbd2: conn( WFSyncUUID -> SyncTarget )
Feb 20 15:05:51 vwsrv2 kernel: drbd2: Writing meta data super block now.
Feb 20 15:05:51 vwsrv2 kernel: drbd2: Began resync as SyncTarget (will 
sync 75431400 KB [18857850 bits set]).
drbd2: local disk flush failed with status -95
Feb 20 15:05:52 vwsrv2 kernel: drbd2: local disk flush failed with 
status -95
Linux version 2.6.26.8 (root at vwsrv2.met.gov.om) (gcc version 4.0.2 
20051125 (Red Hat 4.0.2-8)) #2 SMP Thu Feb 19 11:57:53 GST 2009
Command line: ro root=/dev/md0 console=ttyS0,115200 console=tty0

The machine is then self-rebooted via iTCO-wdt watchdog.

I have done a "echo 8 > /proc/sys/kernel/printk" and also enabled the 
"early printk" in kernel config, but still no usable output. In 
/etc/syslog.conf I have
kern.*							/dev/ttyS0

Any suggestions are welcome.

Thanks,
Peter
-- 
Peter LUCIAK (Peter.Luciak at iblsoft.com)
IBL Software Engineering, http://www.iblsoft.com/
Mierová 103, 82105 Bratislava, Slovakia
Phone: +421-2-32662111, Fax: +421-2-32662110
Direct: +421-2-32662175



More information about the drbd-user mailing list