Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Lars Ellenberg wrote: > On Tue, Feb 03, 2009 at 09:47:37AM +0100, Peter Luciak wrote: >> Hello all, >> >> I'm experiencing weird crashes with drbd 8.0.13 when trying to >> resynchronize the secondary node. The secondary crashes (without any >> oops-es or other information in /var/log/messages) after some random >> period of resynchronization (around 20-30%). >> >> On the primary there is a 2.6.15.6 kernel and on the secondary I tried >> upgrading to 2.6.26.8. Now the resync went OK, but when I tested it >> again, it crashed again. This is a 64b kernel and the machine has >> Adaptec AIC7902 Ultra320 SCSI adapter with 4 disks in software RAID1 >> configuration. Interestingly, this problem started to appear when we >> replaced one disk in the RAID array. >> >> Another drbd-user thread which I had found suggests that this could be >> related to Supermicro motherboards. Indeed, there is SuperMicro X6DA8 >> G2 i7525 on the primary, but TYAN Thunder i7525 on the secondary (ie. >> the one which crashes). I've tried to load default settings on the Tyan >> board, but to no avail. > > it would be nice to capture the actual reason of the "crash"... > serial console? Hello, I've managed to hook up a serial console and capture the output on another computer, but I don't get any stack trace. The last messages I see on the serial console are: Feb 20 15:05:51 vwsrv2 kernel: drbd2: Resumed IO Feb 20 15:05:51 vwsrv2 kernel: drbd2: Becoming sync target due to disk states. Feb 20 15:05:51 vwsrv2 kernel: drbd2: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate ) Feb 20 15:05:51 vwsrv2 kernel: drbd2: Writing meta data super block now. Feb 20 15:05:51 vwsrv2 kernel: drbd2: conn( WFBitMapT -> WFSyncUUID ) Feb 20 15:05:51 vwsrv2 kernel: drbd2: conn( WFSyncUUID -> SyncTarget ) Feb 20 15:05:51 vwsrv2 kernel: drbd2: Writing meta data super block now. Feb 20 15:05:51 vwsrv2 kernel: drbd2: Began resync as SyncTarget (will sync 75431400 KB [18857850 bits set]). drbd2: local disk flush failed with status -95 Feb 20 15:05:52 vwsrv2 kernel: drbd2: local disk flush failed with status -95 Linux version 2.6.26.8 (root at vwsrv2.met.gov.om) (gcc version 4.0.2 20051125 (Red Hat 4.0.2-8)) #2 SMP Thu Feb 19 11:57:53 GST 2009 Command line: ro root=/dev/md0 console=ttyS0,115200 console=tty0 The machine is then self-rebooted via iTCO-wdt watchdog. I have done a "echo 8 > /proc/sys/kernel/printk" and also enabled the "early printk" in kernel config, but still no usable output. In /etc/syslog.conf I have kern.* /dev/ttyS0 Any suggestions are welcome. Thanks, Peter -- Peter LUCIAK (Peter.Luciak at iblsoft.com) IBL Software Engineering, http://www.iblsoft.com/ Mierová 103, 82105 Bratislava, Slovakia Phone: +421-2-32662111, Fax: +421-2-32662110 Direct: +421-2-32662175