[DRBD-user] kernel oops drbd 8.0_pre2 on Fedora Core 5 and RHEL4

Langemeyer, Werner (IBW) Werner.Langemeyer at de.bp.com
Wed Apr 12 09:18:48 CEST 2006


The behaviour has changed, but it is still not working.

See /var/log/messages:
**********************
Apr 12 09:06:53 emdc-ha1 kernel: drbd: initialised. Version: 8.0-pre2
(api:81/proto:80)
Apr 12 09:06:53 emdc-ha1 kernel: drbd: SVN Revision: 2143M build by
root at emdc-devel.in-geseke.de, 2006-04-12 08:59:53
Apr 12 09:06:53 emdc-ha1 kernel: drbd: registered as block device major
147
Apr 12 09:06:53 emdc-ha1 kernel: drbd0: disk( Diskless -> Attaching )
Apr 12 09:06:53 emdc-ha1 kernel: klogd 1.4.1, ---------- state change
----------
Apr 12 09:06:53 emdc-ha1 kernel: drbd0: drbd_bm_resize called with
capacity == 786336
Apr 12 09:06:53 emdc-ha1 kernel: drbd0: bits = 98292 in
/usr/src/redhat/BUILD/drbd-0.8/drbd/drbd_bitmap.c:369
Apr 12 09:06:53 emdc-ha1 kernel: drbd0: resync bitmap: bits=98292
words=3072
Apr 12 09:06:53 emdc-ha1 kernel: drbd0: size = 383 MB (393168 KB)
Apr 12 09:07:13 emdc-ha1 kernel: hda: dma_timer_expiry: dma status ==
0x26
Apr 12 09:07:13 emdc-ha1 kernel: hda: DMA interrupt recovery
Apr 12 09:07:13 emdc-ha1 kernel: hda: lost interrupt
Apr 12 09:07:13 emdc-ha1 kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Apr 12 09:07:13 emdc-ha1 kernel: hda: dma_intr: error=0xc0 { BadSector
UncorrectableError }, LBAsect=6631661, sector=6631661
Apr 12 09:07:13 emdc-ha1 kernel: ide: failed opcode was: unknown
Apr 12 09:07:13 emdc-ha1 kernel: end_request: I/O error, dev hda, sector
6631661
Apr 12 09:07:33 emdc-ha1 kernel: hda: dma_timer_expiry: dma status ==
0x26
Apr 12 09:07:53 emdc-ha1 kernel: hda: DMA interrupt recovery
Apr 12 09:07:53 emdc-ha1 kernel: hda: lost interrupt
Apr 12 09:07:53 emdc-ha1 kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Apr 12 09:07:53 emdc-ha1 kernel: hda: dma_intr: error=0xc0 { BadSector
UncorrectableError }, LBAsect=6631669, sector=6631669
Apr 12 09:07:53 emdc-ha1 kernel: ide: failed opcode was: unknown
Apr 12 09:07:53 emdc-ha1 kernel: end_request: I/O error, dev hda, sector
6631669
Apr 12 09:07:53 emdc-ha1 kernel: hda: dma_timer_expiry: dma status ==
0x26
Apr 12 09:07:53 emdc-ha1 kernel: hda: DMA interrupt recovery
Apr 12 09:07:53 emdc-ha1 kernel: hda: lost interrupt
Apr 12 09:07:53 emdc-ha1 kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Apr 12 09:07:53 emdc-ha1 kernel: hda: dma_intr: error=0xc0 { BadSector
UncorrectableError }, LBAsect=6631677, sector=6631677
Apr 12 09:07:53 emdc-ha1 kernel: ide: failed opcode was: unknown
Apr 12 09:07:53 emdc-ha1 kernel: end_request: I/O error, dev hda, sector
6631677
Apr 12 09:07:53 emdc-ha1 kernel: drbd0: writing of bitmap took 15008
jiffies
Apr 12 09:07:53 emdc-ha1 kernel: drbd0: we had at least one MD IO ERROR
during bitmap IO
Apr 12 09:07:53 emdc-ha1 kernel: drbd0: disk( Attaching -> Failed )
Apr 12 09:07:53 emdc-ha1 kernel: drbd0: Local IO failed. Detaching...
Apr 12 09:07:53 emdc-ha1 kernel: drbd0: disk( Failed -> Diskless )
Apr 12 09:07:53 emdc-ha1 kernel: drbd0: short sent ReportState size=12
sent=-1000
Apr 12 09:07:53 emdc-ha1 kernel: drbd0: Notified peer that my disk is
broken.
Apr 12 09:07:53 emdc-ha1 kernel: drbd0: ASSERT(
drbd_md_test_flag(mdev->bc,MDF_FullSync) ) in
/usr/src/redhat/BUILD/drbd-0.8/drbd/drbd_main.c:507
Apr 12 09:07:53 emdc-ha1 kernel: drbd0: Writing meta data super block
now.
Apr 12 09:07:53 emdc-ha1 kernel: drbd0: drbd_md_sync:
(!inc_md_only(mdev,Attaching)) in
/usr/src/redhat/BUILD/drbd-0.8/drbd/drbd_main.c:2508
Apr 12 09:07:54 emdc-ha1 kernel: drbd0: Not releasing backing storage
device.
Apr 12 09:07:54 emdc-ha1 kernel: drbd0: 383 MB marked out-of-sync by on
disk bit-map.
Apr 12 09:07:54 emdc-ha1 kernel: drbd0: 393168 KB now marked out-of-sync
by on disk bit-map.
Apr 12 09:08:14 emdc-ha1 kernel: hda: dma_timer_expiry: dma status ==
0x26
Apr 12 09:08:14 emdc-ha1 kernel: hda: DMA interrupt recovery
Apr 12 09:08:14 emdc-ha1 kernel: hda: lost interrupt
Apr 12 09:08:14 emdc-ha1 kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Apr 12 09:08:14 emdc-ha1 kernel: hda: dma_intr: error=0xc0 { BadSector
UncorrectableError }, LBAsect=6631661, sector=6631661
Apr 12 09:08:14 emdc-ha1 kernel: ide: failed opcode was: unknown
Apr 12 09:08:14 emdc-ha1 kernel: end_request: I/O error, dev hda, sector
6631661
Apr 12 09:08:34 emdc-ha1 kernel: hda: dma_timer_expiry: dma status ==
0x26
Apr 12 09:08:34 emdc-ha1 kernel: hda: DMA interrupt recovery
Apr 12 09:08:34 emdc-ha1 kernel: hda: lost interrupt
Apr 12 09:08:57 emdc-ha1 kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Apr 12 09:08:57 emdc-ha1 kernel: hda: dma_intr: error=0xc0 { BadSector
UncorrectableError }, LBAsect=6631669, sector=6631669
Apr 12 09:08:57 emdc-ha1 kernel: ide: failed opcode was: unknown
Apr 12 09:08:57 emdc-ha1 kernel: end_request: I/O error, dev hda, sector
6631669
Apr 12 09:08:57 emdc-ha1 kernel: hda: dma_timer_expiry: dma status ==
0x26
Apr 12 09:08:57 emdc-ha1 kernel: hda: DMA interrupt recovery
Apr 12 09:08:57 emdc-ha1 kernel: hda: lost interrupt
Apr 12 09:08:57 emdc-ha1 kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Apr 12 09:08:57 emdc-ha1 kernel: hda: dma_intr: error=0xc0 { BadSector
UncorrectableError }, LBAsect=6631677, sector=6631677
Apr 12 09:08:57 emdc-ha1 kernel: ide: failed opcode was: unknown
Apr 12 09:08:57 emdc-ha1 kernel: end_request: I/O error, dev hda, sector
6631677
Apr 12 09:08:57 emdc-ha1 kernel: drbd0: reading of bitmap took 15006
jiffies
Apr 12 09:08:57 emdc-ha1 kernel: drbd0: we had at least one MD IO ERROR
during bitmap IO
Apr 12 09:08:57 emdc-ha1 kernel: drbd0: disk( Diskless -> Failed )
Apr 12 09:08:57 emdc-ha1 kernel: drbd0: Local IO failed. Detaching...
Apr 12 09:08:57 emdc-ha1 kernel: drbd0: disk( Failed -> Diskless )
Apr 12 09:08:57 emdc-ha1 kernel: drbd0: short sent ReportState size=12
sent=-1000
Apr 12 09:08:57 emdc-ha1 kernel: drbd0: Notified peer that my disk is
broken.
Apr 12 09:08:57 emdc-ha1 kernel: drbd0: ASSERT(
drbd_md_test_flag(mdev->bc,MDF_FullSync) ) in
/usr/src/redhat/BUILD/drbd-0.8/drbd/drbd_main.c:507
Apr 12 09:08:57 emdc-ha1 kernel: drbd0: Writing meta data super block
now.
Apr 12 09:08:57 emdc-ha1 kernel: drbd0: drbd_md_sync:
(!inc_md_only(mdev,Attaching)) in
/usr/src/redhat/BUILD/drbd-0.8/drbd/drbd_main.c:2508
Apr 12 09:08:57 emdc-ha1 kernel: drbd0: Releasing backing storage
device.
Apr 12 09:08:57 emdc-ha1 kernel: drbd0: recounting of set bits took
additional 0 jiffies
Apr 12 09:08:57 emdc-ha1 kernel: drbd0: 383 MB marked out-of-sync by on
disk bit-map.
Apr 12 09:08:57 emdc-ha1 kernel: Unable to handle kernel NULL pointer
dereference at virtual address 00000064
Apr 12 09:08:57 emdc-ha1 kernel:  printing eip:
Apr 12 09:08:57 emdc-ha1 kernel: d09eab8f
Apr 12 09:08:57 emdc-ha1 kernel: *pde = 00000000
Apr 12 09:08:57 emdc-ha1 kernel: Oops: 0000 [#1]
Apr 12 09:08:57 emdc-ha1 kernel: last sysfs file: /block/drbd0/dev
Apr 12 09:08:58 emdc-ha1 kernel: Modules linked in: drbd(U) ipv6 autofs4
hidp rfcomm l2cap bluetooth sunrpc ip_conntrack_netbios_ns ipt_REJECT
xt_state ip_conntrack nfnetlink xt_tcpudp iptable_filter ip_tables
x_tables acpi_cpufreq video button battery ac lp parport_pc parport
floppy nvram pcnet32 mii i2c_piix4 i2c_core dm_snapshot dm_zero
dm_mirror dm_mod ext3 jbd
Apr 12 09:08:58 emdc-ha1 kernel: CPU:    0
Apr 12 09:08:58 emdc-ha1 kernel: EIP:    0060:[<d09eab8f>]    Not
tainted VLI
Apr 12 09:08:58 emdc-ha1 kernel: EFLAGS: 00010286   (2.6.16-1.2080_FC5
#1)
Apr 12 09:08:58 emdc-ha1 kernel: EIP is at drbd_al_read_tr+0x13/0x138
[drbd]
Apr 12 09:08:58 emdc-ha1 kernel: eax: 00000000   ebx: cca73000   ecx:
00000000   edx: 00000000
Apr 12 09:08:58 emdc-ha1 kernel: esi: 00000000   edi: ffffffff   ebp:
cca73000   esp: c72ced2c
Apr 12 09:08:58 emdc-ha1 kernel: ds: 007b   es: 007b   ss: 0068
Apr 12 09:08:58 emdc-ha1 kernel: Process drbdsetup (pid: 1798,
threadinfo=c72ce000 task=cc04f000)
Apr 12 09:08:58 emdc-ha1 kernel: Stack: <0>00000000 cb039000 cca73000
00000000 ffffffff ffffffff d09eae78 cb039000
Apr 12 09:08:58 emdc-ha1 kernel:        ffffffff 00000000 00000000
424d2033 38333000 00000005 cca73490 0000a002
Apr 12 09:08:58 emdc-ha1 kernel:        d0918000 00000001 0000a402
d09df1ee bfbd4064 cca73000 00000000 cebe3340
Apr 12 09:08:58 emdc-ha1 kernel: Call Trace:
Apr 12 09:08:58 emdc-ha1 kernel:  [<d09eae78>]
drbd_al_read_log+0x95/0x23e [drbd]     [<d09df1ee>]
drbd_ioctl_set_disk+0x4d1/0x700 [drbd]
Apr 12 09:08:58 emdc-ha1 kernel:  [<d09df717>] drbd_ioctl+0x2fa/0x1358
[drbd]     [<c01c065a>] _atomic_dec_and_lock+0x22/0x2c
Apr 12 09:08:58 emdc-ha1 kernel:  [<c01a0154>] avc_has_perm+0x3a/0x44
[<d09df41d>] drbd_ioctl+0x0/0x1358 [drbd]
Apr 12 09:08:58 emdc-ha1 kernel:  [<c01b938d>]
blkdev_driver_ioctl+0x39/0x3f     [<c01b99e6>] blkdev_ioctl+0x62a/0x665
Apr 12 09:08:58 emdc-ha1 kernel:  [<c01a0154>] avc_has_perm+0x3a/0x44
[<c01a06ef>] inode_has_perm+0x54/0x5c
Apr 12 09:08:58 emdc-ha1 kernel:  [<c01a0776>] file_has_perm+0x7f/0x88
[<c015898a>] block_ioctl+0x0/0x16
Apr 12 09:08:58 emdc-ha1 kernel:  [<c015899d>] block_ioctl+0x13/0x16
[<c0161776>] do_ioctl+0x16/0x48
Apr 12 09:08:58 emdc-ha1 kernel:  [<c01619a7>] vfs_ioctl+0x1ff/0x216
[<c0161a06>] sys_ioctl+0x48/0x62
Apr 12 09:08:58 emdc-ha1 kernel:  [<c0102bc1>] syscall_call+0x7/0xb
<0>Code: 8d 34 10 b9 80 00 00 00 f3 a5 eb 02 31 db 89 d8 83 c4 10 5b 5e
5f 5d c3 55 57 56 53 83 ec 04 89 c5 89 14 24 89 c8 8b 55 40 6a 00 <8b>
4a 64 89 cb c1 fb 1f 03 4a 24 13 5a 28 89 c7 c1 ff 1f 01 c1
Continuing in 85 seconds.  rnel: Continuing in 120 seconds.
Continuing in 48 seconds. ernel: tinuing in 84 seconds.
Continuing in 11 seconds. ernel: tinuing in 47 seconds.
Continuing in 1 seconds. kernel: tinuing in 10 seconds.
[root at emdc-ha1 ~]#
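
Reading the oops above a little further: the byte marked <8b> in the Code: line is where the fault happened. The sketch below decodes only that one instruction form (opcode 8b with a one-byte displacement) by hand. The helper name decode_mov_load is made up for this illustration, and it is not a general disassembler.

```python
# Register numbering used by the x86 ModRM byte (32-bit registers).
REG32 = ["eax", "ecx", "edx", "ebx", "esp", "ebp", "esi", "edi"]


def decode_mov_load(code):
    """Decode 'mov disp8(%base), %dest' (opcode 8b, ModRM mod=01, no SIB).

    Returns (dest_register, base_register, displacement).
    """
    opcode, modrm, disp = code[0], code[1], code[2]
    if opcode != 0x8B:
        raise ValueError("not an 8b (mov r32, r/m32) instruction")
    mod = modrm >> 6          # addressing mode
    reg = (modrm >> 3) & 7    # destination register
    rm = modrm & 7            # base register
    if mod != 1 or rm == 4:
        raise ValueError("only the disp8 form without SIB is handled")
    if disp >= 0x80:          # the displacement is a signed byte
        disp -= 0x100
    return REG32[reg], REG32[rm], disp


if __name__ == "__main__":
    # Bytes taken from the Code: line of the oops.
    for raw in ("8b4a64", "8b5540"):
        dest, base, disp = decode_mov_load(bytes.fromhex(raw))
        print(f"{raw}: mov {disp:#x}(%{base}), %{dest}")
```

For the bytes 8b 4a 64 this gives a load from 0x64(%edx), and the register dump shows edx: 00000000, which matches the reported address 00000064. The earlier bytes 8b 55 40 load edx from 0x40(%ebp). So a pointer stored at offset 0x40 of whatever ebp points to appears to have been NULL when drbd_al_read_tr ran; which structure member that is has not been checked against the source.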

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
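
To summarise which sectors of hda failed in the log above, something like the following could be run over /var/log/messages. It is only a sketch: the function name failing_sectors is made up here, and the pattern is written against the end_request lines exactly as they appear in this mail, including the line breaks added by the mail client.

```python
import re

# Matches e.g. "end_request: I/O error, dev hda, sector 6631661".
# \s+ lets the match span a line break between "sector" and the number.
IO_ERROR = re.compile(r"end_request: I/O error, dev (\w+), sector\s+(\d+)")


def failing_sectors(log_text):
    """Return {device: sorted list of distinct failing sector numbers}."""
    found = {}
    for dev, sector in IO_ERROR.findall(log_text):
        found.setdefault(dev, set()).add(int(sector))
    return {dev: sorted(sectors) for dev, sectors in found.items()}


if __name__ == "__main__":
    # A few lines copied from the log in this mail.
    sample = """\
Apr 12 09:07:13 emdc-ha1 kernel: end_request: I/O error, dev hda, sector
6631661
Apr 12 09:07:53 emdc-ha1 kernel: end_request: I/O error, dev hda, sector
6631669
Apr 12 09:07:53 emdc-ha1 kernel: end_request: I/O error, dev hda, sector
6631677
Apr 12 09:08:14 emdc-ha1 kernel: end_request: I/O error, dev hda, sector
6631661
"""
    print(failing_sectors(sample))
```

In the log above the same three sectors (6631661, 6631669, 6631677) fail during both attach attempts. They are 8 sectors apart, i.e. three consecutive 4 KiB blocks if a sector is 512 bytes.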

[root at emdc-ha1 ~]# modinfo drbd
filename:
/lib/modules/2.6.16-1.2080_FC5/kernel/drivers/block/drbd.ko
author:         Philipp Reisner <phil at linbit.com>, Lars Ellenberg
<lars at linbit.com>
description:    drbd - Distributed Replicated Block Device v8.0-pre2
license:        GPL
alias:          block-major-147-*
vermagic:       2.6.16-1.2080_FC5 686 REGPARM 4KSTACKS gcc-4.1
depends:
srcversion:     BEAA407474809C2576FE8CC
parm:           disable_bd_claim:DONT USE! disables block device
claiming (bool)
parm:           minor_count:Maximum number of drbd devices (1-255) (int)

[root at emdc-ha1 ~]# service drbd status
drbd driver loaded OK; device status:
version: 8.0-pre2 (api:81/proto:80)
SVN Revision: 2143M build by root at emdc-devel.in-geseke.de, 2006-04-12
08:59:53
 0: cs:Unconfigured
        resync: used:0/7 hits:0 misses:0 starving:0 dirty:0 changed:0
        act_log: used:0/257 hits:0 misses:0 starving:0 dirty:0 changed:0

[root at emdc-ha1 ~]# service drbd stop
Stopping all DRBD resourcesChild process does not terminate!
Exiting.
ERROR: Module drbd is in use
.
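
The bitmap figures at the top of the log can be cross-checked with a few lines of arithmetic. The sketch assumes that the capacity is counted in 512-byte sectors, that one bitmap bit covers 4 KiB, and that a bitmap word is 32 bits wide on this i686 kernel. None of this is stated in the log; it is only what makes the logged numbers agree. The name bitmap_geometry is made up for the illustration.

```python
SECTOR_BYTES = 512     # assumption: capacity is given in 512-byte sectors
BYTES_PER_BIT = 4096   # assumption: one bitmap bit per 4 KiB of storage
BITS_PER_WORD = 32     # assumption: 32-bit words on this i686 kernel


def ceil_div(a, b):
    """Integer division rounding up (for non-negative a, positive b)."""
    return -(-a // b)


def bitmap_geometry(capacity_sectors):
    """Derive device size and bitmap size from a capacity in sectors."""
    size_bytes = capacity_sectors * SECTOR_BYTES
    size_kb = size_bytes // 1024
    bits = ceil_div(size_bytes, BYTES_PER_BIT)
    words = ceil_div(bits, BITS_PER_WORD)
    return {
        "size_kb": size_kb,
        "size_mb": size_kb // 1024,  # rounded down, as in the log
        "bits": bits,
        "words": words,
    }


if __name__ == "__main__":
    # Value from "drbd_bm_resize called with capacity == 786336".
    print(bitmap_geometry(786336))
```

For capacity 786336 this yields 393168 KB, 383 MB, 98292 bits and 3072 words, the same values that drbd0 printed.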

-----Original Message-----
From: drbd-user-bounces at linbit.com [mailto:drbd-user-bounces at linbit.com]
On Behalf Of Lars Ellenberg
Sent: Tuesday, 11 April 2006 18:19
To: drbd-user at linbit.com
Subject: Re: [DRBD-user] kernel oops drbd 8.0_pre2 on Fedora Core 5 and
RHEL4

/ 2006-04-11 15:51:18 +0100
\ Langemeyer, Werner (IBW):
> Lars,
> 
> the patch is really in use. Information below, what would you like me 
> to do?

Comment out the call to blk_run_queue in drbd_bm_rw() in drbd_bitmap.c,
and see what happens.

-- 
: Lars Ellenberg                                  Tel +43-1-8178292-0  :
: LINBIT Information Technologies GmbH            Fax +43-1-8178292-82 :
: Schoenbrunner Str. 244, A-1120 Vienna/Europe   http://www.linbit.com :
__
please use the "List-Reply" function of your email client.
_______________________________________________
drbd-user mailing list
drbd-user at lists.linbit.com
http://lists.linbit.com/mailman/listinfo/drbd-user


