[DRBD-user] kernel oops while syncing

Lars Ellenberg lars.ellenberg at linbit.com
Mon Nov 16 17:26:20 CET 2009

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


On Mon, Nov 16, 2009 at 04:30:38PM +0100, Johan Euphrosine wrote:
> Hi,
> 
> I had the following kernel oops when running an initial sync with drbd:
> drbd8-2.6.30-2-amd64                2:8.3.2-3+2.6.30-8
> 
> Let me know if you need more information:

Please try to avoid line wraps when pasting logs.
Anyways, I removed those, so it should be readable this time.

Thanks for reporting.  This appears to be a real bug in drbd
caused by a race condition when tearing down the connection.

Will be fixed.

> Nov 16 10:06:36 z2-7 kernel: [1033136.244033] block drbd3: peer( Primary -> Unknown ) conn( SyncTarget -> Timeout ) pdsk( UpToDate -> DUnknown )
> Nov 16 10:06:36 z2-7 kernel: [1033136.244050] block drbd3: short sent RSWriteAck size=32 sent=11
> Nov 16 10:06:36 z2-7 kernel: [1033136.244064] block drbd3: drbd_pp_alloc interrupted!
> Nov 16 10:06:36 z2-7 kernel: [1033136.244069] block drbd3: alloc_ee: Allocation of a page failed
> Nov 16 10:06:36 z2-7 kernel: [1033136.244074] block drbd3: error receiving RSDataReply, l: 4120!
> Nov 16 10:06:36 z2-7 kernel: [1033136.245974] block drbd3: process_done_ee() = NOT_OK
> Nov 16 10:06:36 z2-7 kernel: [1033136.246001] block drbd3: asender terminated Nov 16 10:06:36 z2-7 kernel: [1033136.246006] block drbd3: Terminating asender thread
> Nov 16 10:06:36 z2-7 kernel: [1033136.246970] BUG: unable to handle kernel NULL pointer dereference at 0000000000000008
> Nov 16 10:06:36 z2-7 kernel: [1033136.247018] IP: [<ffffffff8040d3ab>] sk_stream_wait_memory+0x88/0x1e5
> Nov 16 10:06:36 z2-7 kernel: [1033136.247051] PGD 41c536067 PUD 41c5b9067 PMD 0 Nov 16 10:06:36 z2-7 kernel: [1033136.247078] Oops: 0002 [#1] SMP
> Nov 16 10:06:36 z2-7 kernel: [1033136.247103] last sysfs file: /sys/devices/virtual/block/drbd3/removable
> Nov 16 10:06:36 z2-7 kernel: [1033136.247132] CPU 0 Nov 16 10:06:36 z2-7 kernel: [1033136.247152] Modules linked in: hmac nfs lockd fscache nfs_acl auth_rpcgss sunrpc kvm_amd kvm iptable_filter ip_tables x_tables tun bridge stp drbd cn loop snd_pcsp snd_pcm snd_timer i2c_nforce2 snd soundcore snd_page_alloc i2c_core k8temp shpchp pci_hotplug serio_raw evdev psmouse button processor ext3 jbd mbcache dm_mod usbhid hid sd_mod crc_t10dif ata_generic ide_pci_generic ohci_hcd ehci_hcd amd74xx sata_nv ide_core forcedeth libata scsi_mod floppy thermal fan thermal_sys [last unloaded: scsi_wait_scan]
> Nov 16 10:06:36 z2-7 kernel: [1033136.247392] Pid: 29255, comm: drbd3_worker Not tainted 2.6.30-2-amd64 #1 H8DMR-82
> Nov 16 10:06:36 z2-7 kernel: [1033136.247435] RIP: 0010:[<ffffffff8040d3ab>]  [<ffffffff8040d3ab>] sk_stream_wait_memory+0x88/0x1e5
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] RSP: 0018:ffff88021dda5a40  EFLAGS: 00010246
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] RAX: 0000000000000000 RBX: 00000000000005dc RCX: 000000000000afce
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] RDX: 0000000000000008 RSI: 0000000000000000 RDI: ffffffff804065c6
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] RBP: ffff88041c472380 R08: 0000000000000000 R09: ffff88041c472380
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] R10: ffff88021d1a7114 R11: ffff88021dda5b08 R12: 00000000000005dc
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] R13: 0000000000000000 R14: ffff88021dda5b08 R15: 7fffffffffffffff
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] FS: 00007f0a8edef790(0000) GS:ffffc20000000000(0000) knlGS:0000000000000000
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] CR2: 0000000000000008 CR3: 000000041c5b6000 CR4: 00000000000006e0
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] Process drbd3_worker (pid: 29255, threadinfo ffff88021dda4000, task ffff88021cdb2ab0)
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] Stack: Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  0000000000000000 ffff88021cdb2ab0 ffffffff80254742 ffff88021dda5a58
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  ffff88021dda5a58 0000000000000000 ffff88041d48a8e8 ffff88041c57f6c0
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  ffff88041c472380 ffff88021dda5b14 ffff88021cdb1000 0000000000000000
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] Call Trace: Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffff80254742>] ? autoremove_wake_function+0x0/0x2e
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffff8043efda>] ? tcp_sendmsg+0x6fa/0x85b
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffff80403f24>] ? sock_sendmsg+0xa3/0xbb
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffff8023bc9a>] ? default_wake_function+0x0/0x9
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffff80254742>] ? autoremove_wake_function+0x0/0x2e
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffff8020e5a9>] ? __switch_to+0xae/0x263
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffff80235f65>] ? dequeue_entity+0xf/0x11f
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffff804041f2>] ? kernel_sendmsg+0x2c/0x3e
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffffa024dbb5>] ? drbd_send+0xb9/0x1cf [drbd]
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffff804b45e8>] ? schedule+0x9/0x1e
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffffa024e4f1>] ? _drbd_send_cmd+0x16f/0x183 [drbd]
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffffa024e81c>] ? drbd_send_cmd+0x64/0x8d [drbd]
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffffa024e998>] ? drbd_send_b_ack+0x37/0x40 [drbd]
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffffa023bdfd>] ? drbd_may_finish_epoch+0x122/0x2f8 [drbd]
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffffa023c305>] ? w_flush+0x54/0x5d [drbd]
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffffa02368be>] ? drbd_worker+0x4c6/0x4d3 [drbd]
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffff804b47df>] ? schedule_timeout+0x9b/0xb6
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffff804b47cf>] ? schedule_timeout+0x8b/0xb6
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffffa024ce5b>] ? drbd_thread_setup+0x16f/0x230 [drbd]
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffff80210aca>] ? child_rip+0xa/0x20
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffffa024ccec>] ? drbd_thread_setup+0x0/0x230 [drbd]
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  [<ffffffff80210ac0>] ? child_rip+0x0/0x20
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] Code: f4 ff ba 32 00 00 00 89 d1 31 d2 f7 f1 83 c2 02 41 89 d4 4d 89 e5 49 bf ff ff ff ff ff ff ff 7f 48 8b 85 e8 01 00 00 48 8d 50 08 <f0> 80 48 08 01 48 8b 7d 78 ba 01 00 00 00 48 89 e6 e8 09 75 e4
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] RIP [<ffffffff8040d3ab>] sk_stream_wait_memory+0x88/0x1e5
> Nov 16 10:06:36 z2-7 kernel: [1033136.250009]  RSP <ffff88021dda5a40> Nov 16 10:06:36 z2-7 kernel: [1033136.250009] CR2: 0000000000000008 2ddd1cdd4c0c8cf4 ]---

-- 
: Lars Ellenberg
: LINBIT | Your Way to High Availability
: DRBD/HA support and consulting http://www.linbit.com

DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.
__
please don't Cc me, but send to list   --   I'm subscribed



More information about the drbd-user mailing list