[DRBD-user] Kernel crash in drbd0_asender, drbd 8.3.9

Lars Ellenberg lars.ellenberg at linbit.com
Fri Jul 8 16:08:54 CEST 2011

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


On Fri, Jul 08, 2011 at 02:39:39PM +0200, Mrten wrote:
> On 08-07-2011 09:55:36, Mrten wrote:
> > Tonight my secondary crashed while importing a large set of (openstreetmap) data with
> > postgresql on the primary:
> 
> and again just now, this is copy/paste from the serial console. still importing data.

Upgrade your DRBD module to 8.3.11 please.

> [ 8808.155307] block drbd0: Digest integrity check FAILED.
> [ 8808.160558] block drbd0: error receiving Data, l: 73768!
> [ 8818.464125] general protection fault: 0000 [#1] SMP [ 8818.469136] last sysfs file: /sys/devices/system/cpu/cpu7/cache/index2/shared_cpu_map
> [ 8818.476974] CPU 0 [ 8818.478811] Modules linked in: usbhid hid sha1_generic drbd lru_cache coretemp ipmi_si ipmi_msghandler bonding ghes pl2303 usbserial lp hed dcdbas power_meter parport xfs exportfs rair
> [ 8818.509608] [ 8818.511115] Pid: 13598, comm: drbd0_asender Tainted: G        W   2.6.38-8-server #42-Ubuntu Dell Inc. PowerEdge R210/05KX61
> [ 8818.522441] RIP: 0010:[<ffffffff811175c9>]  [<ffffffff811175c9>] put_page+0x9/0x40
> [ 8818.530050] RSP: 0018:ffff88041de57a40  EFLAGS: 00010246
> [ 8818.535371] RAX: 0000000000000000 RBX: 0000000000000001 RCX: 000000000000f0cc
> [ 8818.542504] RDX: ffff8804054d1cc0 RSI: 0000000000000006 RDI: 0e3d70a3d7415a20
> [ 8818.549643] RBP: ffff88041de57a40 R08: 0000000000010000 R09: dead000000200200
> [ 8818.556782] R10: ffff880414430680 R11: 0000000000000001 R12: ffff88041968d500
> [ 8818.563921] R13: ffff88041968d500 R14: ffff88041968d528 R15: 0000000000000020
> [ 8818.571060] FS:  0000000000000000(0000) GS:ffff8800bf200000(0000) knlGS:0000000000000000
> [ 8818.579165] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> [ 8818.584910] CR2: 00007fcc97914900 CR3: 000000041dc74000 CR4: 00000000000006f0
> [ 8818.592043] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> [ 8818.599175] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> [ 8818.606313] Process drbd0_asender (pid: 13598, threadinfo ffff88041de56000, task ffff88041a8e44a0)
> [ 8818.615283] Stack:
> [ 8818.617313]  ffff88041de57a60 ffffffff814d70c4 ffff88041968d500 ffff8804054d1c62
> [ 8818.624804]  ffff88041de57a80 ffffffff814d710e ffff88041968d528 ffff880414430680
> [ 8818.632297]  ffff88041de57ad0 ffffffff81530d05 ffff880400000020 00000000db8489f7
> [ 8818.639791] Call Trace:
> [ 8818.642252]  [<ffffffff814d70c4>] skb_release_data+0xb4/0xe0
> [ 8818.647915]  [<ffffffff814d710e>] __kfree_skb+0x1e/0xa0
> [ 8818.653143]  [<ffffffff81530d05>] tcp_rcv_established+0x455/0x720
> [ 8818.659239]  [<ffffffff81538901>] tcp_v4_do_rcv+0xb1/0x1c0
> [ 8818.664734]  [<ffffffff81087940>] ? autoremove_wake_function+0x0/0x40
> [ 8818.671175]  [<ffffffff8152430d>] tcp_prequeue_process+0x5d/0x80
> [ 8818.677189]  [<ffffffff815277c8>] tcp_recvmsg+0x978/0xbb0
> [ 8818.682591]  [<ffffffff8154a88b>] inet_recvmsg+0x6b/0x80
> [ 8818.687914]  [<ffffffff8105882d>] ? enqueue_entity+0x14d/0x2a0
> [ 8818.693749]  [<ffffffff814ce3cd>] sock_recvmsg+0xfd/0x130
> [ 8818.699156]  [<ffffffff8105f554>] ? try_to_wake_up+0x244/0x3e0
> [ 8818.704992]  [<ffffffff8105f702>] ? default_wake_function+0x12/0x20
> [ 8818.711258]  [<ffffffff8104bb39>] ? __wake_up_common+0x59/0x90
> [ 8818.717096]  [<ffffffffa0267680>] drbd_recv_short.clone.22+0x70/0x80 [drbd]
> [ 8818.724062]  [<ffffffffa027109f>] drbd_asender+0x15f/0x590 [drbd]
> [ 8818.730162]  [<ffffffffa02791a0>] ? drbd_thread_setup+0x0/0xf0 [drbd]
> [ 8818.736608]  [<ffffffffa0279204>] drbd_thread_setup+0x64/0xf0 [drbd]
> [ 8818.742971]  [<ffffffffa02791a0>] ? drbd_thread_setup+0x0/0xf0 [drbd]
> [ 8818.749416]  [<ffffffff810871f6>] kthread+0x96/0xa0
> [ 8818.754304]  [<ffffffff8100cde4>] kernel_thread_helper+0x4/0x10
> [ 8818.760226]  [<ffffffff81087160>] ? kthread+0x0/0xa0
> [ 8818.765193]  [<ffffffff8100cde0>] ? kernel_thread_helper+0x0/0x10
> [ 8818.771288] Code: de fe ff ff eb c9 48 8b 03 eb e6 89 c2 0f 1f 44 00 00 e9 5d ff ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 0f 1f 44 00 00 <48> f7 07 00 c0 00 00 75 1f 8b 47 0 [ 8818.791423] RIP  [<ffffffff811175c9>] put_page+0x9/0x40
> [ 8818.796675]  RSP <ffff88041de57a40>
> [ 8818.800521] ---[ end trace f2e7cca539efb9b6 ]---
> 2011 Jul  8 14:2[ 8818.805222] Kernel panic - not syncing: Fatal exception in interrupt
> 5:17 nadir [ 881[ 8818.813006] Pid: 13598, comm: drbd0_asender Tainted: G      D W   2.6.38-8-server #42-Ubuntu
> 8.464125] genera[ 8818.813008] Call Trace:
> l protection fau
> 
> Seems it crashed (hard) the second time through, nothing after "call trace". Keyboard caps lock 
> is not functioning either, nor is sysrq.
> 
> Mrten.
> _______________________________________________
> drbd-user mailing list
> drbd-user at lists.linbit.com
> http://lists.linbit.com/mailman/listinfo/drbd-user

-- 
: Lars Ellenberg
: LINBIT | Your Way to High Availability
: DRBD/HA support and consulting http://www.linbit.com

DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.
__
please don't Cc me, but send to list   --   I'm subscribed



More information about the drbd-user mailing list