[DRBD-user] Re: drbd 0.7.20 lockup

Bradley Baetz bradley.baetz at optusnet.com.au
Fri Jul 14 14:46:27 CEST 2006

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


[please cc me on replies; I'm not subscribed to the list]

> this I have seen myself once, but were not able to reproduce.
> that time it was possible to get out of it by just doing "drbdadm
> connect all".

I actually tried that (|drbdadm connect drbd0|), but it didn't help

> the receiver on the secondary side is stuck somewhere.
> It would be interessting to know where. Once in this state,
> drbd would only get out of it if you configure ko-count to some smallish
> positive number, even though the intended usage for that option was
> something different.

sysrq-T didn't show anything in D state on the secondary:

Jul 14 10:59:02 dbtools02 kernel: drbd0_worker  S F7092F24  2652  1851      1          1864  1814 (L-TLB)
Jul 14 10:59:02 dbtools02 kernel: f7092f7c 00000046 f7092f24 f7092f24 c011e7c5 00000000 00000296 00000000
Jul 14 10:59:02 dbtools02 kernel:        c1bb7080 027c0001 c1bb7080 c180ede0 00000000 00000f94 3b223170 00000013
Jul 14 10:59:02 dbtools02 kernel:        c0320a80 f7b706b0 f7b7081c 00000000 00000002 c1bb71f4 c1bb71ec 00000246
Jul 14 10:59:02 dbtools02 kernel: Call Trace:
Jul 14 10:59:02 dbtools02 kernel:  [<c011e7c5>] __wake_up_locked+0x11/0x13
Jul 14 10:59:02 dbtools02 kernel:  [<c02cfcc7>] __down_interruptible+0xbc/0x115
Jul 14 10:59:02 dbtools02 kernel:  [<c011e71b>] default_wake_function+0x0/0xc
Jul 14 10:59:02 dbtools02 kernel:  [<f8aebdf0>] w_resume_next_sg+0x113/0x12b [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<c02cfd33>] __down_failed_interruptible+0x7/0xc
Jul 14 10:59:02 dbtools02 kernel:  [<f8aec455>] .text.lock.drbd_worker+0x37/0x3e [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<f8af5f5b>] drbd_thread_setup+0x9e/0xcf [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<f8af5ebd>] drbd_thread_setup+0x0/0xcf [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<c01041f5>] kernel_thread_helper+0x5/0xb
Jul 14 10:59:02 dbtools02 kernel: drbd0_receive S F7D89640  2696  1864      1          1874  1851 (L-TLB)
Jul 14 10:59:02 dbtools02 kernel: f70b7d8c 00000046 00000246 f7d89640 f7a4472c f70b7d60 c0142742 00000000
Jul 14 10:59:02 dbtools02 kernel:        00000000 f7ba41b0 c0120291 c1816de0 00000001 00012034 39d29f18 00000013
Jul 14 10:59:02 dbtools02 kernel:        f7e110b0 f7ba41b0 f7ba431c f715b054 00000001 f73c3b80 7fffffff 00000000
Jul 14 10:59:02 dbtools02 kernel: Call Trace:
Jul 14 10:59:02 dbtools02 kernel:  [<c0142742>] mempool_alloc+0x7b/0x135
Jul 14 10:59:02 dbtools02 kernel:  [<c0120291>] autoremove_wake_function+0x0/0x2d
Jul 14 10:59:02 dbtools02 kernel:  [<c02d0ddf>] schedule_timeout+0x53/0xee
Jul 14 10:59:02 dbtools02 kernel:  [<c027bef6>] skb_copy_datagram_iovec+0x53/0x1e5
Jul 14 10:59:02 dbtools02 kernel:  [<c027995b>] release_sock+0xf/0x4f
Jul 14 10:59:02 dbtools02 kernel:  [<c0279490>] sk_wait_data+0x88/0xbd
Jul 14 10:59:02 dbtools02 kernel:  [<c0120291>] autoremove_wake_function+0x0/0x2d
Jul 14 10:59:02 dbtools02 kernel:  [<c0120291>] autoremove_wake_function+0x0/0x2d
Jul 14 10:59:02 dbtools02 kernel:  [<c02d0ca2>] __cond_resched+0x14/0x39
Jul 14 10:59:02 dbtools02 kernel:  [<c029df09>] tcp_recvmsg+0x379/0x681
Jul 14 10:59:02 dbtools02 kernel:  [<c022174b>] elv_merged_request+0x9/0xa
Jul 14 10:59:02 dbtools02 kernel:  [<c0279a88>] sock_common_recvmsg+0x30/0x46
Jul 14 10:59:02 dbtools02 kernel:  [<c0276754>] sock_recvmsg+0xef/0x10c
Jul 14 10:59:02 dbtools02 kernel:  [<f885e9bd>] dm_request+0x11b/0x130 [dm_mod]
Jul 14 10:59:02 dbtools02 kernel:  [<c0120291>] autoremove_wake_function+0x0/0x2d
Jul 14 10:59:02 dbtools02 kernel:  [<f8aed2af>] drbd_recv+0x8f/0x14f [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<f8aed888>] drbd_recv_header+0x12/0xc7 [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<f8aee4b7>] receive_RSDataReply+0x0/0x1c7 [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<f8af0843>] drbdd+0x14/0x10d [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<f8af124d>] drbdd_init+0x92/0x15d [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<f8af5f5b>] drbd_thread_setup+0x9e/0xcf [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<f8af5ebd>] drbd_thread_setup+0x0/0xcf [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<c01041f5>] kernel_thread_helper+0x5/0xb
Jul 14 10:59:02 dbtools02 kernel: drbd0_asender S C02A6134  2732  1874      1          1896  1864 (L-TLB)
Jul 14 10:59:02 dbtools02 kernel: f7198d94 00000046 0000ffff c02a6134 c0320a80 f6ce88e0 f707eb98 f6ce88c0
Jul 14 10:59:02 dbtools02 kernel:        c02a696d f707eb98 f73cf0c0 c180ede0 00000000 00000358 1ef5ee80 00000070
Jul 14 10:59:02 dbtools02 kernel:        c0320a80 f7b711b0 f7b7131c 00000000 00000002 0002ff2b 0002ff2b 00000000
Jul 14 10:59:02 dbtools02 kernel: Call Trace:
Jul 14 10:59:02 dbtools02 kernel:  [<c02a6134>] tcp_cwnd_restart+0x14/0xb7
Jul 14 10:59:02 dbtools02 kernel:  [<c02a696d>] tcp_transmit_skb+0x6bf/0x7d2
Jul 14 10:59:02 dbtools02 kernel:  [<c02d0e5f>] schedule_timeout+0xd3/0xee
Jul 14 10:59:02 dbtools02 kernel:  [<c012a46e>] process_timeout+0x0/0x5
Jul 14 10:59:02 dbtools02 kernel:  [<c027995b>] release_sock+0xf/0x4f
Jul 14 10:59:02 dbtools02 kernel:  [<c0279490>] sk_wait_data+0x88/0xbd
Jul 14 10:59:02 dbtools02 kernel:  [<c0120291>] autoremove_wake_function+0x0/0x2d
Jul 14 10:59:02 dbtools02 kernel:  [<c0120291>] autoremove_wake_function+0x0/0x2d
Jul 14 10:59:02 dbtools02 kernel:  [<c02acb8a>] tcp_v4_do_rcv+0x1b/0xe1
Jul 14 10:59:02 dbtools02 crond(pam_unix)[2946]: session closed for user root
Jul 14 10:59:02 dbtools02 kernel:  [<c029df09>] tcp_recvmsg+0x379/0x681
Jul 14 10:59:02 dbtools02 kernel:  [<c0279a88>] sock_common_recvmsg+0x30/0x46
Jul 14 10:59:02 dbtools02 kernel:  [<c0276754>] sock_recvmsg+0xef/0x10c
Jul 14 10:59:02 dbtools02 kernel:  [<c0120291>] autoremove_wake_function+0x0/0x2d
Jul 14 10:59:02 dbtools02 kernel:  [<c0120291>] autoremove_wake_function+0x0/0x2d
Jul 14 10:59:02 dbtools02 kernel:  [<c011e7a1>] __wake_up+0x29/0x3c
Jul 14 10:59:02 dbtools02 kernel:  [<c0120291>] autoremove_wake_function+0x0/0x2d
Jul 14 10:59:02 dbtools02 kernel:  [<f8af63a9>] _drbd_send_cmd+0xae/0x103 [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<f8aed215>] drbd_recv_short+0x8f/0x9a [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<f8af1a68>] drbd_asender+0xf6/0x3b2 [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<f8af5f5b>] drbd_thread_setup+0x9e/0xcf [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<f8af5ebd>] drbd_thread_setup+0x0/0xcf [drbd]
Jul 14 10:59:02 dbtools02 kernel:  [<c01041f5>] kernel_thread_helper+0x5/0xb

If I do reproduce it, I'll see if I can get a tcpdump too.

Thanks,

Bradley



More information about the drbd-user mailing list