Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
[please cc me on replies; I'm not subscribed to the list] > this I have seen myself once, but were not able to reproduce. > that time it was possible to get out of it by just doing "drbdadm > connect all". I actually tried that (|drbdadm connect drbd0|), but it didn't help > the receiver on the secondary side is stuck somewhere. > It would be interessting to know where. Once in this state, > drbd would only get out of it if you configure ko-count to some smallish > positive number, even though the intended usage for that option was > something different. sysrq-T didn't show anything in D state on the secondary: Jul 14 10:59:02 dbtools02 kernel: drbd0_worker S F7092F24 2652 1851 1 1864 1814 (L-TLB) Jul 14 10:59:02 dbtools02 kernel: f7092f7c 00000046 f7092f24 f7092f24 c011e7c5 00000000 00000296 00000000 Jul 14 10:59:02 dbtools02 kernel: c1bb7080 027c0001 c1bb7080 c180ede0 00000000 00000f94 3b223170 00000013 Jul 14 10:59:02 dbtools02 kernel: c0320a80 f7b706b0 f7b7081c 00000000 00000002 c1bb71f4 c1bb71ec 00000246 Jul 14 10:59:02 dbtools02 kernel: Call Trace: Jul 14 10:59:02 dbtools02 kernel: [<c011e7c5>] __wake_up_locked+0x11/0x13 Jul 14 10:59:02 dbtools02 kernel: [<c02cfcc7>] __down_interruptible+0xbc/0x115 Jul 14 10:59:02 dbtools02 kernel: [<c011e71b>] default_wake_function+0x0/0xc Jul 14 10:59:02 dbtools02 kernel: [<f8aebdf0>] w_resume_next_sg+0x113/0x12b [drbd] Jul 14 10:59:02 dbtools02 kernel: [<c02cfd33>] __down_failed_interruptible+0x7/0xc Jul 14 10:59:02 dbtools02 kernel: [<f8aec455>] .text.lock.drbd_worker+0x37/0x3e [drbd] Jul 14 10:59:02 dbtools02 kernel: [<f8af5f5b>] drbd_thread_setup+0x9e/0xcf [drbd] Jul 14 10:59:02 dbtools02 kernel: [<f8af5ebd>] drbd_thread_setup+0x0/0xcf [drbd] Jul 14 10:59:02 dbtools02 kernel: [<c01041f5>] kernel_thread_helper+0x5/0xb Jul 14 10:59:02 dbtools02 kernel: drbd0_receive S F7D89640 2696 1864 1 1874 1851 (L-TLB) Jul 14 10:59:02 dbtools02 kernel: f70b7d8c 00000046 00000246 f7d89640 f7a4472c f70b7d60 c0142742 00000000 Jul 14 10:59:02 dbtools02 kernel: 00000000 f7ba41b0 c0120291 c1816de0 00000001 00012034 39d29f18 00000013 Jul 14 10:59:02 dbtools02 kernel: f7e110b0 f7ba41b0 f7ba431c f715b054 00000001 f73c3b80 7fffffff 00000000 Jul 14 10:59:02 dbtools02 kernel: Call Trace: Jul 14 10:59:02 dbtools02 kernel: [<c0142742>] mempool_alloc+0x7b/0x135 Jul 14 10:59:02 dbtools02 kernel: [<c0120291>] autoremove_wake_function+0x0/0x2d Jul 14 10:59:02 dbtools02 kernel: [<c02d0ddf>] schedule_timeout+0x53/0xee Jul 14 10:59:02 dbtools02 kernel: [<c027bef6>] skb_copy_datagram_iovec+0x53/0x1e5 Jul 14 10:59:02 dbtools02 kernel: [<c027995b>] release_sock+0xf/0x4f Jul 14 10:59:02 dbtools02 kernel: [<c0279490>] sk_wait_data+0x88/0xbd Jul 14 10:59:02 dbtools02 kernel: [<c0120291>] autoremove_wake_function+0x0/0x2d Jul 14 10:59:02 dbtools02 kernel: [<c0120291>] autoremove_wake_function+0x0/0x2d Jul 14 10:59:02 dbtools02 kernel: [<c02d0ca2>] __cond_resched+0x14/0x39 Jul 14 10:59:02 dbtools02 kernel: [<c029df09>] tcp_recvmsg+0x379/0x681 Jul 14 10:59:02 dbtools02 kernel: [<c022174b>] elv_merged_request+0x9/0xa Jul 14 10:59:02 dbtools02 kernel: [<c0279a88>] sock_common_recvmsg+0x30/0x46 Jul 14 10:59:02 dbtools02 kernel: [<c0276754>] sock_recvmsg+0xef/0x10c Jul 14 10:59:02 dbtools02 kernel: [<f885e9bd>] dm_request+0x11b/0x130 [dm_mod] Jul 14 10:59:02 dbtools02 kernel: [<c0120291>] autoremove_wake_function+0x0/0x2d Jul 14 10:59:02 dbtools02 kernel: [<f8aed2af>] drbd_recv+0x8f/0x14f [drbd] Jul 14 10:59:02 dbtools02 kernel: [<f8aed888>] drbd_recv_header+0x12/0xc7 [drbd] Jul 14 10:59:02 dbtools02 kernel: [<f8aee4b7>] receive_RSDataReply+0x0/0x1c7 [drbd] Jul 14 10:59:02 dbtools02 kernel: [<f8af0843>] drbdd+0x14/0x10d [drbd] Jul 14 10:59:02 dbtools02 kernel: [<f8af124d>] drbdd_init+0x92/0x15d [drbd] Jul 14 10:59:02 dbtools02 kernel: [<f8af5f5b>] drbd_thread_setup+0x9e/0xcf [drbd] Jul 14 10:59:02 dbtools02 kernel: [<f8af5ebd>] drbd_thread_setup+0x0/0xcf [drbd] Jul 14 10:59:02 dbtools02 kernel: [<c01041f5>] kernel_thread_helper+0x5/0xb Jul 14 10:59:02 dbtools02 kernel: drbd0_asender S C02A6134 2732 1874 1 1896 1864 (L-TLB) Jul 14 10:59:02 dbtools02 kernel: f7198d94 00000046 0000ffff c02a6134 c0320a80 f6ce88e0 f707eb98 f6ce88c0 Jul 14 10:59:02 dbtools02 kernel: c02a696d f707eb98 f73cf0c0 c180ede0 00000000 00000358 1ef5ee80 00000070 Jul 14 10:59:02 dbtools02 kernel: c0320a80 f7b711b0 f7b7131c 00000000 00000002 0002ff2b 0002ff2b 00000000 Jul 14 10:59:02 dbtools02 kernel: Call Trace: Jul 14 10:59:02 dbtools02 kernel: [<c02a6134>] tcp_cwnd_restart+0x14/0xb7 Jul 14 10:59:02 dbtools02 kernel: [<c02a696d>] tcp_transmit_skb+0x6bf/0x7d2 Jul 14 10:59:02 dbtools02 kernel: [<c02d0e5f>] schedule_timeout+0xd3/0xee Jul 14 10:59:02 dbtools02 kernel: [<c012a46e>] process_timeout+0x0/0x5 Jul 14 10:59:02 dbtools02 kernel: [<c027995b>] release_sock+0xf/0x4f Jul 14 10:59:02 dbtools02 kernel: [<c0279490>] sk_wait_data+0x88/0xbd Jul 14 10:59:02 dbtools02 kernel: [<c0120291>] autoremove_wake_function+0x0/0x2d Jul 14 10:59:02 dbtools02 kernel: [<c0120291>] autoremove_wake_function+0x0/0x2d Jul 14 10:59:02 dbtools02 kernel: [<c02acb8a>] tcp_v4_do_rcv+0x1b/0xe1 Jul 14 10:59:02 dbtools02 crond(pam_unix)[2946]: session closed for user root Jul 14 10:59:02 dbtools02 kernel: [<c029df09>] tcp_recvmsg+0x379/0x681 Jul 14 10:59:02 dbtools02 kernel: [<c0279a88>] sock_common_recvmsg+0x30/0x46 Jul 14 10:59:02 dbtools02 kernel: [<c0276754>] sock_recvmsg+0xef/0x10c Jul 14 10:59:02 dbtools02 kernel: [<c0120291>] autoremove_wake_function+0x0/0x2d Jul 14 10:59:02 dbtools02 kernel: [<c0120291>] autoremove_wake_function+0x0/0x2d Jul 14 10:59:02 dbtools02 kernel: [<c011e7a1>] __wake_up+0x29/0x3c Jul 14 10:59:02 dbtools02 kernel: [<c0120291>] autoremove_wake_function+0x0/0x2d Jul 14 10:59:02 dbtools02 kernel: [<f8af63a9>] _drbd_send_cmd+0xae/0x103 [drbd] Jul 14 10:59:02 dbtools02 kernel: [<f8aed215>] drbd_recv_short+0x8f/0x9a [drbd] Jul 14 10:59:02 dbtools02 kernel: [<f8af1a68>] drbd_asender+0xf6/0x3b2 [drbd] Jul 14 10:59:02 dbtools02 kernel: [<f8af5f5b>] drbd_thread_setup+0x9e/0xcf [drbd] Jul 14 10:59:02 dbtools02 kernel: [<f8af5ebd>] drbd_thread_setup+0x0/0xcf [drbd] Jul 14 10:59:02 dbtools02 kernel: [<c01041f5>] kernel_thread_helper+0x5/0xb If I do reproduce it, I'll see if I can get a tcpdump too. Thanks, Bradley