Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Hi to all, i'm experiencing some strange connection errors. drbd latest with kernel 2.6.37 x86_64 dmesg: block drbd0: conn( Unconnected -> WFConnection ) block drbd0: Handshake successful: Agreed network protocol version 95 block drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC block drbd0: conn( WFConnection -> WFReportParams ) block drbd0: Starting asender thread (from drbd0_receiver [16085]) block drbd0: data-integrity-alg: <not-used> block drbd0: max_segment_size ( = BIO size ) = 65536 block drbd0: drbd_sync_handshake: block drbd0: self 67E2E2BE025DAE64:0000000000000000:46544D5BFEFFFD86:D59D862A250C0143 bits:0 flags:0 block drbd0: peer 5BC8491385864D71:67E2E2BE025DAE65:46544D5BFEFFFD87:D59D862A250C0143 bits:1796 flags:0 block drbd0: uuid_compare()=-1 by rule 50 block drbd0: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate ) block drbd0: conn( WFBitMapT -> WFSyncUUID ) block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0 block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0 exit code 0 (0x0) block drbd0: conn( WFSyncUUID -> SyncTarget ) disk( UpToDate -> Inconsistent ) block drbd0: Began resync as SyncTarget (will sync 7248 KB [1812 bits set]). block drbd0: BAD! sector=223676552s enr=6826 rs_left=-1 rs_failed=0 count=1 Pid: 1584, comm: drbd0_asender Not tainted 2.6.37 #1 Call Trace: [<ffffffffa0040497>] ? lc_get+0x4b/0x1de [lru_cache] [<ffffffffa01ad55d>] drbd_try_clear_on_disk_bm+0xd5/0x29c [drbd] [<ffffffffa01af777>] __drbd_set_in_sync+0x24d/0x323 [drbd] [<ffffffffa01a3e17>] e_end_resync_block+0x68/0xd9 [drbd] [<ffffffffa01a26c7>] drbd_process_done_ee+0xe7/0x13c [drbd] [<ffffffffa01a5481>] drbd_asender+0x11c/0x4cc [drbd] [<ffffffffa01b6c93>] ? drbd_thread_setup+0x0/0xd0 [drbd] [<ffffffffa01b6cab>] drbd_thread_setup+0x18/0xd0 [drbd] [<ffffffffa01b6c93>] ? drbd_thread_setup+0x0/0xd0 [drbd] [<ffffffff810457da>] kthread+0x81/0x89 [<ffffffff81002c94>] kernel_thread_helper+0x4/0x10 [<ffffffff81045759>] ? kthread+0x0/0x89 [<ffffffff81002c90>] ? kernel_thread_helper+0x0/0x10 block drbd0: peer( Primary -> Unknown ) conn( SyncTarget -> Disconnecting ) pdsk( UpToDate -> DUnknown ) block drbd0: short read receiving data: read 3896 expected 4096 block drbd0: error receiving Data, l: 65560! block drbd0: asender terminated block drbd0: Terminating drbd0_asender block drbd0: Connection closed block drbd0: conn( Disconnecting -> StandAlone ) block drbd0: receiver terminated block drbd0: Terminating drbd0_receiver Seems like a network related problem, but it initially did the resync process at 100MB/sec without any problem. Network is provided by intel e1000 bonded together with balance-rr. Any thoughts? Regards, -- matteo