[DRBD-user] DRBD PingAck failure

Gerald Brandt gbr at majentis.com
Tue Apr 23 13:17:31 CEST 2013

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


Hi,

I have a DRBD setup with primary and secondary.  DRBD0 is a RAID6 array, and DRBD1 is a RAID1 array.  A couple of time a day, DRBD1 forces a re-sync due to PingAck not arriving in time, yet DRBD0 never has that issue.  Both sync over the same dedicated GigE link.  DRBD0 is in use, while DRBD1 isn't yet.  Running stock Ubuntu 12.10

Is there anything I can do about this?

PRIMARY:

Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042587] block drbd1: sock was shut down by peer
Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042597] block drbd1: peer( Secondary -> Unknown ) conn( Connected -> BrokenPipe ) pdsk( UpToDate -> DUnknown )
Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042633] block drbd1: short read expecting header on sock: r=0
Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042645] block drbd1: meta connection shut down by peer.
Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042650] block drbd1: new current UUID 7E8F445E07B7C819:7308FAB30BA7338D:3D2A5512F8677B9B:3D295512F8677B9B
Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042673] block drbd1: asender terminated
Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042675] block drbd1: Terminating drbd1_asender
Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.098710] block drbd1: Connection closed
Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.098714] block drbd1: conn( BrokenPipe -> Unconnected )
Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.098718] block drbd1: receiver terminated
Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.098719] block drbd1: Restarting drbd1_receiver
Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.098721] block drbd1: receiver (re)started
Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.098724] block drbd1: conn( Unconnected -> WFConnection )
Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937526] block drbd1: Handshake successful: Agreed network protocol version 96
Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937698] block drbd1: Peer authenticated using 20 bytes of 'sha1' HMAC
Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937716] block drbd1: conn( WFConnection -> WFReportParams ) 
Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937720] block drbd1: Starting asender thread (from drbd1_receiver [1620])
Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937747] block drbd1: data-integrity-alg: <not-used>
Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937758] block drbd1: drbd_sync_handshake:
Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937760] block drbd1: self 7E8F445E07B7C819:7308FAB30BA7338D:3D2A5512F8677B9B:3D295512F8677B9B bits:0 flags:0
Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937763] block drbd1: peer 7308FAB30BA7338C:0000000000000000:3D2A5512F8677B9A:3D295512F8677B9B bits:0 flags:0
Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937766] block drbd1: uuid_compare()=1 by rule 70
Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937770] block drbd1: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> Consistent )
Apr 23 03:56:24 iscsi-filer-1 kernel: [927117.010807] block drbd1: helper command: /sbin/drbdadm before-resync-source minor-1
Apr 23 03:56:24 iscsi-filer-1 kernel: [927117.065171] block drbd1: helper command: /sbin/drbdadm before-resync-source minor-1 exit code 0 (0x0)
Apr 23 03:56:24 iscsi-filer-1 kernel: [927117.065179] block drbd1: conn( WFBitMapS -> SyncSource ) pdsk( Consistent -> Inconsistent ) 
Apr 23 03:56:24 iscsi-filer-1 kernel: [927117.065186] block drbd1: Began resync as SyncSource (will sync 0 KB [0 bits set]).
Apr 23 03:56:24 iscsi-filer-1 kernel: [927117.065191] block drbd1: updated sync UUID 7E8F445E07B7C819:7309FAB30BA7338D:7308FAB30BA7338D:3D2A5512F8677B9B
Apr 23 03:56:25 iscsi-filer-1 kernel: [927117.521667] block drbd1: Resync done (total 1 sec; paused 0 sec; 0 K/sec)
Apr 23 03:56:25 iscsi-filer-1 kernel: [927117.521672] block drbd1: updated UUIDs 7E8F445E07B7C819:0000000000000000:7309FAB30BA7338D:7308FAB30BA7338D
Apr 23 03:56:25 iscsi-filer-1 kernel: [927117.521676] block drbd1: conn( SyncSource -> Connected ) pdsk( Inconsistent -> UpToDate ) 
Apr 23 03:56:25 iscsi-filer-1 kernel: [927118.433518] block drbd1: bitmap WRITE of 14904 pages took 228 jiffies
Apr 23 03:56:26 iscsi-filer-1 kernel: [927118.486667] block drbd1: 0 KB (0 bits) marked out-of-sync by on disk bit-map.


SECONDARY:
Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.461499] block drbd1: PingAck did not arrive in time.
Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.461872] block drbd1: peer( Primary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown )
Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502525] block drbd1: asender terminated
Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502533] block drbd1: Terminating drbd1_asender
Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502588] block drbd1: Connection closed
Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502593] block drbd1: conn( NetworkFailure -> Unconnected )
Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502598] block drbd1: receiver terminated
Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502600] block drbd1: Restarting drbd1_receiver
Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502601] block drbd1: receiver (re)started
Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502604] block drbd1: conn( Unconnected -> WFConnection ) 
Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397008] block drbd1: Handshake successful: Agreed network protocol version 96
Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397249] block drbd1: Peer authenticated using 20 bytes of 'sha1' HMAC
Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397264] block drbd1: conn( WFConnection -> WFReportParams ) 
Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397268] block drbd1: Starting asender thread (from drbd1_receiver [1670])
Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397326] block drbd1: data-integrity-alg: <not-used>
Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397376] block drbd1: drbd_sync_handshake:
Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397378] block drbd1: self 7308FAB30BA7338C:0000000000000000:3D2A5512F8677B9A:3D295512F8677B9B bits:0 flags:0
Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397380] block drbd1: peer 7E8F445E07B7C819:7308FAB30BA7338D:3D2A5512F8677B9B:3D295512F8677B9B bits:0 flags:0
Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397382] block drbd1: uuid_compare()=-1 by rule 50
Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397386] block drbd1: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) disk( UpToDate -> Outdated ) pdsk( DUnknown -> UpToDate )
Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.417113] block drbd1: conn( WFBitMapT -> WFSyncUUID ) 
Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.577678] block drbd1: updated sync uuid 7309FAB30BA7338C:0000000000000000:3D2A5512F8677B9A:3D295512F8677B9B
Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.606019] block drbd1: helper command: /sbin/drbdadm before-resync-target minor-1
Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.658326] block drbd1: helper command: /sbin/drbdadm before-resync-target minor-1 exit code 0 (0x0)
Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.658332] block drbd1: conn( WFSyncUUID -> SyncTarget ) disk( Outdated -> Inconsistent )
Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.658338] block drbd1: Began resync as SyncTarget (will sync 0 KB [0 bits set]). 
Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.660583] block drbd1: Resync done (total 1 sec; paused 0 sec; 0 K/sec)
Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.660588] block drbd1: updated UUIDs 7E8F445E07B7C818:0000000000000000:7309FAB30BA7338C:7308FAB30BA7338D
Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.660592] block drbd1: conn( SyncTarget -> Connected ) disk( Inconsistent -> UpToDate )
Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.672649] block drbd1: helper command: /sbin/drbdadm after-resync-target minor-1
Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.759827] block drbd1: helper command: /sbin/drbdadm after-resync-target minor-1 exit code 0 (0x0)
Apr 23 03:56:25 iscsi-filer-2 kernel: [844882.698240] block drbd1: bitmap WRITE of 14904 pages took 235 jiffies
Apr 23 03:56:25 iscsi-filer-2 kernel: [844882.760933] block drbd1: 0 KB (0 bits) marked out-of-sync by on disk bit-map.



More information about the drbd-user mailing list