[DRBD-user] DRBD PingAck failure

Lars Ellenberg lars.ellenberg at linbit.com
Tue Apr 23 15:13:50 CEST 2013

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


On Tue, Apr 23, 2013 at 06:17:31AM -0500, Gerald Brandt wrote:
> Hi,
> 
> I have a DRBD setup with primary and secondary.  DRBD0 is a RAID6
> array, and DRBD1 is a RAID1 array.  A couple of time a day, DRBD1
> forces a re-sync due to PingAck not arriving in time, yet DRBD0 never
> has that issue.  Both sync over the same dedicated GigE link.  DRBD0
> is in use, while DRBD1 isn't yet.  Running stock Ubuntu 12.10
> 
> Is there anything I can do about this?

If drbd1 is not yet in use, down it :)

I have seen similar behavior when some "intelligent" network component
or firewall closed "idle" tcp connections, for some "definition" of idle.

> PRIMARY:
> 
> Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042587] block drbd1: sock was shut down by peer
> Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042597] block drbd1: peer( Secondary -> Unknown ) conn( Connected -> BrokenPipe ) pdsk( UpToDate -> DUnknown )
> Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042633] block drbd1: short read expecting header on sock: r=0
> Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042645] block drbd1: meta connection shut down by peer.
> Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042650] block drbd1: new current UUID 7E8F445E07B7C819:7308FAB30BA7338D:3D2A5512F8677B9B:3D295512F8677B9B
> Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042673] block drbd1: asender terminated
> Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.042675] block drbd1: Terminating drbd1_asender
> Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.098710] block drbd1: Connection closed
> Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.098714] block drbd1: conn( BrokenPipe -> Unconnected )
> Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.098718] block drbd1: receiver terminated
> Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.098719] block drbd1: Restarting drbd1_receiver
> Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.098721] block drbd1: receiver (re)started
> Apr 23 03:56:22 iscsi-filer-1 kernel: [927115.098724] block drbd1: conn( Unconnected -> WFConnection )
> Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937526] block drbd1: Handshake successful: Agreed network protocol version 96
> Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937698] block drbd1: Peer authenticated using 20 bytes of 'sha1' HMAC
> Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937716] block drbd1: conn( WFConnection -> WFReportParams ) 
> Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937720] block drbd1: Starting asender thread (from drbd1_receiver [1620])
> Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937747] block drbd1: data-integrity-alg: <not-used>
> Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937758] block drbd1: drbd_sync_handshake:
> Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937760] block drbd1: self 7E8F445E07B7C819:7308FAB30BA7338D:3D2A5512F8677B9B:3D295512F8677B9B bits:0 flags:0
> Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937763] block drbd1: peer 7308FAB30BA7338C:0000000000000000:3D2A5512F8677B9A:3D295512F8677B9B bits:0 flags:0
> Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937766] block drbd1: uuid_compare()=1 by rule 70
> Apr 23 03:56:23 iscsi-filer-1 kernel: [927115.937770] block drbd1: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> Consistent )
> Apr 23 03:56:24 iscsi-filer-1 kernel: [927117.010807] block drbd1: helper command: /sbin/drbdadm before-resync-source minor-1
> Apr 23 03:56:24 iscsi-filer-1 kernel: [927117.065171] block drbd1: helper command: /sbin/drbdadm before-resync-source minor-1 exit code 0 (0x0)
> Apr 23 03:56:24 iscsi-filer-1 kernel: [927117.065179] block drbd1: conn( WFBitMapS -> SyncSource ) pdsk( Consistent -> Inconsistent ) 
> Apr 23 03:56:24 iscsi-filer-1 kernel: [927117.065186] block drbd1: Began resync as SyncSource (will sync 0 KB [0 bits set]).
> Apr 23 03:56:24 iscsi-filer-1 kernel: [927117.065191] block drbd1: updated sync UUID 7E8F445E07B7C819:7309FAB30BA7338D:7308FAB30BA7338D:3D2A5512F8677B9B
> Apr 23 03:56:25 iscsi-filer-1 kernel: [927117.521667] block drbd1: Resync done (total 1 sec; paused 0 sec; 0 K/sec)
> Apr 23 03:56:25 iscsi-filer-1 kernel: [927117.521672] block drbd1: updated UUIDs 7E8F445E07B7C819:0000000000000000:7309FAB30BA7338D:7308FAB30BA7338D
> Apr 23 03:56:25 iscsi-filer-1 kernel: [927117.521676] block drbd1: conn( SyncSource -> Connected ) pdsk( Inconsistent -> UpToDate ) 
> Apr 23 03:56:25 iscsi-filer-1 kernel: [927118.433518] block drbd1: bitmap WRITE of 14904 pages took 228 jiffies
> Apr 23 03:56:26 iscsi-filer-1 kernel: [927118.486667] block drbd1: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
> 
> 
> SECONDARY:
> Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.461499] block drbd1: PingAck did not arrive in time.
> Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.461872] block drbd1: peer( Primary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown )
> Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502525] block drbd1: asender terminated
> Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502533] block drbd1: Terminating drbd1_asender
> Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502588] block drbd1: Connection closed
> Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502593] block drbd1: conn( NetworkFailure -> Unconnected )
> Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502598] block drbd1: receiver terminated
> Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502600] block drbd1: Restarting drbd1_receiver
> Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502601] block drbd1: receiver (re)started
> Apr 23 03:56:22 iscsi-filer-2 kernel: [844879.502604] block drbd1: conn( Unconnected -> WFConnection ) 
> Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397008] block drbd1: Handshake successful: Agreed network protocol version 96
> Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397249] block drbd1: Peer authenticated using 20 bytes of 'sha1' HMAC
> Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397264] block drbd1: conn( WFConnection -> WFReportParams ) 
> Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397268] block drbd1: Starting asender thread (from drbd1_receiver [1670])
> Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397326] block drbd1: data-integrity-alg: <not-used>
> Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397376] block drbd1: drbd_sync_handshake:
> Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397378] block drbd1: self 7308FAB30BA7338C:0000000000000000:3D2A5512F8677B9A:3D295512F8677B9B bits:0 flags:0
> Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397380] block drbd1: peer 7E8F445E07B7C819:7308FAB30BA7338D:3D2A5512F8677B9B:3D295512F8677B9B bits:0 flags:0
> Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397382] block drbd1: uuid_compare()=-1 by rule 50
> Apr 23 03:56:23 iscsi-filer-2 kernel: [844880.397386] block drbd1: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) disk( UpToDate -> Outdated ) pdsk( DUnknown -> UpToDate )
> Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.417113] block drbd1: conn( WFBitMapT -> WFSyncUUID ) 
> Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.577678] block drbd1: updated sync uuid 7309FAB30BA7338C:0000000000000000:3D2A5512F8677B9A:3D295512F8677B9B
> Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.606019] block drbd1: helper command: /sbin/drbdadm before-resync-target minor-1
> Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.658326] block drbd1: helper command: /sbin/drbdadm before-resync-target minor-1 exit code 0 (0x0)
> Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.658332] block drbd1: conn( WFSyncUUID -> SyncTarget ) disk( Outdated -> Inconsistent )
> Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.658338] block drbd1: Began resync as SyncTarget (will sync 0 KB [0 bits set]). 
> Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.660583] block drbd1: Resync done (total 1 sec; paused 0 sec; 0 K/sec)
> Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.660588] block drbd1: updated UUIDs 7E8F445E07B7C818:0000000000000000:7309FAB30BA7338C:7308FAB30BA7338D
> Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.660592] block drbd1: conn( SyncTarget -> Connected ) disk( Inconsistent -> UpToDate )
> Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.672649] block drbd1: helper command: /sbin/drbdadm after-resync-target minor-1
> Apr 23 03:56:24 iscsi-filer-2 kernel: [844881.759827] block drbd1: helper command: /sbin/drbdadm after-resync-target minor-1 exit code 0 (0x0)
> Apr 23 03:56:25 iscsi-filer-2 kernel: [844882.698240] block drbd1: bitmap WRITE of 14904 pages took 235 jiffies
> Apr 23 03:56:25 iscsi-filer-2 kernel: [844882.760933] block drbd1: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
> _______________________________________________
> drbd-user mailing list
> drbd-user at lists.linbit.com
> http://lists.linbit.com/mailman/listinfo/drbd-user

-- 
: Lars Ellenberg
: LINBIT | Your Way to High Availability
: DRBD/HA support and consulting http://www.linbit.com

DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.
__
please don't Cc me, but send to list   --   I'm subscribed



More information about the drbd-user mailing list