[DRBD-user] Fwd: Re: drbd issue?

Nicolas nicolas at shivaserv.fr
Wed Aug 29 13:33:26 CEST 2018


Hello

Sorry for the misunderstanding of utils version.

I'm using the kernel : 4.9.88-1+deb9u1 (4.9.0-6-amd64 debian).
And the module version v8.4.7.

filename: /lib/modules/4.9.0-6-amd64/kernel/drivers/block/drbd/drbd.ko
alias: block-major-147-*
license: GPL
version: 8.4.7
description: drbd - Distributed Replicated Block Device v8.4.7
author: Philipp Reisner <phil at linbit.com>, Lars Ellenberg <lars at linbit.com>
srcversion: 0904DF2CCF7283ACE07D07A
depends: lru_cache,libcrc32c
retpoline: Y
intree: Y
vermagic: 4.9.0-6-amd64 SMP mod_unload modversions 
parm: minor_count:Approximate number of drbd devices (1-255) (uint)
parm: disable_sendpage:bool
parm: allow_oos:DONT USE! (bool)
parm: proc_details:int
parm: usermode_helper:string
For example when a node says:

[Tue Aug 28 14:32:38 2018] drbd resource10: peer( Primary -> Unknown ) conn( Connected -> Disconnecting ) pdsk( UpToDate -> DUnknown ) 
[Tue Aug 28 14:32:38 2018] drbd resource10: ack_receiver terminated
[Tue Aug 28 14:32:38 2018] drbd resource10: Terminating drbd_a_resource
[Tue Aug 28 14:32:38 2018] drbd resource10: Connection closed
[Tue Aug 28 14:32:38 2018] drbd resource10: conn( Disconnecting -> StandAlone ) 
[Tue Aug 28 14:32:38 2018] drbd resource10: receiver terminated
[Tue Aug 28 14:32:38 2018] drbd resource10: Terminating drbd_r_resource
[Tue Aug 28 14:32:38 2018] block drbd10: disk( UpToDate -> Failed ) 
[Tue Aug 28 14:32:38 2018] block drbd10: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
[Tue Aug 28 14:32:38 2018] block drbd10: disk( Failed -> Diskless ) 
[Tue Aug 28 14:32:38 2018] drbd resource10: Terminating drbd_w_resource
[Tue Aug 28 14:32:40 2018] drbd resource10: Starting worker thread (from drbdsetup-84 [10222])
[Tue Aug 28 14:32:40 2018] block drbd10: disk( Diskless -> Attaching ) 
[Tue Aug 28 14:32:40 2018] drbd resource10: Method to ensure write ordering: flush
[Tue Aug 28 14:32:40 2018] block drbd10: max BIO size = 262144
[Tue Aug 28 14:32:40 2018] block drbd10: Adjusting my ra_pages to backing device's (32 -> 256)
[Tue Aug 28 14:32:40 2018] block drbd10: drbd_bm_resize called with capacity == 314572800
[Tue Aug 28 14:32:40 2018] block drbd10: resync bitmap: bits=39321600 words=614400 pages=1200
[Tue Aug 28 14:32:40 2018] block drbd10: size = 150 GB (157286400 KB)
[Tue Aug 28 14:32:40 2018] block drbd10: recounting of set bits took additional 0 jiffies
[Tue Aug 28 14:32:40 2018] block drbd10: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
[Tue Aug 28 14:32:40 2018] block drbd10: disk( Attaching -> UpToDate ) 
[Tue Aug 28 14:32:40 2018] block drbd10: attached to UUIDs 0748EE11C429D3B4:0000000000000000:FDAEFCD2E8D9890A:FDADFCD2E8D9890B
[Tue Aug 28 14:32:40 2018] drbd resource10: conn( StandAlone -> Unconnected ) 
[Tue Aug 28 14:32:40 2018] drbd resource10: Starting receiver thread (from drbd_w_resource [10225])
[Tue Aug 28 14:32:40 2018] drbd resource10: receiver (re)started
[Tue Aug 28 14:32:40 2018] drbd resource10: conn( Unconnected -> WFConnection ) 
[Tue Aug 28 14:32:41 2018] drbd resource10: Handshake successful: Agreed network protocol version 101
[Tue Aug 28 14:32:41 2018] drbd resource10: Feature flags enabled on protocol level: 0x7 TRIM THIN_RESYNC WRITE_SAME.
[Tue Aug 28 14:32:41 2018] drbd resource10: Peer authenticated using 16 bytes HMAC
[Tue Aug 28 14:32:41 2018] drbd resource10: conn( WFConnection -> WFReportParams ) 
[Tue Aug 28 14:32:41 2018] drbd resource10: Starting ack_recv thread (from drbd_r_resource [10246])
[Tue Aug 28 14:32:41 2018] block drbd10: drbd_sync_handshake:
[Tue Aug 28 14:32:41 2018] block drbd10: self 0748EE11C429D3B4:0000000000000000:FDAEFCD2E8D9890A:FDADFCD2E8D9890B bits:0 flags:0
[Tue Aug 28 14:32:41 2018] block drbd10: peer 629F1036CD6CA2AF:0748EE11C429D3B5:FDAEFCD2E8D9890B:FDADFCD2E8D9890B bits:0 flags:0
[Tue Aug 28 14:32:41 2018] block drbd10: uuid_compare()=-1 by rule 50
[Tue Aug 28 14:32:41 2018] block drbd10: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) disk( UpToDate -> Outdated ) pdsk( DUnknown -> UpToDate ) 
[Tue Aug 28 14:32:41 2018] block drbd10: receive bitmap stats [Bytes(packets)]: plain 0(0), RLE 23(1), total 23; compression: 100.0%
[Tue Aug 28 14:32:41 2018] block drbd10: send bitmap stats [Bytes(packets)]: plain 0(0), RLE 23(1), total 23; compression: 100.0%
[Tue Aug 28 14:32:41 2018] block drbd10: conn( WFBitMapT -> WFSyncUUID ) 
[Tue Aug 28 14:32:41 2018] block drbd10: updated sync uuid 0749EE11C429D3B4:0000000000000000:FDAEFCD2E8D9890A:FDADFCD2E8D9890B
[Tue Aug 28 14:32:41 2018] block drbd10: helper command: /bin/true before-resync-target minor-10
[Tue Aug 28 14:32:41 2018] block drbd10: helper command: /bin/true before-resync-target minor-10 exit code 0 (0x0)
[Tue Aug 28 14:32:41 2018] block drbd10: conn( WFSyncUUID -> SyncTarget ) disk( Outdated -> Inconsistent ) 
[Tue Aug 28 14:32:41 2018] block drbd10: Began resync as SyncTarget (will sync 0 KB [0 bits set]).
[Tue Aug 28 14:32:41 2018] block drbd10: Resync done (total 1 sec; paused 0 sec; 0 K/sec)
[Tue Aug 28 14:32:41 2018] block drbd10: updated UUIDs 629F1036CD6CA2AE:0000000000000000:0749EE11C429D3B4:0748EE11C429D3B5
[Tue Aug 28 14:32:41 2018] block drbd10: conn( SyncTarget -> Connected ) disk( Inconsistent -> UpToDate ) 
[Tue Aug 28 14:32:41 2018] block drbd10: helper command: /bin/true after-resync-target minor-10
[Tue Aug 28 14:32:41 2018] block drbd10: helper command: /bin/true after-resync-target minor-10 exit code 0 (0x0)

The second says:

[Tue Aug 28 14:35:33 2018] br0: port 8(tap6) entered disabled state
[Tue Aug 28 14:35:33 2018] device tap6 left promiscuous mode
[Tue Aug 28 14:35:33 2018] br0: port 8(tap6) entered disabled state
[Tue Aug 28 14:35:37 2018] drbd resource10: peer( Secondary -> Unknown ) conn( Connected -> TearDown ) pdsk( UpToDate -> DUnknown ) 
[Tue Aug 28 14:35:37 2018] drbd resource10: ack_receiver terminated
[Tue Aug 28 14:35:37 2018] drbd resource10: Terminating drbd_a_resource
[Tue Aug 28 14:35:37 2018] block drbd10: new current UUID 629F1036CD6CA2AF:0748EE11C429D3B5:FDAEFCD2E8D9890B:FDADFCD2E8D9890B
[Tue Aug 28 14:35:37 2018] drbd resource10: Connection closed
[Tue Aug 28 14:35:37 2018] drbd resource10: conn( TearDown -> Unconnected ) 
[Tue Aug 28 14:35:37 2018] drbd resource10: receiver terminated
[Tue Aug 28 14:35:37 2018] drbd resource10: Restarting receiver thread
[Tue Aug 28 14:35:37 2018] drbd resource10: receiver (re)started
[Tue Aug 28 14:35:37 2018] drbd resource10: conn( Unconnected -> WFConnection ) 
[Tue Aug 28 14:35:38 2018] block drbd10: role( Primary -> Secondary ) 
[Tue Aug 28 14:35:38 2018] block drbd10: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
[Tue Aug 28 14:35:38 2018] drbd resource10: conn( WFConnection -> Disconnecting ) 
[Tue Aug 28 14:35:38 2018] drbd resource10: Discarding network configuration.
[Tue Aug 28 14:35:38 2018] drbd resource10: Connection closed
[Tue Aug 28 14:35:38 2018] drbd resource10: conn( Disconnecting -> StandAlone ) 
[Tue Aug 28 14:35:38 2018] drbd resource10: receiver terminated
[Tue Aug 28 14:35:38 2018] drbd resource10: Terminating drbd_r_resource
[Tue Aug 28 14:35:38 2018] block drbd10: disk( UpToDate -> Failed ) 
[Tue Aug 28 14:35:38 2018] block drbd10: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
[Tue Aug 28 14:35:38 2018] block drbd10: disk( Failed -> Diskless ) 
[Tue Aug 28 14:35:38 2018] drbd resource10: Terminating drbd_w_resource
[Tue Aug 28 14:35:40 2018] drbd resource10: Starting worker thread (from drbdsetup-84 [3025])
[Tue Aug 28 14:35:40 2018] block drbd10: disk( Diskless -> Attaching ) 
[Tue Aug 28 14:35:40 2018] drbd resource10: Method to ensure write ordering: flush
[Tue Aug 28 14:35:40 2018] block drbd10: max BIO size = 262144
[Tue Aug 28 14:35:40 2018] block drbd10: Adjusting my ra_pages to backing device's (32 -> 256)
[Tue Aug 28 14:35:40 2018] block drbd10: drbd_bm_resize called with capacity == 314572800
[Tue Aug 28 14:35:40 2018] block drbd10: resync bitmap: bits=39321600 words=614400 pages=1200
[Tue Aug 28 14:35:40 2018] block drbd10: size = 150 GB (157286400 KB)
[Tue Aug 28 14:35:41 2018] block drbd10: recounting of set bits took additional 0 jiffies
[Tue Aug 28 14:35:41 2018] block drbd10: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
[Tue Aug 28 14:35:41 2018] block drbd10: disk( Attaching -> UpToDate ) 
[Tue Aug 28 14:35:41 2018] block drbd10: attached to UUIDs 629F1036CD6CA2AF:0748EE11C429D3B5:FDAEFCD2E8D9890B:FDADFCD2E8D9890B
[Tue Aug 28 14:35:41 2018] drbd resource10: conn( StandAlone -> Unconnected ) 
[Tue Aug 28 14:35:41 2018] drbd resource10: Starting receiver thread (from drbd_w_resource [3030])
[Tue Aug 28 14:35:41 2018] drbd resource10: receiver (re)started
[Tue Aug 28 14:35:41 2018] drbd resource10: conn( Unconnected -> WFConnection ) 
[Tue Aug 28 14:35:41 2018] block drbd10: role( Secondary -> Primary ) 
[Tue Aug 28 14:35:41 2018] drbd resource10: Handshake successful: Agreed network protocol version 101
[Tue Aug 28 14:35:41 2018] drbd resource10: Feature flags enabled on protocol level: 0x7 TRIM THIN_RESYNC WRITE_SAME.
[Tue Aug 28 14:35:41 2018] drbd resource10: Peer authenticated using 16 bytes HMAC
[Tue Aug 28 14:35:41 2018] drbd resource10: conn( WFConnection -> WFReportParams ) 
[Tue Aug 28 14:35:41 2018] drbd resource10: Starting ack_recv thread (from drbd_r_resource [3045])
[Tue Aug 28 14:35:41 2018] block drbd10: drbd_sync_handshake:
[Tue Aug 28 14:35:41 2018] block drbd10: self 629F1036CD6CA2AF:0748EE11C429D3B5:FDAEFCD2E8D9890B:FDADFCD2E8D9890B bits:0 flags:0
[Tue Aug 28 14:35:41 2018] block drbd10: peer 0748EE11C429D3B4:0000000000000000:FDAEFCD2E8D9890A:FDADFCD2E8D9890B bits:0 flags:0
[Tue Aug 28 14:35:41 2018] block drbd10: uuid_compare()=1 by rule 70
[Tue Aug 28 14:35:41 2018] block drbd10: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> Consistent ) 
[Tue Aug 28 14:35:41 2018] block drbd10: send bitmap stats [Bytes(packets)]: plain 0(0), RLE 23(1), total 23; compression: 100.0%
[Tue Aug 28 14:35:41 2018] block drbd10: receive bitmap stats [Bytes(packets)]: plain 0(0), RLE 23(1), total 23; compression: 100.0%
[Tue Aug 28 14:35:41 2018] block drbd10: helper command: /bin/true before-resync-source minor-10
[Tue Aug 28 14:35:41 2018] block drbd10: helper command: /bin/true before-resync-source minor-10 exit code 0 (0x0)
[Tue Aug 28 14:35:41 2018] block drbd10: conn( WFBitMapS -> SyncSource ) pdsk( Consistent -> Inconsistent ) 
[Tue Aug 28 14:35:41 2018] block drbd10: Began resync as SyncSource (will sync 0 KB [0 bits set]).
[Tue Aug 28 14:35:41 2018] block drbd10: updated sync UUID 629F1036CD6CA2AF:0749EE11C429D3B5:0748EE11C429D3B5:FDAEFCD2E8D9890B
[Tue Aug 28 14:35:41 2018] block drbd10: Resync done (total 1 sec; paused 0 sec; 0 K/sec)
[Tue Aug 28 14:35:41 2018] block drbd10: updated UUIDs 629F1036CD6CA2AF:0000000000000000:0749EE11C429D3B5:0748EE11C429D3B5
[Tue Aug 28 14:35:41 2018] block drbd10: conn( SyncSource -> Connected ) pdsk( Inconsistent -> UpToDate ) 
[Tue Aug 28 14:35:41 2018] br0: port 8(tap6) entered blocking state
[Tue Aug 28 14:35:41 2018] br0: port 8(tap6) entered disabled state
[Tue Aug 28 14:35:41 2018] device tap6 entered promiscuous mode
[Tue Aug 28 14:35:41 2018] br0: port 8(tap6) entered blocking state
[Tue Aug 28 14:35:41 2018] br0: port 8(tap6) entered forwarding state

And it seems for this example the second node was the origin of this. 
This night I got another error, saying network failure, but I'm sure there was no network issue:

First node: 

[Wed Aug 29 01:39:48 2018] drbd resource0: meta connection shut down by peer.
[Wed Aug 29 01:39:48 2018] drbd resource0: peer( Primary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) 
[Wed Aug 29 01:39:48 2018] drbd resource0: ack_receiver terminated
[Wed Aug 29 01:39:48 2018] drbd resource0: Terminating drbd_a_resource
[Wed Aug 29 01:39:48 2018] drbd resource0: Connection closed
[Wed Aug 29 01:39:48 2018] drbd resource0: conn( NetworkFailure -> Unconnected ) 
[Wed Aug 29 01:39:48 2018] drbd resource0: receiver terminated
[Wed Aug 29 01:39:48 2018] drbd resource0: Restarting receiver thread
[Wed Aug 29 01:39:48 2018] drbd resource0: receiver (re)started
[Wed Aug 29 01:39:48 2018] drbd resource0: conn( Unconnected -> WFConnection ) 
[Wed Aug 29 01:39:49 2018] drbd resource0: Handshake successful: Agreed network protocol version 101
[Wed Aug 29 01:39:49 2018] drbd resource0: Feature flags enabled on protocol level: 0x7 TRIM THIN_RESYNC WRITE_SAME.
[Wed Aug 29 01:39:49 2018] drbd resource0: Peer authenticated using 16 bytes HMAC
[Wed Aug 29 01:39:49 2018] drbd resource0: conn( WFConnection -> WFReportParams ) 
[Wed Aug 29 01:39:49 2018] drbd resource0: Starting ack_recv thread (from drbd_r_resource [6370])
[Wed Aug 29 01:39:49 2018] block drbd0: drbd_sync_handshake:
[Wed Aug 29 01:39:49 2018] block drbd0: self 127020C204C1B248:0000000000000000:8EF21B48CFD0C506:8EF11B48CFD0C507 bits:0 flags:0
[Wed Aug 29 01:39:49 2018] block drbd0: peer ACAF943B769772E7:127020C204C1B249:8EF21B48CFD0C507:8EF11B48CFD0C507 bits:7 flags:0
[Wed Aug 29 01:39:49 2018] block drbd0: uuid_compare()=-1 by rule 50
[Wed Aug 29 01:39:49 2018] block drbd0: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) disk( UpToDate -> Outdated ) pdsk( DUnknown -> UpToDate ) 
[Wed Aug 29 01:39:49 2018] block drbd0: receive bitmap stats [Bytes(packets)]: plain 0(0), RLE 33(1), total 33; compression: 100.0%
[Wed Aug 29 01:39:49 2018] block drbd0: send bitmap stats [Bytes(packets)]: plain 0(0), RLE 33(1), total 33; compression: 100.0%
[Wed Aug 29 01:39:49 2018] block drbd0: conn( WFBitMapT -> WFSyncUUID ) 
[Wed Aug 29 01:39:49 2018] block drbd0: updated sync uuid 127120C204C1B248:0000000000000000:8EF21B48CFD0C506:8EF11B48CFD0C507
[Wed Aug 29 01:39:49 2018] block drbd0: helper command: /bin/true before-resync-target minor-0
[Wed Aug 29 01:39:49 2018] block drbd0: helper command: /bin/true before-resync-target minor-0 exit code 0 (0x0)
[Wed Aug 29 01:39:49 2018] block drbd0: conn( WFSyncUUID -> SyncTarget ) disk( Outdated -> Inconsistent ) 
[Wed Aug 29 01:39:49 2018] block drbd0: Began resync as SyncTarget (will sync 28 KB [7 bits set]).
[Wed Aug 29 01:39:49 2018] block drbd0: Resync done (total 1 sec; paused 0 sec; 28 K/sec)
[Wed Aug 29 01:39:49 2018] block drbd0: updated UUIDs ACAF943B769772E6:0000000000000000:127120C204C1B248:127020C204C1B249
[Wed Aug 29 01:39:49 2018] block drbd0: conn( SyncTarget -> Connected ) disk( Inconsistent -> UpToDate ) 
[Wed Aug 29 01:39:49 2018] block drbd0: helper command: /bin/true after-resync-target minor-0
[Wed Aug 29 01:39:49 2018] block drbd0: helper command: /bin/true after-resync-target minor-0 exit code 0 (0x0)

Second node:

[Wed Aug 29 01:42:48 2018] drbd resource0: PingAck did not arrive in time.
[Wed Aug 29 01:42:48 2018] drbd resource0: peer( Secondary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) 
[Wed Aug 29 01:42:48 2018] block drbd0: new current UUID ACAF943B769772E7:127020C204C1B249:8EF21B48CFD0C507:8EF11B48CFD0C507
[Wed Aug 29 01:42:48 2018] drbd resource0: ack_receiver terminated
[Wed Aug 29 01:42:48 2018] drbd resource0: Terminating drbd_a_resource
[Wed Aug 29 01:42:48 2018] drbd resource0: Connection closed
[Wed Aug 29 01:42:48 2018] drbd resource0: conn( NetworkFailure -> Unconnected ) 
[Wed Aug 29 01:42:48 2018] drbd resource0: receiver terminated
[Wed Aug 29 01:42:48 2018] drbd resource0: Restarting receiver thread
[Wed Aug 29 01:42:48 2018] drbd resource0: receiver (re)started
[Wed Aug 29 01:42:48 2018] drbd resource0: conn( Unconnected -> WFConnection ) 
[Wed Aug 29 01:42:50 2018] drbd resource0: Handshake successful: Agreed network protocol version 101
[Wed Aug 29 01:42:50 2018] drbd resource0: Feature flags enabled on protocol level: 0x7 TRIM THIN_RESYNC WRITE_SAME.
[Wed Aug 29 01:42:50 2018] drbd resource0: Peer authenticated using 16 bytes HMAC
[Wed Aug 29 01:42:50 2018] drbd resource0: conn( WFConnection -> WFReportParams ) 
[Wed Aug 29 01:42:50 2018] drbd resource0: Starting ack_recv thread (from drbd_r_resource [27503])
[Wed Aug 29 01:42:50 2018] block drbd0: drbd_sync_handshake:
[Wed Aug 29 01:42:50 2018] block drbd0: self ACAF943B769772E7:127020C204C1B249:8EF21B48CFD0C507:8EF11B48CFD0C507 bits:7 flags:0
[Wed Aug 29 01:42:50 2018] block drbd0: peer 127020C204C1B248:0000000000000000:8EF21B48CFD0C506:8EF11B48CFD0C507 bits:0 flags:0
[Wed Aug 29 01:42:50 2018] block drbd0: uuid_compare()=1 by rule 70
[Wed Aug 29 01:42:50 2018] block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> Consistent ) 
[Wed Aug 29 01:42:50 2018] block drbd0: send bitmap stats [Bytes(packets)]: plain 0(0), RLE 33(1), total 33; compression: 100.0%
[Wed Aug 29 01:42:51 2018] block drbd0: receive bitmap stats [Bytes(packets)]: plain 0(0), RLE 33(1), total 33; compression: 100.0%
[Wed Aug 29 01:42:51 2018] block drbd0: helper command: /bin/true before-resync-source minor-0
[Wed Aug 29 01:42:51 2018] block drbd0: helper command: /bin/true before-resync-source minor-0 exit code 0 (0x0)
[Wed Aug 29 01:42:51 2018] block drbd0: conn( WFBitMapS -> SyncSource ) pdsk( Consistent -> Inconsistent ) 
[Wed Aug 29 01:42:51 2018] block drbd0: Began resync as SyncSource (will sync 28 KB [7 bits set]).
[Wed Aug 29 01:42:51 2018] block drbd0: updated sync UUID ACAF943B769772E7:127120C204C1B249:127020C204C1B249:8EF21B48CFD0C507
[Wed Aug 29 01:42:51 2018] block drbd0: Resync done (total 1 sec; paused 0 sec; 28 K/sec)
[Wed Aug 29 01:42:51 2018] block drbd0: updated UUIDs ACAF943B769772E7:0000000000000000:127120C204C1B249:127020C204C1B249
[Wed Aug 29 01:42:51 2018] block drbd0: conn( SyncSource -> Connected ) pdsk( Inconsistent -> UpToDate ) 

Network card is intel I350, using igb module 5.4.0-k. I will check on this too.
Nicolas

-------- Message transféré -------
De: "Lars Ellenberg" <lars.ellenberg at linbit.com (mailto:lars.ellenberg at linbit.com?to=%22Lars%20Ellenberg%22%20<lars.ellenberg at linbit.com>)>
À: drbd-user at lists.linbit.com (mailto:drbd-user at lists.linbit.com)
Envoyé: 29 août 2018 12:09
Objet: Re: [DRBD-user] drbd issue? 

	On Tue, Aug 28, 2018 at 02:43:47PM +0000, Nicolas wrote:  Hi

I'm using some servers on debian with ganeti and drbd.

Since I've upgraded them to debian 9, and drbd 8.9.10-2 (from debian repo). 
"drbd 8.9.10" is the *utils* version
(drbdadm, drbdsetup, drbdmeta, various scripts ...)

drbd utils version is meanwhile at 9.5.0, btw. And no, that has not
much to do with what DRBD kernel module driver version you are using,
since we ship the "unified utils" for both "drbd 8" and "drbd 9",
which started years ago already, the utils version is decoupled from
the module versions.

What kernel version,
and what DRBD module version?

Maybe you want to make sure you use the latest 8.4 version (8.4.11
currently), and not whatever "shipts with the debian kernel"?
 I got a lot of issue with my drbd resources, I got randomly on my dmesg some resources disconnected:

today for example:

[Tue Aug 28 14:32:38 2018] drbd resource10: peer( Primary -> Unknown ) conn( Connected -> Disconnecting ) pdsk( UpToDate -> DUnknown ) 
Well, what does the other node say?
Hit some timeouts?
Some strangeness with the new NIC drivers?
A bug in the "shipped with the debian kernel" DRBD version?
--
: Lars Ellenberg
: LINBIT | Keeping the Digital World Running
: DRBD -- Heartbeat -- Corosync -- Pacemaker

DRBD® and LINBIT® are registered trademarks of LINBIT
__
please don't Cc me, but send to list -- I'm subscribed
_______________________________________________
drbd-user mailing list
drbd-user at lists.linbit.com (mailto:drbd-user at lists.linbit.com)
http://lists.linbit.com/mailman/listinfo/drbd-user (http://lists.linbit.com/mailman/listinfo/drbd-user)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20180829/7d101ed6/attachment-0001.htm>


More information about the drbd-user mailing list