<!DOCTYPE html><html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8" /></head><body><div data-html-editor-font-wrapper="true" style="font-family: arial, sans-serif; font-size: 13px;"> <p>Hello<br><br>Sorry for the misunderstanding of utils version.<br><br>I'm using the kernel : 4.9.88-1+deb9u1 (4.9.0-6-amd64 debian).<br>And the module version v8.4.7.<br><br>filename: /lib/modules/4.9.0-6-amd64/kernel/drivers/block/drbd/drbd.ko<br>alias: block-major-147-*<br>license: GPL<br>version: 8.4.7<br>description: drbd - Distributed Replicated Block Device v8.4.7<br>author: Philipp Reisner <phil@linbit.com>, Lars Ellenberg <lars@linbit.com><br>srcversion: 0904DF2CCF7283ACE07D07A<br>depends: lru_cache,libcrc32c<br>retpoline: Y<br>intree: Y<br>vermagic: 4.9.0-6-amd64 SMP mod_unload modversions <br>parm: minor_count:Approximate number of drbd devices (1-255) (uint)<br>parm: disable_sendpage:bool<br>parm: allow_oos:DONT USE! (bool)<br>parm: proc_details:int<br>parm: usermode_helper:string<br><br><br>For example when a node says:<br><br>[Tue Aug 28 14:32:38 2018] drbd resource10: peer( Primary -> Unknown ) conn( Connected -> <strong>Disconnecting</strong> ) pdsk( UpToDate -> DUnknown ) <br>[Tue Aug 28 14:32:38 2018] drbd resource10: ack_receiver terminated<br>[Tue Aug 28 14:32:38 2018] drbd resource10: Terminating drbd_a_resource<br>[Tue Aug 28 14:32:38 2018] drbd resource10: Connection closed<br>[Tue Aug 28 14:32:38 2018] drbd resource10: conn( Disconnecting -> StandAlone ) <br>[Tue Aug 28 14:32:38 2018] drbd resource10: receiver terminated<br>[Tue Aug 28 14:32:38 2018] drbd resource10: Terminating drbd_r_resource<br>[Tue Aug 28 14:32:38 2018] block drbd10: disk( UpToDate -> Failed ) <br>[Tue Aug 28 14:32:38 2018] block drbd10: 0 KB (0 bits) marked out-of-sync by on disk bit-map.<br>[Tue Aug 28 14:32:38 2018] block drbd10: disk( Failed -> Diskless ) <br>[Tue Aug 28 14:32:38 2018] drbd resource10: Terminating drbd_w_resource<br>[Tue Aug 28 14:32:40 2018] drbd resource10: Starting worker thread (from drbdsetup-84 [10222])<br>[Tue Aug 28 14:32:40 2018] block drbd10: disk( Diskless -> Attaching ) <br>[Tue Aug 28 14:32:40 2018] drbd resource10: Method to ensure write ordering: flush<br>[Tue Aug 28 14:32:40 2018] block drbd10: max BIO size = 262144<br>[Tue Aug 28 14:32:40 2018] block drbd10: Adjusting my ra_pages to backing device's (32 -> 256)<br>[Tue Aug 28 14:32:40 2018] block drbd10: drbd_bm_resize called with capacity == 314572800<br>[Tue Aug 28 14:32:40 2018] block drbd10: resync bitmap: bits=39321600 words=614400 pages=1200<br>[Tue Aug 28 14:32:40 2018] block drbd10: size = 150 GB (157286400 KB)<br>[Tue Aug 28 14:32:40 2018] block drbd10: recounting of set bits took additional 0 jiffies<br>[Tue Aug 28 14:32:40 2018] block drbd10: 0 KB (0 bits) marked out-of-sync by on disk bit-map.<br>[Tue Aug 28 14:32:40 2018] block drbd10: disk( Attaching -> UpToDate ) <br>[Tue Aug 28 14:32:40 2018] block drbd10: attached to UUIDs 0748EE11C429D3B4:0000000000000000:FDAEFCD2E8D9890A:FDADFCD2E8D9890B<br>[Tue Aug 28 14:32:40 2018] drbd resource10: conn( StandAlone -> Unconnected ) <br>[Tue Aug 28 14:32:40 2018] drbd resource10: Starting receiver thread (from drbd_w_resource [10225])<br>[Tue Aug 28 14:32:40 2018] drbd resource10: receiver (re)started<br>[Tue Aug 28 14:32:40 2018] drbd resource10: conn( Unconnected -> WFConnection ) <br>[Tue Aug 28 14:32:41 2018] drbd resource10: Handshake successful: Agreed network protocol version 101<br>[Tue Aug 28 14:32:41 2018] drbd resource10: Feature flags enabled on protocol level: 0x7 TRIM THIN_RESYNC WRITE_SAME.<br>[Tue Aug 28 14:32:41 2018] drbd resource10: Peer authenticated using 16 bytes HMAC<br>[Tue Aug 28 14:32:41 2018] drbd resource10: conn( WFConnection -> WFReportParams ) <br>[Tue Aug 28 14:32:41 2018] drbd resource10: Starting ack_recv thread (from drbd_r_resource [10246])<br>[Tue Aug 28 14:32:41 2018] block drbd10: drbd_sync_handshake:<br>[Tue Aug 28 14:32:41 2018] block drbd10: self 0748EE11C429D3B4:0000000000000000:FDAEFCD2E8D9890A:FDADFCD2E8D9890B bits:0 flags:0<br>[Tue Aug 28 14:32:41 2018] block drbd10: peer 629F1036CD6CA2AF:0748EE11C429D3B5:FDAEFCD2E8D9890B:FDADFCD2E8D9890B bits:0 flags:0<br>[Tue Aug 28 14:32:41 2018] block drbd10: uuid_compare()=-1 by rule 50<br>[Tue Aug 28 14:32:41 2018] block drbd10: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) disk( UpToDate -> Outdated ) pdsk( DUnknown -> UpToDate ) <br>[Tue Aug 28 14:32:41 2018] block drbd10: receive bitmap stats [Bytes(packets)]: plain 0(0), RLE 23(1), total 23; compression: 100.0%<br>[Tue Aug 28 14:32:41 2018] block drbd10: send bitmap stats [Bytes(packets)]: plain 0(0), RLE 23(1), total 23; compression: 100.0%<br>[Tue Aug 28 14:32:41 2018] block drbd10: conn( WFBitMapT -> WFSyncUUID ) <br>[Tue Aug 28 14:32:41 2018] block drbd10: updated sync uuid 0749EE11C429D3B4:0000000000000000:FDAEFCD2E8D9890A:FDADFCD2E8D9890B<br>[Tue Aug 28 14:32:41 2018] block drbd10: helper command: /bin/true before-resync-target minor-10<br>[Tue Aug 28 14:32:41 2018] block drbd10: helper command: /bin/true before-resync-target minor-10 exit code 0 (0x0)<br>[Tue Aug 28 14:32:41 2018] block drbd10: conn( WFSyncUUID -> SyncTarget ) disk( Outdated -> Inconsistent ) <br>[Tue Aug 28 14:32:41 2018] block drbd10: Began resync as SyncTarget (will sync 0 KB [0 bits set]).<br>[Tue Aug 28 14:32:41 2018] block drbd10: Resync done (total 1 sec; paused 0 sec; 0 K/sec)<br>[Tue Aug 28 14:32:41 2018] block drbd10: updated UUIDs 629F1036CD6CA2AE:0000000000000000:0749EE11C429D3B4:0748EE11C429D3B5<br>[Tue Aug 28 14:32:41 2018] block drbd10: conn( SyncTarget -> Connected ) disk( Inconsistent -> UpToDate ) <br>[Tue Aug 28 14:32:41 2018] block drbd10: helper command: /bin/true after-resync-target minor-10<br>[Tue Aug 28 14:32:41 2018] block drbd10: helper command: /bin/true after-resync-target minor-10 exit code 0 (0x0)<br><br>The second says:<br><br><strong>[Tue Aug 28 14:35:33 2018] br0: port 8(tap6) entered disabled state<br>[Tue Aug 28 14:35:33 2018] device tap6 left promiscuous mode<br>[Tue Aug 28 14:35:33 2018] br0: port 8(tap6) entered disabled state</strong><br>[Tue Aug 28 14:35:37 2018] drbd resource10: peer( Secondary -> Unknown ) conn( Connected -> <strong>TearDown</strong> ) pdsk( UpToDate -> DUnknown ) <br>[Tue Aug 28 14:35:37 2018] drbd resource10: ack_receiver terminated<br>[Tue Aug 28 14:35:37 2018] drbd resource10: Terminating drbd_a_resource<br>[Tue Aug 28 14:35:37 2018] block drbd10: new current UUID 629F1036CD6CA2AF:0748EE11C429D3B5:FDAEFCD2E8D9890B:FDADFCD2E8D9890B<br>[Tue Aug 28 14:35:37 2018] drbd resource10: Connection closed<br>[Tue Aug 28 14:35:37 2018] drbd resource10: conn( TearDown -> Unconnected ) <br>[Tue Aug 28 14:35:37 2018] drbd resource10: receiver terminated<br>[Tue Aug 28 14:35:37 2018] drbd resource10: Restarting receiver thread<br>[Tue Aug 28 14:35:37 2018] drbd resource10: receiver (re)started<br>[Tue Aug 28 14:35:37 2018] drbd resource10: conn( Unconnected -> WFConnection ) <br>[Tue Aug 28 14:35:38 2018] block drbd10: role( Primary -> Secondary ) <br>[Tue Aug 28 14:35:38 2018] block drbd10: 0 KB (0 bits) marked out-of-sync by on disk bit-map.<br>[Tue Aug 28 14:35:38 2018] drbd resource10: conn( WFConnection -> Disconnecting ) <br>[Tue Aug 28 14:35:38 2018] drbd resource10: Discarding network configuration.<br>[Tue Aug 28 14:35:38 2018] drbd resource10: Connection closed<br>[Tue Aug 28 14:35:38 2018] drbd resource10: conn( Disconnecting -> StandAlone ) <br>[Tue Aug 28 14:35:38 2018] drbd resource10: receiver terminated<br>[Tue Aug 28 14:35:38 2018] drbd resource10: Terminating drbd_r_resource<br>[Tue Aug 28 14:35:38 2018] block drbd10: disk( UpToDate -> Failed ) <br>[Tue Aug 28 14:35:38 2018] block drbd10: 0 KB (0 bits) marked out-of-sync by on disk bit-map.<br>[Tue Aug 28 14:35:38 2018] block drbd10: disk( Failed -> Diskless ) <br>[Tue Aug 28 14:35:38 2018] drbd resource10: Terminating drbd_w_resource<br>[Tue Aug 28 14:35:40 2018] drbd resource10: Starting worker thread (from drbdsetup-84 [3025])<br>[Tue Aug 28 14:35:40 2018] block drbd10: disk( Diskless -> Attaching ) <br>[Tue Aug 28 14:35:40 2018] drbd resource10: Method to ensure write ordering: flush<br>[Tue Aug 28 14:35:40 2018] block drbd10: max BIO size = 262144<br>[Tue Aug 28 14:35:40 2018] block drbd10: Adjusting my ra_pages to backing device's (32 -> 256)<br>[Tue Aug 28 14:35:40 2018] block drbd10: drbd_bm_resize called with capacity == 314572800<br>[Tue Aug 28 14:35:40 2018] block drbd10: resync bitmap: bits=39321600 words=614400 pages=1200<br>[Tue Aug 28 14:35:40 2018] block drbd10: size = 150 GB (157286400 KB)<br>[Tue Aug 28 14:35:41 2018] block drbd10: recounting of set bits took additional 0 jiffies<br>[Tue Aug 28 14:35:41 2018] block drbd10: 0 KB (0 bits) marked out-of-sync by on disk bit-map.<br>[Tue Aug 28 14:35:41 2018] block drbd10: disk( Attaching -> UpToDate ) <br>[Tue Aug 28 14:35:41 2018] block drbd10: attached to UUIDs 629F1036CD6CA2AF:0748EE11C429D3B5:FDAEFCD2E8D9890B:FDADFCD2E8D9890B<br>[Tue Aug 28 14:35:41 2018] drbd resource10: conn( StandAlone -> Unconnected ) <br>[Tue Aug 28 14:35:41 2018] drbd resource10: Starting receiver thread (from drbd_w_resource [3030])<br>[Tue Aug 28 14:35:41 2018] drbd resource10: receiver (re)started<br>[Tue Aug 28 14:35:41 2018] drbd resource10: conn( Unconnected -> WFConnection ) <br>[Tue Aug 28 14:35:41 2018] block drbd10: role( Secondary -> Primary ) <br>[Tue Aug 28 14:35:41 2018] drbd resource10: Handshake successful: Agreed network protocol version 101<br>[Tue Aug 28 14:35:41 2018] drbd resource10: Feature flags enabled on protocol level: 0x7 TRIM THIN_RESYNC WRITE_SAME.<br>[Tue Aug 28 14:35:41 2018] drbd resource10: Peer authenticated using 16 bytes HMAC<br>[Tue Aug 28 14:35:41 2018] drbd resource10: conn( WFConnection -> WFReportParams ) <br>[Tue Aug 28 14:35:41 2018] drbd resource10: Starting ack_recv thread (from drbd_r_resource [3045])<br>[Tue Aug 28 14:35:41 2018] block drbd10: drbd_sync_handshake:<br>[Tue Aug 28 14:35:41 2018] block drbd10: self 629F1036CD6CA2AF:0748EE11C429D3B5:FDAEFCD2E8D9890B:FDADFCD2E8D9890B bits:0 flags:0<br>[Tue Aug 28 14:35:41 2018] block drbd10: peer 0748EE11C429D3B4:0000000000000000:FDAEFCD2E8D9890A:FDADFCD2E8D9890B bits:0 flags:0<br>[Tue Aug 28 14:35:41 2018] block drbd10: uuid_compare()=1 by rule 70<br>[Tue Aug 28 14:35:41 2018] block drbd10: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> Consistent ) <br>[Tue Aug 28 14:35:41 2018] block drbd10: send bitmap stats [Bytes(packets)]: plain 0(0), RLE 23(1), total 23; compression: 100.0%<br>[Tue Aug 28 14:35:41 2018] block drbd10: receive bitmap stats [Bytes(packets)]: plain 0(0), RLE 23(1), total 23; compression: 100.0%<br>[Tue Aug 28 14:35:41 2018] block drbd10: helper command: /bin/true before-resync-source minor-10<br>[Tue Aug 28 14:35:41 2018] block drbd10: helper command: /bin/true before-resync-source minor-10 exit code 0 (0x0)<br>[Tue Aug 28 14:35:41 2018] block drbd10: conn( WFBitMapS -> SyncSource ) pdsk( Consistent -> Inconsistent ) <br>[Tue Aug 28 14:35:41 2018] block drbd10: Began resync as SyncSource (will sync 0 KB [0 bits set]).<br>[Tue Aug 28 14:35:41 2018] block drbd10: updated sync UUID 629F1036CD6CA2AF:0749EE11C429D3B5:0748EE11C429D3B5:FDAEFCD2E8D9890B<br>[Tue Aug 28 14:35:41 2018] block drbd10: Resync done (total 1 sec; paused 0 sec; 0 K/sec)<br>[Tue Aug 28 14:35:41 2018] block drbd10: updated UUIDs 629F1036CD6CA2AF:0000000000000000:0749EE11C429D3B5:0748EE11C429D3B5<br>[Tue Aug 28 14:35:41 2018] block drbd10: conn( SyncSource -> Connected ) pdsk( Inconsistent -> UpToDate ) <br>[Tue Aug 28 14:35:41 2018] br0: port 8(tap6) entered blocking state<br>[Tue Aug 28 14:35:41 2018] br0: port 8(tap6) entered disabled state<br>[Tue Aug 28 14:35:41 2018] device tap6 entered promiscuous mode<br>[Tue Aug 28 14:35:41 2018] br0: port 8(tap6) entered blocking state<br>[Tue Aug 28 14:35:41 2018] br0: port 8(tap6) entered forwarding state<br><br>And it seems for this example the second node was the origin of this. <br><br><br>This night I got another error, saying network failure, but I'm sure there was no network issue:<br><br>First node: <br><br><strong>[Wed Aug 29 01:39:48 2018] drbd resource0: meta connection shut down by peer.</strong><br>[Wed Aug 29 01:39:48 2018] drbd resource0: peer( Primary -> Unknown ) conn( Connected -> <strong>NetworkFailure</strong> ) pdsk( UpToDate -> DUnknown ) <br>[Wed Aug 29 01:39:48 2018] drbd resource0: ack_receiver terminated<br>[Wed Aug 29 01:39:48 2018] drbd resource0: Terminating drbd_a_resource<br>[Wed Aug 29 01:39:48 2018] drbd resource0: Connection closed<br>[Wed Aug 29 01:39:48 2018] drbd resource0: conn( NetworkFailure -> Unconnected ) <br>[Wed Aug 29 01:39:48 2018] drbd resource0: receiver terminated<br>[Wed Aug 29 01:39:48 2018] drbd resource0: Restarting receiver thread<br>[Wed Aug 29 01:39:48 2018] drbd resource0: receiver (re)started<br>[Wed Aug 29 01:39:48 2018] drbd resource0: conn( Unconnected -> WFConnection ) <br>[Wed Aug 29 01:39:49 2018] drbd resource0: Handshake successful: Agreed network protocol version 101<br>[Wed Aug 29 01:39:49 2018] drbd resource0: Feature flags enabled on protocol level: 0x7 TRIM THIN_RESYNC WRITE_SAME.<br>[Wed Aug 29 01:39:49 2018] drbd resource0: Peer authenticated using 16 bytes HMAC<br>[Wed Aug 29 01:39:49 2018] drbd resource0: conn( WFConnection -> WFReportParams ) <br>[Wed Aug 29 01:39:49 2018] drbd resource0: Starting ack_recv thread (from drbd_r_resource [6370])<br>[Wed Aug 29 01:39:49 2018] block drbd0: drbd_sync_handshake:<br>[Wed Aug 29 01:39:49 2018] block drbd0: self 127020C204C1B248:0000000000000000:8EF21B48CFD0C506:8EF11B48CFD0C507 bits:0 flags:0<br>[Wed Aug 29 01:39:49 2018] block drbd0: peer ACAF943B769772E7:127020C204C1B249:8EF21B48CFD0C507:8EF11B48CFD0C507 bits:7 flags:0<br>[Wed Aug 29 01:39:49 2018] block drbd0: uuid_compare()=-1 by rule 50<br>[Wed Aug 29 01:39:49 2018] block drbd0: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) disk( UpToDate -> Outdated ) pdsk( DUnknown -> UpToDate ) <br>[Wed Aug 29 01:39:49 2018] block drbd0: receive bitmap stats [Bytes(packets)]: plain 0(0), RLE 33(1), total 33; compression: 100.0%<br>[Wed Aug 29 01:39:49 2018] block drbd0: send bitmap stats [Bytes(packets)]: plain 0(0), RLE 33(1), total 33; compression: 100.0%<br>[Wed Aug 29 01:39:49 2018] block drbd0: conn( WFBitMapT -> WFSyncUUID ) <br>[Wed Aug 29 01:39:49 2018] block drbd0: updated sync uuid 127120C204C1B248:0000000000000000:8EF21B48CFD0C506:8EF11B48CFD0C507<br>[Wed Aug 29 01:39:49 2018] block drbd0: helper command: /bin/true before-resync-target minor-0<br>[Wed Aug 29 01:39:49 2018] block drbd0: helper command: /bin/true before-resync-target minor-0 exit code 0 (0x0)<br>[Wed Aug 29 01:39:49 2018] block drbd0: conn( WFSyncUUID -> SyncTarget ) disk( Outdated -> Inconsistent ) <br>[Wed Aug 29 01:39:49 2018] block drbd0: Began resync as SyncTarget (will sync 28 KB [7 bits set]).<br>[Wed Aug 29 01:39:49 2018] block drbd0: Resync done (total 1 sec; paused 0 sec; 28 K/sec)<br>[Wed Aug 29 01:39:49 2018] block drbd0: updated UUIDs ACAF943B769772E6:0000000000000000:127120C204C1B248:127020C204C1B249<br>[Wed Aug 29 01:39:49 2018] block drbd0: conn( SyncTarget -> Connected ) disk( Inconsistent -> UpToDate ) <br>[Wed Aug 29 01:39:49 2018] block drbd0: helper command: /bin/true after-resync-target minor-0<br>[Wed Aug 29 01:39:49 2018] block drbd0: helper command: /bin/true after-resync-target minor-0 exit code 0 (0x0)<br><br>Second node:<br><br><strong>[Wed Aug 29 01:42:48 2018] drbd resource0: PingAck did not arrive in time.</strong><br>[Wed Aug 29 01:42:48 2018] drbd resource0: peer( Secondary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) <br>[Wed Aug 29 01:42:48 2018] block drbd0: new current UUID ACAF943B769772E7:127020C204C1B249:8EF21B48CFD0C507:8EF11B48CFD0C507<br>[Wed Aug 29 01:42:48 2018] drbd resource0: ack_receiver terminated<br>[Wed Aug 29 01:42:48 2018] drbd resource0: Terminating drbd_a_resource<br>[Wed Aug 29 01:42:48 2018] drbd resource0: Connection closed<br>[Wed Aug 29 01:42:48 2018] drbd resource0: conn( NetworkFailure -> Unconnected ) <br>[Wed Aug 29 01:42:48 2018] drbd resource0: receiver terminated<br>[Wed Aug 29 01:42:48 2018] drbd resource0: Restarting receiver thread<br>[Wed Aug 29 01:42:48 2018] drbd resource0: receiver (re)started<br>[Wed Aug 29 01:42:48 2018] drbd resource0: conn( Unconnected -> WFConnection ) <br>[Wed Aug 29 01:42:50 2018] drbd resource0: Handshake successful: Agreed network protocol version 101<br>[Wed Aug 29 01:42:50 2018] drbd resource0: Feature flags enabled on protocol level: 0x7 TRIM THIN_RESYNC WRITE_SAME.<br>[Wed Aug 29 01:42:50 2018] drbd resource0: Peer authenticated using 16 bytes HMAC<br>[Wed Aug 29 01:42:50 2018] drbd resource0: conn( WFConnection -> WFReportParams ) <br>[Wed Aug 29 01:42:50 2018] drbd resource0: Starting ack_recv thread (from drbd_r_resource [27503])<br>[Wed Aug 29 01:42:50 2018] block drbd0: drbd_sync_handshake:<br>[Wed Aug 29 01:42:50 2018] block drbd0: self ACAF943B769772E7:127020C204C1B249:8EF21B48CFD0C507:8EF11B48CFD0C507 bits:7 flags:0<br>[Wed Aug 29 01:42:50 2018] block drbd0: peer 127020C204C1B248:0000000000000000:8EF21B48CFD0C506:8EF11B48CFD0C507 bits:0 flags:0<br>[Wed Aug 29 01:42:50 2018] block drbd0: uuid_compare()=1 by rule 70<br>[Wed Aug 29 01:42:50 2018] block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> Consistent ) <br>[Wed Aug 29 01:42:50 2018] block drbd0: send bitmap stats [Bytes(packets)]: plain 0(0), RLE 33(1), total 33; compression: 100.0%<br>[Wed Aug 29 01:42:51 2018] block drbd0: receive bitmap stats [Bytes(packets)]: plain 0(0), RLE 33(1), total 33; compression: 100.0%<br>[Wed Aug 29 01:42:51 2018] block drbd0: helper command: /bin/true before-resync-source minor-0<br>[Wed Aug 29 01:42:51 2018] block drbd0: helper command: /bin/true before-resync-source minor-0 exit code 0 (0x0)<br>[Wed Aug 29 01:42:51 2018] block drbd0: conn( WFBitMapS -> SyncSource ) pdsk( Consistent -> Inconsistent ) <br>[Wed Aug 29 01:42:51 2018] block drbd0: Began resync as SyncSource (will sync 28 KB [7 bits set]).<br>[Wed Aug 29 01:42:51 2018] block drbd0: updated sync UUID ACAF943B769772E7:127120C204C1B249:127020C204C1B249:8EF21B48CFD0C507<br>[Wed Aug 29 01:42:51 2018] block drbd0: Resync done (total 1 sec; paused 0 sec; 28 K/sec)<br>[Wed Aug 29 01:42:51 2018] block drbd0: updated UUIDs ACAF943B769772E7:0000000000000000:127120C204C1B249:127020C204C1B249<br>[Wed Aug 29 01:42:51 2018] block drbd0: conn( SyncSource -> Connected ) pdsk( Inconsistent -> UpToDate ) <br><br>Network card is intel I350, using igb module 5.4.0-k. I will check on this too.<br><br><br><signature>Nicolas</signature><br><br>-------- Message transféré -------<br>De: "Lars Ellenberg" <<a target="_blank" tabindex="-1" href="mailto:lars.ellenberg@linbit.com?to=%22Lars%20Ellenberg%22%20<lars.ellenberg@linbit.com>">lars.ellenberg@linbit.com</a>><br>À: <a target="_blank" tabindex="-1" href="mailto:drbd-user@lists.linbit.com">drbd-user@lists.linbit.com</a><br>Envoyé: 29 août 2018 12:09<br>Objet: Re: [DRBD-user] drbd issue?</p> <pre>On Tue, Aug 28, 2018 at 02:43:47PM +0000, Nicolas wrote: </pre> <blockquote>Hi<br><br>I'm using some servers on debian with ganeti and drbd.<br><br>Since I've upgraded them to debian 9, and drbd 8.9.10-2 (from debian repo).</blockquote> <br>"drbd 8.9.10" is the *utils* version<br>(drbdadm, drbdsetup, drbdmeta, various scripts ...)<br><br>drbd utils version is meanwhile at 9.5.0, btw. And no, that has not<br>much to do with what DRBD kernel module driver version you are using,<br>since we ship the "unified utils" for both "drbd 8" and "drbd 9",<br>which started years ago already, the utils version is decoupled from<br>the module versions.<br><br>What kernel version,<br>and what DRBD module version?<br><br>Maybe you want to make sure you use the latest 8.4 version (8.4.11<br>currently), and not whatever "shipts with the debian kernel"?<br><br><br> <blockquote>I got a lot of issue with my drbd resources, I got randomly on my dmesg some resources disconnected:<br><br>today for example:<br><br>[Tue Aug 28 14:32:38 2018] drbd resource10: peer( Primary -> Unknown ) conn( Connected -> Disconnecting ) pdsk( UpToDate -> DUnknown )</blockquote> <br>Well, what does the other node say?<br>Hit some timeouts?<br>Some strangeness with the new NIC drivers?<br>A bug in the "shipped with the debian kernel" DRBD version?<br><br><br>--<br>: Lars Ellenberg<br>: LINBIT | Keeping the Digital World Running<br>: DRBD -- Heartbeat -- Corosync -- Pacemaker<br><br>DRBD® and LINBIT® are registered trademarks of LINBIT<br>__<br>please don't Cc me, but send to list -- I'm subscribed<br>_______________________________________________<br>drbd-user mailing list<br><a target="_blank" rel="noopener noreferrer" href="mailto:drbd-user@lists.linbit.com">drbd-user@lists.linbit.com</a><br><a target="_blank" rel="noopener noreferrer" href="http://lists.linbit.com/mailman/listinfo/drbd-user">http://lists.linbit.com/mailman/listinfo/drbd-user</a><br> </div></body></html>