[DRBD-user] DRBD sync immediately fails on creation

James Carlisi - NOAA Affiliate james.carlisi at noaa.gov
Fri Mar 15 20:37:32 CET 2019


Hey all,
I'm relatively new to DRBD and I am having a really hard time figuring out
what I have done wrong or am missing.  here is everything I can think of to
share. any suggestions welcome.

[node1 drbd.d]# uname -a
Linux jetms1 3.10.0-957.5.1.el7.x86_64 #1 SMP Fri Feb 1 14:54:57 UTC 2019
x86_64 x86_64 x86_64 GNU/Linux
[node1 drbd.d]# rpm -qa |grep drbd
drbd84-utils-9.6.0-1.el7.elrepo.x86_64
kmod-drbd84-8.4.11-1.1.el7_6.elrepo.x86_64
drbdlinks-1.28-3.el7.noarch
[node1 drbd.d]# fdisk -l
WARNING: fdisk GPT support is currently new, and therefore in an
experimental phase. Use at your own discretion.

Disk /dev/sda: 1599.7 GB, 1599741100032 bytes, 3124494336 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk label type: gpt
Disk identifier: BCF52711-BBB4-4B14-8791-EB1020E33701


#         Start          End    Size  Type            Name
 1         2048      4196351      2G  EFI System      EFI System Partition
 2      4196352      8390655      2G  Microsoft basic
 3      8390656    532678655    250G  Microsoft basic
 4    532678656    637536255     50G  Microsoft basic
 5    637536256    671090687     16G  Linux swap
 6    671090688    671156223     32M  Linux filesyste
 7    671156224    671188991     16M  Linux filesyste


[node2 drbd.d]# uname -a
Linux jetms2 3.10.0-957.5.1.el7.x86_64 #1 SMP Fri Feb 1 14:54:57 UTC 2019
x86_64 x86_64 x86_64 GNU/Linux
[node2 drbd.d]# rpm -qa |grep drbd
drbd84-utils-9.6.0-1.el7.elrepo.x86_64
kmod-drbd84-8.4.11-1.1.el7_6.elrepo.x86_64
drbdlinks-1.28-3.el7.noarch
[node2 drbd.d]# fdisk -l
WARNING: fdisk GPT support is currently new, and therefore in an
experimental phase. Use at your own discretion.

Disk /dev/sda: 1599.7 GB, 1599741100032 bytes, 3124494336 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk label type: gpt
Disk identifier: 7605E5A7-CAAF-46EA-BCA4-D730A6233ADF


#         Start          End    Size  Type            Name
 1         2048      4196351      2G  EFI System      EFI System Partition
 2      4196352      8390655      2G  Microsoft basic
 3      8390656    532678655    250G  Microsoft basic
 4    532678656    566233087     16G  Linux swap
 5    566233088    671090687     50G  Microsoft basic
 6    671090688    671156223     32M  Linux filesyste
 7    671156224    671188991     16M  Linux filesyste

Disk /dev/drbd0: 33 MB, 33554432 bytes, 65536 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes

the config file:
[node1 drbd.d]# cat /etc/drbd.d/drbd0.res
resource drbd0 {
  protocol C;
  meta-disk internal;
  device  /dev/drbd0;
  net {
    verify-alg sha1;
    after-sb-0pri discard-least-changes;
    after-sb-1pri consensus;
    after-sb-2pri call-pri-lost-after-sb;
    sndbuf-size 10M;
    rcvbuf-size 10M;
    ping-int 30;
    ping-timeout 40;
    connect-int 10;
    timeout 120;
    ko-count 5;
    max-buffers 128k;
    max-epoch-size 8192;
  }
  syncer {
    c-plan-ahead 20;
    c-min-rate 1M;
    c-max-rate 300M;
    c-fill-target 2M;
    al-extents 3389;
  }
  on node1 {
    disk /dev/sda6;
    meta-disk /dev/sda7;
    address X.X.X.95:7788;
  }
  on node2 {
    disk /dev/sda6;
    meta-disk /dev/sda7;
    address X.X.X.96:7788;
  }
  disk {
    resync-rate 220M;
  }
}

[node2 drbd.d]# cat /proc/drbd
version: 8.4.11-1 (api:1/proto:86-101)
GIT-hash: 66145a308421e9c124ec391a7848ac20203bb03c build by mockbuild@,
2018-11-03 01:26:55
 0: cs:SyncSource ro:Primary/Secondary ds:UpToDate/Inconsistent C r---n-
    ns:8980 nr:0 dw:0 dr:613264 al:0 bm:0 lo:0 pe:36 ua:105 ap:0 ep:1 wo:f
oos:32512
        [=>..................] sync'ed: 12.5% (32512/32512)K
        finish: 0:04:03 speed: 0 (0) K/sec

So I originally had 1T drive space and 32M, i was originally trying to sync
from node1 to node2, I've made a number of changes to the drbd0.res files.
I've rebooted both nodes, I can ssh between both nodes.  No matter what i
changed, the speed never goes above 0. SELinux is turned off.

Here is a snippet from /car/log/messages

Mar 15 09:33:40 jetms2 kernel: drbd drbd0: receiver terminated
Mar 15 09:33:40 jetms2 kernel: drbd drbd0: Restarting receiver thread
Mar 15 09:33:40 jetms2 kernel: drbd drbd0: receiver (re)started
Mar 15 09:33:40 jetms2 kernel: drbd drbd0: conn( Unconnected ->
WFConnection )
Mar 15 09:33:44 jetms2 kernel: drbd drbd0: Handshake successful: Agreed
network protocol version 101
Mar 15 09:33:44 jetms2 kernel: drbd drbd0: Feature flags enabled on
protocol level: 0xf TRIM THIN_RESYNC WRITE_SAME WRITE_ZEROES.
Mar 15 09:33:44 jetms2 kernel: drbd drbd0: conn( WFConnection ->
WFReportParams )
Mar 15 09:33:44 jetms2 kernel: drbd drbd0: Starting ack_recv thread (from
drbd_r_drbd0 [130443])
Mar 15 09:33:44 jetms2 kernel: block drbd0: drbd_sync_handshake:
Mar 15 09:33:44 jetms2 kernel: block drbd0: self
D5273482D7DD2749:D94619D31B2CC10E:D94519D31B2CC10E:D94419D31B2CC10E
bits:8128 flags:0
Mar 15 09:33:44 jetms2 kernel: block drbd0: peer
D94619D31B2CC10E:0000000000000000:0000000000000000:0000000000000000
bits:8128 flags:0
Mar 15 09:33:44 jetms2 kernel: block drbd0: uuid_compare()=1 by rule 70
Mar 15 09:33:44 jetms2 kernel: block drbd0: Becoming sync source due to
disk states.
Mar 15 09:33:44 jetms2 kernel: block drbd0: peer( Unknown -> Secondary )
conn( WFReportParams -> WFBitMapS )
Mar 15 09:33:44 jetms2 kernel: block drbd0: send bitmap stats
[Bytes(packets)]: plain 0(0), RLE 22(1), total 22; compression: 98.0%
Mar 15 09:33:44 jetms2 kernel: block drbd0: receive bitmap stats
[Bytes(packets)]: plain 0(0), RLE 22(1), total 22; compression: 98.0%
Mar 15 09:33:44 jetms2 kernel: block drbd0: helper command: /sbin/drbdadm
before-resync-source minor-0
Mar 15 09:33:44 jetms2 kernel: block drbd0: helper command: /sbin/drbdadm
before-resync-source minor-0 exit code 0 (0x0)
Mar 15 09:33:44 jetms2 kernel: block drbd0: conn( WFBitMapS -> SyncSource )
Mar 15 09:33:44 jetms2 kernel: block drbd0: Began resync as SyncSource
(will sync 32512 KB [8128 bits set]).
Mar 15 09:33:44 jetms2 kernel: block drbd0: updated sync UUID
D5273482D7DD2749:D94719D31B2CC10E:D94619D31B2CC10E:D94519D31B2CC10E
Mar 15 09:34:08 jetms2 kernel: drbd drbd0: [drbd_w_drbd0/108961]
sock_sendmsg time expired, ko = 4
Mar 15 09:34:20 jetms2 kernel: drbd drbd0: [drbd_w_drbd0/108961]
sock_sendmsg time expired, ko = 3
Mar 15 09:34:32 jetms2 kernel: drbd drbd0: [drbd_w_drbd0/108961]
sock_sendmsg time expired, ko = 2
Mar 15 09:34:44 jetms2 kernel: drbd drbd0: [drbd_w_drbd0/108961]
sock_sendmsg time expired, ko = 1
Mar 15 09:34:56 jetms2 kernel: block drbd0: drbd_send_block() failed
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: peer( Secondary -> Unknown )
conn( SyncSource -> NetworkFailure )
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: ack_receiver terminated
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: Terminating drbd_a_drbd0
Mar 15 09:34:56 jetms2 kernel: block drbd0: net_ee not empty, killed 36
entries
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: Connection closed
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: conn( NetworkFailure ->
Unconnected )
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: receiver terminated
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: Restarting receiver thread


-- 

JIM Carlisi
james.carlisi at noaa.gov | m: 919-332-1278

"We don't stop playing because we grow old;
we grow old because we stop playing."
- George Bernard Shaw
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20190315/af9d54df/attachment.htm>


More information about the drbd-user mailing list