[DRBD-user] DRBD sync immediately fails on creation
James Carlisi - NOAA Affiliate
james.carlisi at noaa.gov
Fri Mar 15 20:37:32 CET 2019
Hey all,
I'm relatively new to DRBD and I am having a really hard time figuring out
what I have done wrong or am missing. here is everything I can think of to
share. any suggestions welcome.
[node1 drbd.d]# uname -a
Linux jetms1 3.10.0-957.5.1.el7.x86_64 #1 SMP Fri Feb 1 14:54:57 UTC 2019
x86_64 x86_64 x86_64 GNU/Linux
[node1 drbd.d]# rpm -qa |grep drbd
drbd84-utils-9.6.0-1.el7.elrepo.x86_64
kmod-drbd84-8.4.11-1.1.el7_6.elrepo.x86_64
drbdlinks-1.28-3.el7.noarch
[node1 drbd.d]# fdisk -l
WARNING: fdisk GPT support is currently new, and therefore in an
experimental phase. Use at your own discretion.
Disk /dev/sda: 1599.7 GB, 1599741100032 bytes, 3124494336 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk label type: gpt
Disk identifier: BCF52711-BBB4-4B14-8791-EB1020E33701
# Start End Size Type Name
1 2048 4196351 2G EFI System EFI System Partition
2 4196352 8390655 2G Microsoft basic
3 8390656 532678655 250G Microsoft basic
4 532678656 637536255 50G Microsoft basic
5 637536256 671090687 16G Linux swap
6 671090688 671156223 32M Linux filesyste
7 671156224 671188991 16M Linux filesyste
[node2 drbd.d]# uname -a
Linux jetms2 3.10.0-957.5.1.el7.x86_64 #1 SMP Fri Feb 1 14:54:57 UTC 2019
x86_64 x86_64 x86_64 GNU/Linux
[node2 drbd.d]# rpm -qa |grep drbd
drbd84-utils-9.6.0-1.el7.elrepo.x86_64
kmod-drbd84-8.4.11-1.1.el7_6.elrepo.x86_64
drbdlinks-1.28-3.el7.noarch
[node2 drbd.d]# fdisk -l
WARNING: fdisk GPT support is currently new, and therefore in an
experimental phase. Use at your own discretion.
Disk /dev/sda: 1599.7 GB, 1599741100032 bytes, 3124494336 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk label type: gpt
Disk identifier: 7605E5A7-CAAF-46EA-BCA4-D730A6233ADF
# Start End Size Type Name
1 2048 4196351 2G EFI System EFI System Partition
2 4196352 8390655 2G Microsoft basic
3 8390656 532678655 250G Microsoft basic
4 532678656 566233087 16G Linux swap
5 566233088 671090687 50G Microsoft basic
6 671090688 671156223 32M Linux filesyste
7 671156224 671188991 16M Linux filesyste
Disk /dev/drbd0: 33 MB, 33554432 bytes, 65536 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
the config file:
[node1 drbd.d]# cat /etc/drbd.d/drbd0.res
resource drbd0 {
protocol C;
meta-disk internal;
device /dev/drbd0;
net {
verify-alg sha1;
after-sb-0pri discard-least-changes;
after-sb-1pri consensus;
after-sb-2pri call-pri-lost-after-sb;
sndbuf-size 10M;
rcvbuf-size 10M;
ping-int 30;
ping-timeout 40;
connect-int 10;
timeout 120;
ko-count 5;
max-buffers 128k;
max-epoch-size 8192;
}
syncer {
c-plan-ahead 20;
c-min-rate 1M;
c-max-rate 300M;
c-fill-target 2M;
al-extents 3389;
}
on node1 {
disk /dev/sda6;
meta-disk /dev/sda7;
address X.X.X.95:7788;
}
on node2 {
disk /dev/sda6;
meta-disk /dev/sda7;
address X.X.X.96:7788;
}
disk {
resync-rate 220M;
}
}
[node2 drbd.d]# cat /proc/drbd
version: 8.4.11-1 (api:1/proto:86-101)
GIT-hash: 66145a308421e9c124ec391a7848ac20203bb03c build by mockbuild@,
2018-11-03 01:26:55
0: cs:SyncSource ro:Primary/Secondary ds:UpToDate/Inconsistent C r---n-
ns:8980 nr:0 dw:0 dr:613264 al:0 bm:0 lo:0 pe:36 ua:105 ap:0 ep:1 wo:f
oos:32512
[=>..................] sync'ed: 12.5% (32512/32512)K
finish: 0:04:03 speed: 0 (0) K/sec
So I originally had 1T drive space and 32M, i was originally trying to sync
from node1 to node2, I've made a number of changes to the drbd0.res files.
I've rebooted both nodes, I can ssh between both nodes. No matter what i
changed, the speed never goes above 0. SELinux is turned off.
Here is a snippet from /car/log/messages
Mar 15 09:33:40 jetms2 kernel: drbd drbd0: receiver terminated
Mar 15 09:33:40 jetms2 kernel: drbd drbd0: Restarting receiver thread
Mar 15 09:33:40 jetms2 kernel: drbd drbd0: receiver (re)started
Mar 15 09:33:40 jetms2 kernel: drbd drbd0: conn( Unconnected ->
WFConnection )
Mar 15 09:33:44 jetms2 kernel: drbd drbd0: Handshake successful: Agreed
network protocol version 101
Mar 15 09:33:44 jetms2 kernel: drbd drbd0: Feature flags enabled on
protocol level: 0xf TRIM THIN_RESYNC WRITE_SAME WRITE_ZEROES.
Mar 15 09:33:44 jetms2 kernel: drbd drbd0: conn( WFConnection ->
WFReportParams )
Mar 15 09:33:44 jetms2 kernel: drbd drbd0: Starting ack_recv thread (from
drbd_r_drbd0 [130443])
Mar 15 09:33:44 jetms2 kernel: block drbd0: drbd_sync_handshake:
Mar 15 09:33:44 jetms2 kernel: block drbd0: self
D5273482D7DD2749:D94619D31B2CC10E:D94519D31B2CC10E:D94419D31B2CC10E
bits:8128 flags:0
Mar 15 09:33:44 jetms2 kernel: block drbd0: peer
D94619D31B2CC10E:0000000000000000:0000000000000000:0000000000000000
bits:8128 flags:0
Mar 15 09:33:44 jetms2 kernel: block drbd0: uuid_compare()=1 by rule 70
Mar 15 09:33:44 jetms2 kernel: block drbd0: Becoming sync source due to
disk states.
Mar 15 09:33:44 jetms2 kernel: block drbd0: peer( Unknown -> Secondary )
conn( WFReportParams -> WFBitMapS )
Mar 15 09:33:44 jetms2 kernel: block drbd0: send bitmap stats
[Bytes(packets)]: plain 0(0), RLE 22(1), total 22; compression: 98.0%
Mar 15 09:33:44 jetms2 kernel: block drbd0: receive bitmap stats
[Bytes(packets)]: plain 0(0), RLE 22(1), total 22; compression: 98.0%
Mar 15 09:33:44 jetms2 kernel: block drbd0: helper command: /sbin/drbdadm
before-resync-source minor-0
Mar 15 09:33:44 jetms2 kernel: block drbd0: helper command: /sbin/drbdadm
before-resync-source minor-0 exit code 0 (0x0)
Mar 15 09:33:44 jetms2 kernel: block drbd0: conn( WFBitMapS -> SyncSource )
Mar 15 09:33:44 jetms2 kernel: block drbd0: Began resync as SyncSource
(will sync 32512 KB [8128 bits set]).
Mar 15 09:33:44 jetms2 kernel: block drbd0: updated sync UUID
D5273482D7DD2749:D94719D31B2CC10E:D94619D31B2CC10E:D94519D31B2CC10E
Mar 15 09:34:08 jetms2 kernel: drbd drbd0: [drbd_w_drbd0/108961]
sock_sendmsg time expired, ko = 4
Mar 15 09:34:20 jetms2 kernel: drbd drbd0: [drbd_w_drbd0/108961]
sock_sendmsg time expired, ko = 3
Mar 15 09:34:32 jetms2 kernel: drbd drbd0: [drbd_w_drbd0/108961]
sock_sendmsg time expired, ko = 2
Mar 15 09:34:44 jetms2 kernel: drbd drbd0: [drbd_w_drbd0/108961]
sock_sendmsg time expired, ko = 1
Mar 15 09:34:56 jetms2 kernel: block drbd0: drbd_send_block() failed
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: peer( Secondary -> Unknown )
conn( SyncSource -> NetworkFailure )
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: ack_receiver terminated
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: Terminating drbd_a_drbd0
Mar 15 09:34:56 jetms2 kernel: block drbd0: net_ee not empty, killed 36
entries
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: Connection closed
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: conn( NetworkFailure ->
Unconnected )
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: receiver terminated
Mar 15 09:34:56 jetms2 kernel: drbd drbd0: Restarting receiver thread
--
JIM Carlisi
james.carlisi at noaa.gov | m: 919-332-1278
"We don't stop playing because we grow old;
we grow old because we stop playing."
- George Bernard Shaw
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20190315/af9d54df/attachment.htm>
More information about the drbd-user
mailing list