<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body text="#000000" bgcolor="#FFFFFF">
Hello,<br>
<br>
Thank you for your suggestion. The MTU is 1500 on both nodes. I had
it at 9000, but reverted everything to 'normal' to debug this
problem. Pinging as in your example works fine.<br>
<br>
Cheers,<br>
<br>
Dirk<br>
<br>
<div class="moz-cite-prefix">On 23-05-18 21:22, Nelson Hicks wrote:<br>
</div>
<blockquote type="cite"
cite="mid:48235aab-8906-16b0-6098-c29bd982c123@socket.net">Is
there any chance this could be an MTU mismatch between the two
nodes? If you use ping with varying packet sizes from one node to
the other, do they stop working above a specific size? Does
ifconfig report the same MTU size for the interface on both nodes?
<br>
<br>
Examples:
<br>
<br>
ifconfig | grep MTU
<br>
<br>
ping -s 500 <other_ip>
<br>
<br>
ping -s 1400 <other_ip>
<br>
<br>
ping -s 1472 <other_ip>
<br>
<br>
ping -s 2000 <other_ip>
<br>
<br>
Thanks,
<br>
<br>
- Nelson Hicks
<br>
<br>
<br>
<br>
<br>
On 05/23/2018 02:07 PM, Dirk Bonenkamp - ProActive wrote:
<br>
<blockquote type="cite">Hi,
<br>
<br>
Thank you for your reply.
<br>
<br>
I am / was under the impression that DRBD9 is the new and
improved DRBD, so I figured to use this version. But this is not
the case? Could somebody enlighten me a bit?
<br>
<br>
I already have disabled all bonding and other fancy network
stuff, so I'm using 1 nic currently. This doesn't solve
anything unfortunately.
<br>
<br>
Kind regards,
<br>
<br>
Dirk
<br>
<br>
On 23-05-18 14:20, Yannis Milios wrote:
<br>
<blockquote type="cite">Two things:
<br>
<br>
- I would use drbd8 instead of drbd9 for a 2 node setup.
<br>
- I would first test with 1 nic instead of 2.
<br>
<br>
On Wed, May 23, 2018 at 11:01 AM, Dirk Bonenkamp - ProActive
<<a class="moz-txt-link-abbreviated" href="mailto:dirk@proactive.nl">dirk@proactive.nl</a> <a class="moz-txt-link-rfc2396E" href="mailto:dirk@proactive.nl"><mailto:dirk@proactive.nl></a>>
wrote:
<br>
<br>
Hi List,
<br>
<br>
I'm struggling with a new DRBD9 setup. It's a simple
Master/Slave
<br>
setup.
<br>
I'm running Ubuntu 16.04 LTS with the DRBD9 packages from
the
<br>
Launchpad PPA.
<br>
<br>
I'm running some DRBD8 systems in production for quite
some
<br>
years, so I
<br>
have some experience. This setup is very similar, the only
major
<br>
difference is that this is DRBD9 and I use LUKS encrypted
<br>
partitions as
<br>
backend.
<br>
<br>
I keep running into this 'PingAck did not arrive in time.'
error,
<br>
which
<br>
points to network issues if I am correct (see complete log
snippet
<br>
below). This error occurs when I try to reattach the
secondary node
<br>
after a reboot. Initial sync works fine.
<br>
<br>
The servers are interconnected with 2 10Gb NICs. I had
bonding &
<br>
jumbo
<br>
frames configured, but deactivated all this, to no avail.
I've also
<br>
stripped the DRBD configuration to the bare minimum (see
below).
<br>
<br>
I've tested the connection with iperf and some other tools
and it
<br>
seems
<br>
just fine.
<br>
<br>
Could somebody point me in the right direction?
<br>
<br>
Thank you in advance, regards,
<br>
<br>
Dirk Bonenkamp
<br>
<br>
syslog messages:
<br>
<br>
May 23 11:31:56 data2 kernel: [ 704.111755] drbd: loading
<br>
out-of-tree
<br>
module taints kernel.
<br>
May 23 11:31:56 data2 kernel: [ 704.112290] drbd: module
<br>
verification
<br>
failed: signature and/or required key missing - tainting
kernel
<br>
May 23 11:31:56 data2 kernel: [ 704.127677] drbd:
initialized.
<br>
Version:
<br>
9.0.14-1 (api:2/proto:86-113)
<br>
May 23 11:31:56 data2 kernel: [ 704.127680] drbd:
GIT-hash:
<br>
62f906cf44ef02a30ce0c148fec223b40c51c533 build by
root@data2,
<br>
2018-05-23
<br>
09:19:54
<br>
May 23 11:31:56 data2 kernel: [ 704.127683] drbd:
registered as
<br>
block
<br>
device major 147
<br>
May 23 11:31:56 data2 kernel: [ 704.153565] drbd r0:
Starting worker
<br>
thread (from drbdsetup [4495])
<br>
May 23 11:31:56 data2 kernel: [ 704.183031] drbd r0/0
drbd0: disk(
<br>
Diskless -> Attaching )
<br>
May 23 11:31:56 data2 kernel: [ 704.183066] drbd r0/0
drbd0: Maximum
<br>
number of peer devices = 1
<br>
May 23 11:31:56 data2 kernel: [ 704.183293] drbd r0:
Method to
<br>
ensure
<br>
write ordering: flush
<br>
May 23 11:31:56 data2 kernel: [ 704.183308] drbd r0/0
drbd0:
<br>
drbd_bm_resize called with capacity == 273437203064
<br>
May 23 11:31:58 data2 kernel: [ 706.508228] drbd r0/0
drbd0: resync
<br>
bitmap: bits=34179650383 words=534057038 pages=1043081
<br>
May 23 11:31:58 data2 kernel: [ 706.508234] drbd r0/0
drbd0:
<br>
size = 127
<br>
TB (136718601532 KB)
<br>
May 23 11:31:58 data2 kernel: [ 706.508236] drbd r0/0
drbd0:
<br>
size = 127
<br>
TB (136718601532 KB)
<br>
May 23 11:32:10 data2 kernel: [ 717.890420] drbd r0/0
drbd0:
<br>
recounting
<br>
of set bits took additional 1256ms
<br>
May 23 11:32:10 data2 kernel: [ 717.890435] drbd r0/0
drbd0: disk(
<br>
Attaching -> Outdated )
<br>
May 23 11:32:10 data2 kernel: [ 717.890439] drbd r0/0
drbd0:
<br>
attached
<br>
to current UUID: 244DD61D2781DF44
<br>
May 23 11:32:10 data2 kernel: [ 717.918473] drbd r0
data1: Starting
<br>
sender thread (from drbdsetup [4544])
<br>
May 23 11:32:10 data2 kernel: [ 717.922534] drbd r0
data1: conn(
<br>
StandAlone -> Unconnected )
<br>
May 23 11:32:10 data2 kernel: [ 717.922820] drbd r0
data1: Starting
<br>
receiver thread (from drbd_w_r0 [4498])
<br>
May 23 11:32:10 data2 kernel: [ 717.922973] drbd r0
data1: conn(
<br>
Unconnected -> Connecting )
<br>
May 23 11:32:10 data2 kernel: [ 718.421219] drbd r0
data1:
<br>
Handshake to
<br>
peer 1 successful: Agreed network protocol version 113
<br>
May 23 11:32:10 data2 kernel: [ 718.421229] drbd r0
data1: Feature
<br>
flags enabled on protocol level: 0xf TRIM THIN_RESYNC
WRITE_SAME
<br>
WRITE_ZEROES.
<br>
May 23 11:32:10 data2 kernel: [ 718.421259] drbd r0
data1: Starting
<br>
ack_recv thread (from drbd_r_r0 [4550])
<br>
May 23 11:32:10 data2 kernel: [ 718.424095] drbd r0:
Preparing
<br>
cluster-wide state change 1205605755 (0->1 499/146)
<br>
May 23 11:32:10 data2 kernel: [ 718.437172] drbd r0:
State change
<br>
1205605755: primary_nodes=2, weak_nodes=FFFFFFFFFFFFFFFC
<br>
May 23 11:32:10 data2 kernel: [ 718.437185] drbd r0:
Aborting
<br>
cluster-wide state change 1205605755 (12ms) rv = -22
<br>
May 23 11:32:12 data2 kernel: [ 719.896223] drbd r0:
Preparing
<br>
cluster-wide state change 445952355 (0->1 499/146)
<br>
May 23 11:32:12 data2 kernel: [ 719.896498] drbd r0:
State change
<br>
445952355: primary_nodes=2, weak_nodes=FFFFFFFFFFFFFFFC
<br>
May 23 11:32:12 data2 kernel: [ 719.896508] drbd r0:
Committing
<br>
cluster-wide state change 445952355 (0ms)
<br>
May 23 11:32:12 data2 kernel: [ 719.896541] drbd r0
data1: conn(
<br>
Connecting -> Connected ) peer( Unknown -> Primary )
<br>
May 23 11:32:12 data2 kernel: [ 719.912186] drbd r0/0
drbd0 data1:
<br>
drbd_sync_handshake:
<br>
May 23 11:32:12 data2 kernel: [ 719.912198] drbd r0/0
drbd0
<br>
data1: self
<br>
244DD61D2781DF44:0000000000000000:0000000000000000:0000000000000000
<br>
bits:52035 flags:20
<br>
May 23 11:32:12 data2 kernel: [ 719.912207] drbd r0/0
drbd0
<br>
data1: peer
<br>
E38BE51FE782EAE0:244DD61D2781DF44:934CAB8662DF0410:E555BDC58E528356
<br>
bits:53162 flags:20
<br>
May 23 11:32:12 data2 kernel: [ 719.912214] drbd r0/0
drbd0 data1:
<br>
uuid_compare()=-2 by rule 50
<br>
May 23 11:32:12 data2 kernel: [ 719.912248] drbd r0/0
drbd0 data1:
<br>
pdsk( DUnknown -> UpToDate ) repl( Off -> WFBitMapT
)
<br>
May 23 11:32:32 data2 kernel: [ 740.397026] drbd r0
data1:
<br>
PingAck did
<br>
not arrive in time.
<br>
May 23 11:32:32 data2 kernel: [ 740.397121] drbd r0
data1: conn(
<br>
Connected -> NetworkFailure ) peer( Primary ->
Unknown )
<br>
May 23 11:32:32 data2 kernel: [ 740.397131] drbd r0/0
drbd0 data1:
<br>
pdsk( UpToDate -> DUnknown ) repl( WFBitMapT -> Off
)
<br>
May 23 11:32:32 data2 kernel: [ 740.397176] drbd r0
data1:
<br>
ack_receiver
<br>
terminated
<br>
May 23 11:32:32 data2 kernel: [ 740.397182] drbd r0
data1:
<br>
Terminating
<br>
ack_recv thread
<br>
May 23 11:32:32 data2 kernel: [ 740.458608] drbd r0
data1:
<br>
Connection
<br>
closed
<br>
May 23 11:32:32 data2 kernel: [ 740.458650] drbd r0
data1: conn(
<br>
NetworkFailure -> Unconnected )
<br>
May 23 11:32:32 data2 kernel: [ 740.458688] drbd r0
data1:
<br>
Restarting
<br>
receiver thread
<br>
May 23 11:32:32 data2 kernel: [ 740.458723] drbd r0
data1: conn(
<br>
Unconnected -> Connecting )
<br>
<br>
resources:
<br>
<br>
resource r0 {
<br>
on data1 {
<br>
device /dev/drbd0;
<br>
disk /dev/mapper/mapper_secure;
<br>
address 172.16.11.21:7789
<a class="moz-txt-link-rfc2396E" href="http://172.16.11.21:7789"><http://172.16.11.21:7789></a>;
<br>
meta-disk internal;
<br>
}
<br>
on data2 {
<br>
device /dev/drbd0;
<br>
disk /dev/mapper/mapper_secure;
<br>
address 172.16.11.22:7789
<a class="moz-txt-link-rfc2396E" href="http://172.16.11.22:7789"><http://172.16.11.22:7789></a>;
<br>
meta-disk internal;
<br>
}
<br>
}
<br>
<br>
drbd configuration:
<br>
<br>
global {
<br>
usage-count yes;
<br>
}
<br>
<br>
common {
<br>
#handlers {
<br>
# fence-peer
"/usr/lib/drbd/crm-fence-peer.9.sh
<br>
<a class="moz-txt-link-rfc2396E" href="http://crm-fence-peer.9.sh"><http://crm-fence-peer.9.sh></a>";
<br>
# after-resync-target
<br>
"/usr/lib/drbd/crm-unfence-peer.9.sh
<a class="moz-txt-link-rfc2396E" href="http://crm-unfence-peer.9.sh"><http://crm-unfence-peer.9.sh></a>";
<br>
#}
<br>
#disk {
<br>
# on-io-error detach;
<br>
# disk-barrier no;
<br>
# disk-flushes no;
<br>
# al-extents 3833;
<br>
# c-plan-ahead 7;
<br>
# c-fill-target 2M;
<br>
# c-min-rate 80M;
<br>
# c-max-rate 720M;
<br>
#}
<br>
net {
<br>
protocol C;
<br>
#fencing resource-only;
<br>
#cram-hmac-alg sha1;
<br>
#verify-alg sha1;
<br>
#shared-secret
1e69dc721fd2e65368ae3ba1e5929979;
<br>
#after-sb-0pri disconnect;
<br>
#after-sb-1pri disconnect;
<br>
#after-sb-2pri disconnect;
<br>
#max-buffers 8000;
<br>
#max-epoch-size 8000;
<br>
#sndbuf-size 0;
<br>
#rcvbuf-size 2048k;
<br>
}
<br>
}
<br>
<br>
<br>
<br>
_______________________________________________
<br>
drbd-user mailing list
<br>
<a class="moz-txt-link-abbreviated" href="mailto:drbd-user@lists.linbit.com">drbd-user@lists.linbit.com</a>
<a class="moz-txt-link-rfc2396E" href="mailto:drbd-user@lists.linbit.com"><mailto:drbd-user@lists.linbit.com></a>
<br>
<a class="moz-txt-link-freetext" href="http://lists.linbit.com/mailman/listinfo/drbd-user">http://lists.linbit.com/mailman/listinfo/drbd-user</a>
<br>
<a class="moz-txt-link-rfc2396E" href="http://lists.linbit.com/mailman/listinfo/drbd-user"><http://lists.linbit.com/mailman/listinfo/drbd-user></a>
<br>
<br>
<br>
</blockquote>
<br>
-- <br>
ProActive Software
<br>
Dirk Bonenkamp
<br>
CTO <a class="moz-txt-link-rfc2396E" href="https://www.proactive-software.com"><https://www.proactive-software.com></a>
<br>
Phone: +31 (0)23 54 222 99
<br>
Mobile: +31 (0)6 250 787 93 Richard Holkade 9
<br>
2033 PZ Haarlem
<br>
LinkedIn <a class="moz-txt-link-rfc2396E" href="http://linkd.in/1V6egnk"><http://linkd.in/1V6egnk></a> Facebook
<a class="moz-txt-link-rfc2396E" href="http://bit.ly/FBProActive"><http://bit.ly/FBProActive></a> YouTube
<a class="moz-txt-link-rfc2396E" href="http://bit.ly/1Mc23L9"><http://bit.ly/1Mc23L9></a> <a class="moz-txt-link-abbreviated" href="http://www.proactive.nl">www.proactive.nl</a>
<a class="moz-txt-link-rfc2396E" href="https://www.proactive.nl"><https://www.proactive.nl></a>
<br>
<br>
<br>
<br>
_______________________________________________
<br>
drbd-user mailing list
<br>
<a class="moz-txt-link-abbreviated" href="mailto:drbd-user@lists.linbit.com">drbd-user@lists.linbit.com</a>
<br>
<a class="moz-txt-link-freetext" href="http://lists.linbit.com/mailman/listinfo/drbd-user">http://lists.linbit.com/mailman/listinfo/drbd-user</a>
<br>
</blockquote>
<br>
_______________________________________________
<br>
drbd-user mailing list
<br>
<a class="moz-txt-link-abbreviated" href="mailto:drbd-user@lists.linbit.com">drbd-user@lists.linbit.com</a>
<br>
<a class="moz-txt-link-freetext" href="http://lists.linbit.com/mailman/listinfo/drbd-user">http://lists.linbit.com/mailman/listinfo/drbd-user</a>
<br>
</blockquote>
<br>
<div class="moz-signature">-- <br>
<title>ProActive Software</title>
<link href="http://fonts.googleapis.com/css?family=Lato:400"
rel="stylesheet" type="text/css">
<style type="text/css">
html, body, table {
        font-family:                Lato;
        font-size:                12px;
}
a {
        color:                        #009FE3;
        text-decoration:        none;
}
.name {
        font-size:                18px;
        font-weight:                medium;
        color:                        333333;
}
.line {
        background-color:        #FDB100;
        padding:                0;
}
td {
        padding:                10px;
}
</style>
<table cellspacing="0" cellpadding="0" border="0">
<tbody>
<tr>
<td style="font-family:Lato; font-size:12px; padding:10px;
height:30px; vertical-align:top; text-align:right;"
width="175" valign="top" align="right"> <span
class="name" style="font-family:Lato; font-size:16px;
color:#333333">Dirk Bonenkamp</span><br>
CTO </td>
<td rowspan="3" style="background-color:#FDB100; padding:0"
class="line"> </td>
<td style="font-family:Lato; font-size:12px; padding:10px;
height:30px; vertical-align:center; text-align:right;"><a
href="https://www.proactive-software.com"><img alt=""
src="https://proactive-software.com/wp-content/uploads/2018/02/signature.png"
width="200" border="0" height="30"></a></td>
</tr>
<tr>
<td style="font-family:Lato; font-size:12px; padding:10px;
height:50px; vertical-align:top; text-align:right;"
valign="top" align="right"> Phone: +31 (0)23 54 222 99<br>
Mobile: +31 (0)6 250 787 93 </td>
<td style="font-family:Lato; font-size:12px; padding:10px;
height:50px; vertical-align:top; text-align:left;"
valign="top" align="left"> Richard Holkade 9<br>
2033 PZ Haarlem </td>
</tr>
<tr>
<td style="font-family:Lato; font-size:12px; padding:10px;
height:30px; vertical-align:top; text-align:right;"
valign="top" align="right"> <a
href="http://linkd.in/1V6egnk"><img
src="https://proactive-software.com/wp-content/uploads/2018/02/linkedin.png"
alt="LinkedIn" width="17" border="0" height="16"></a>
<a href="http://bit.ly/FBProActive"><img
src="https://proactive-software.com/wp-content/uploads/2018/02/facebook.png"
alt="Facebook" width="17" border="0" height="16"></a>
<a href="http://bit.ly/1Mc23L9"><img
src="https://proactive-software.com/wp-content/uploads/2018/02/youtube.png"
alt="YouTube" width="17" border="0" height="16"></a> </td>
<td style="font-family:Lato; font-size:12px; padding:10px;
height:30px; vertical-align:top; text-align:left;"
valign="top" align="left"> <a
href="https://www.proactive.nl">www.proactive.nl</a> </td>
</tr>
</tbody>
</table>
</div>
</body>
</html>