<div dir="ltr">Hi Youssef!<div><br></div><div>Well, I did not dig into the details here, since I was involved in some other activities. However, we use DRBD with bonding in several deployments, and not long ago we twice had EXACTLY the situation you describe. We use mode 0 on two cross-connected 1Gb interfaces; testing with iperf gives a transfer rate of ~200 Mb/sec (as expected).</div>
<div>Both situations described below had the same symptoms: PingAck timeouts, disconnects/reconnects, and a sync rate below 1000 Kb (!).</div><div><br></div><div>Situation 1: two servers had the same MAC address on eth0 by mistake (we run software whose license is bound to the MAC of the eth0 interface, so we have to deal with that when moving the resource).</div>
<div><br></div><div>Situation 2: both servers had eth0 connected correctly; however, eth1 on the 1st server had link up but was cabled to some other server rather than to the 2nd server, and the 2nd server had link down on eth1 (with eth0+eth1 bonded).</div>
<div><br></div><div>Both situations led to the symptoms you describe.</div><div><br></div><div>In situation 1 we just corrected the MACs and, of course, everything came right.</div><div>In situation 2 we just took eth1 down on both servers (ifdown eth1) until the cabling issues were resolved, so the bond had only one active interface, and this also resolved the issue (of course, with the speed limited to 1 Gb).</div>
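<div><br></div><div>A minimal Python sketch of how both situations could be spotted from the bonding status files, assuming the text of /proc/net/bonding/bond0 (the bond name is an assumption) has been collected from each node. It relies on the same format as the bond setup quoted further down in this thread. The function names and the whole node B sample, including its MAC address 00:00:5e:00:53:01 and its eth4 link state, are made up for illustration; only the node A values come from the thread.</div>

```python
# Sketch only: parses bonding status text and flags the two failure
# patterns described above (a slave with link down, a MAC seen on both nodes).

def parse_bonding(text):
    """Return the bonding mode and the per-slave fields found in the text."""
    info = {"mode": None, "slaves": {}}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if ":" not in line:
            continue
        # Split on the first colon only, so MAC addresses stay intact.
        key, value = [part.strip() for part in line.split(":", 1)]
        if key == "Bonding Mode":
            info["mode"] = value
        elif key == "Slave Interface":
            current = value
            info["slaves"][current] = {}
        elif current is not None:
            info["slaves"][current][key] = value
    return info


def down_slaves(info):
    """Names of slaves whose MII status is anything other than 'up'."""
    return sorted(
        name
        for name, fields in info["slaves"].items()
        if fields.get("MII Status") != "up"
    )


def shared_macs(info_a, info_b):
    """Permanent hardware addresses that appear on both nodes."""
    def macs(info):
        return {
            fields.get("Permanent HW addr", "").lower()
            for fields in info["slaves"].values()
        }
    common = macs(info_a).intersection(macs(info_b))
    common.discard("")
    return sorted(common)


# Node A: values taken from the bond status quoted in this thread.
NODE_A = """Bonding Mode: fault-tolerance (broadcast)
MII Status: up

Slave Interface: eth0
MII Status: up
Permanent HW addr: c8:0a:a9:f1:a9:82

Slave Interface: eth4
MII Status: up
Permanent HW addr: c8:0a:a9:f1:a9:84
"""

# Node B: HYPOTHETICAL sample, invented to show both problems at once.
NODE_B = """Bonding Mode: fault-tolerance (broadcast)
MII Status: up

Slave Interface: eth0
MII Status: up
Permanent HW addr: c8:0a:a9:f1:a9:82

Slave Interface: eth4
MII Status: down
Permanent HW addr: 00:00:5e:00:53:01
"""

if __name__ == "__main__":
    a, b = parse_bonding(NODE_A), parse_bonding(NODE_B)
    print("node A down slaves:", down_slaves(a))
    print("node B down slaves:", down_slaves(b))
    print("MACs present on both nodes:", shared_macs(a, b))
```

<div>A slave that is up on one node but down on the other points at a cabling problem like situation 2, and any address reported by shared_macs points at situation 1. Note that the check uses the permanent hardware address, so a MAC overridden in software would need a separate look.</div>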
<div><br></div><div>So you might want to try disabling the 1st and then the 2nd pair of NICs, one at a time.</div><div><br></div><div>Hope this helps</div><div><br></div><div>Best regards,</div><div>Alexandr A. Alexandrov</div></div><div class="gmail_extra">
<br><br><div class="gmail_quote">2014-03-06 17:59 GMT+04:00 Latrous, Youssef <span dir="ltr"><<a href="mailto:YLatrous@broadviewnet.com" target="_blank">YLatrous@broadviewnet.com</a>></span>:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div lang="EN-US" link="blue" vlink="purple">
<div>
<p class="MsoNormal">Hi Alexandr,<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Thank you for the response. I checked our bonding setup and I didn’t see any issues (see below for details). We use the “broadcast” mode over cross cables, with no switches in between: a direct connection between the two servers, sitting side by side, with 2 NICs on one node connected to the other node’s NICs. Is the broadcast mode the right choice in this configuration? I don’t understand the MAC address reference in this context. Does DRBD check this info for Acks? That is, if it sends on one NIC and receives on the other NIC, would it drop the packet?<u></u><u></u></p>
<p class="MsoNormal">Also, given that DRBD uses TCP with built-in retransmits over these cross cables, I really don’t see how we could lose packets within the 6-second window. Please note that we monitor this network and report any issues (we use pacemaker). We haven’t seen any issues with this network so far.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">As you can notice, I’m a bit lost here :)<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Thank you,<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Youssef<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">PS. Here is our bond setup for this HA network.<u></u><u></u></p>
<p class="MsoNormal">--<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Ethernet Channel Bonding Driver: v3.7.1 (April 27, 2011)<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Bonding Mode: fault-tolerance (broadcast)<u></u><u></u></p>
<p class="MsoNormal">MII Status: up<u></u><u></u></p>
<p class="MsoNormal">MII Polling Interval (ms): 100<u></u><u></u></p>
<p class="MsoNormal">Up Delay (ms): 0<u></u><u></u></p>
<p class="MsoNormal">Down Delay (ms): 0<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Slave Interface: eth0<u></u><u></u></p>
<p class="MsoNormal">MII Status: up<u></u><u></u></p>
<p class="MsoNormal">Speed: 1000 Mbps<u></u><u></u></p>
<p class="MsoNormal">Duplex: full<u></u><u></u></p>
<p class="MsoNormal">Link Failure Count: 0<u></u><u></u></p>
<p class="MsoNormal">Permanent HW addr: c8:0a:a9:f1:a9:82<u></u><u></u></p>
<p class="MsoNormal">Slave queue ID: 0<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Slave Interface: eth4<u></u><u></u></p>
<p class="MsoNormal">MII Status: up<u></u><u></u></p>
<p class="MsoNormal">Speed: 1000 Mbps<u></u><u></u></p>
<p class="MsoNormal">Duplex: full<u></u><u></u></p>
<p class="MsoNormal">Link Failure Count: 0<u></u><u></u></p>
<p class="MsoNormal">Permanent HW addr: c8:0a:a9:f1:a9:84<u></u><u></u></p>
<p class="MsoNormal">Slave queue ID: 0<u></u><u></u></p><div class="">
<div style="border:none;border-bottom:solid windowtext 1.0pt;padding:0in 0in 1.0pt 0in">
<p class="MsoNormal" style="border:none;padding:0in"><u></u> <u></u></p>
</div>
<p class="MsoNormal"><u></u> <u></u></p>
<pre><span style>Youssef,<u></u><u></u></span></pre>
<pre><span style><u></u> <u></u></span></pre>
<pre><span style>Check your bonding mode!<u></u><u></u></span></pre>
<pre><span style>It appears that you are losing packets; this can be because the mode is wrong or<u></u><u></u></span></pre>
<pre><span style>the MAC addresses are wrong.<u></u><u></u></span></pre>
<pre><span style><u></u> <u></u></span></pre>
<pre><span style>Best regards,<u></u><u></u></span></pre>
<pre><span style>Alexandr A. Alexandrov<u></u><u></u></span></pre>
<pre><span style><u></u> <u></u></span></pre>
<pre><span style><u></u> <u></u></span></pre>
</div><pre><span style>2014-03-06 0:38 GMT+04:00 Latrous, Youssef <<a href="http://lists.linbit.com/mailman/listinfo/drbd-user" target="_blank">YLatrous at broadviewnet.com</a>>:<u></u><u></u></span></pre><div class="">
<pre><span style><u></u> <u></u></span></pre>
<pre><span style>><i> Hello,<u></u><u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i> We are currently experiencing a weird “PingAck” timeout on a system with<u></u><u></u></i></span></pre>
<pre><span style>><i> two nodes, and an active/passive configuration. The two nodes are using a<u></u><u></u></i></span></pre>
<pre><span style>><i> cross-cabled connection over two bonded Gigabit NICs. This network never<u></u><u></u></i></span></pre>
<pre><span style>><i> goes down and is used only for DRBD and CRM cluster data exchange. It’s barely<u></u><u></u></i></span></pre>
<pre><span style>><i> used (very light load). We are running SLES 11 SP2, DRBD release 8.4.2, and<u></u><u></u></i></span></pre>
<pre><span style>><i> pacemaker 1.1.7.<u></u><u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i> We couldn’t find a DRBD configuration option to set the number of<u></u><u></u></i></span></pre>
<pre><span style>><i> retries before giving up.<u></u><u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i> Our concern is that we do not understand how a PingAck can time out over<u></u><u></u></i></span></pre>
<pre><span style>><i> such a reliable medium. Any insight into this would be much appreciated.<u></u><u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i> On the same note, are there any guards against it? Any best practices<u></u><u></u></i></span></pre>
<pre><span style>><i> (setups) we could use to avoid this situation?<u></u><u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i> Thanks for any help,<u></u><u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i> Youssef<u></u><u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i> _______________________________________________<u></u><u></u></i></span></pre>
<pre><span style>><i> drbd-user mailing list<u></u><u></u></i></span></pre>
</div><pre><span style>><i> <a href="http://lists.linbit.com/mailman/listinfo/drbd-user" target="_blank">drbd-user at lists.linbit.com</a><u></u><u></u></i></span></pre><div class="">
<pre><span style>><i> <a href="http://lists.linbit.com/mailman/listinfo/drbd-user" target="_blank">http://lists.linbit.com/mailman/listinfo/drbd-user</a><u></u><u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style>><i><u></u> <u></u></i></span></pre>
<pre><span style><u></u> <u></u></span></pre>
<pre><span style><u></u> <u></u></span></pre>
<pre><span style>-- <u></u><u></u></span></pre>
<pre><span style>Best regards, AAA.<u></u><u></u></span></pre>
</div>
</div>
</div>
<br></blockquote></div><br><br clear="all"><div><br></div>-- <br>Best regards, AAA.
</div>