Hi Felix, guys,<div><br></div><div>Thanks for all the help and advice ---</div><div><br></div><div>So, in the end, yep, guess what ... DRBD on XenServer 6 does not appear to like Open vSwitch in my case.</div><div><br></div>
<div>If the Linbit guys are listening, might be worth noting that in the XS6 setup guide.</div><div><br></div><div>If anyone else experiences this problem...</div><div><br></div><div>Just run "<span style="background-color:rgb(255,255,255);color:rgb(34,34,34);font-family:arial,sans-serif;font-size:13px">xe-switch-network-backend bridge" on your nodes, which seamlessly switches from open vswitch to linux bridging after a reboot... and poof all of my freezing and problems have now gone away at the expense of the OVS features that I wasn't using anyways.</span></div>
<div><span style="background-color:rgb(255,255,255);color:rgb(34,34,34);font-family:arial,sans-serif;font-size:13px"><br></span></div><div><span style="background-color:rgb(255,255,255);color:rgb(34,34,34);font-family:arial,sans-serif;font-size:13px">Cheers,</span></div>
<div><span style="background-color:rgb(255,255,255);color:rgb(34,34,34);font-family:arial,sans-serif;font-size:13px">Andrew</span></div><div><br><div class="gmail_quote">On Sun, Oct 7, 2012 at 12:46 PM, Andrew Eross <span dir="ltr"><<a href="mailto:eross@locatrix.com" target="_blank">eross@locatrix.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div>Hi Felix,</div><div><br></div><div>I have a very simple default type config running here:</div><div><br></div><div>
<div>global </div><div>{ </div><div> usage-count yes; </div><div>}</div><div>common </div><div>{</div>
<div> protocol C;</div><div> net </div><div> {</div><div> after-sb-0pri discard-zero-changes;</div><div> after-sb-1pri consensus;</div><div> after-sb-2pri disconnect;</div><div> }</div><div>
disk </div><div> { </div><div> }</div><div> handlers </div><div> {</div><div> split-brain "/usr/lib/drbd/notify-split-brain.sh"; </div><div> }</div><div>}</div></div><div><br></div><div>
<div>resource drbd-sr1 {</div><div>net {</div><div>} </div><div>syncer {</div><div>}</div><div>disk {</div><div>on-io-error detach;</div><div>} </div><div> on <a href="http://vh03.office.locatrix.net" target="_blank">vh03.office.locatrix.net</a> {</div>
<div> device /dev/drbd1;</div><div> disk /dev/VG_XenStorage-862f18be-6501-63bd-328d-e4a0843fac7b/drdb;</div><div> address <a href="http://10.0.0.3:7789" target="_blank">10.0.0.3:7789</a>;</div><div> meta-disk internal;</div>
<div> }</div><div> on <a href="http://vh04.office.locatrix.net" target="_blank">vh04.office.locatrix.net</a> {</div><div> device /dev/drbd1;</div><div> disk /dev/VG_XenStorage-94ac59fb-2d82-df06-5560-76570dcfa307/drbd;</div>
<div> address <a href="http://10.0.0.4:7789" target="_blank">10.0.0.4:7789</a>;</div><div> meta-disk internal;</div><div> }</div><div>}</div></div><div><br></div><div>ifconfig looks like this:</div><div>
<div>[root@vh04 ~]# ifconfig</div>
<div>eth0 Link encap:Ethernet HWaddr 40:6C:8F:24:C0:6F </div><div> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1</div><div> RX packets:311004408 errors:0 dropped:0 overruns:0 frame:229</div>
<div>
TX packets:628898462 errors:1 dropped:0 overruns:0 carrier:0</div><div> collisions:0 txqueuelen:1000 </div><div> RX bytes:3336540977 (3.1 GiB) TX bytes:1043299490 (994.9 MiB)</div><div> Interrupt:16 </div>
<div><br></div><div>eth1 Link encap:Ethernet HWaddr 7C:C3:A1:87:F1:D9 </div><div> UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1</div><div> RX packets:327842282 errors:225 dropped:0 overruns:0 frame:225</div>
<div> TX packets:346387593 errors:0 dropped:0 overruns:0 carrier:0</div><div> collisions:0 txqueuelen:1000 </div><div> RX bytes:667488230 (636.5 MiB) TX bytes:997801992 (951.5 MiB)</div><div><br>
</div><div>lo Link encap:Local Loopback </div><div> inet addr:127.0.0.1 Mask:255.0.0.0</div><div> UP LOOPBACK RUNNING MTU:16436 Metric:1</div><div> RX packets:42549511 errors:0 dropped:0 overruns:0 frame:0</div>
<div> TX packets:42549511 errors:0 dropped:0 overruns:0 carrier:0</div><div> collisions:0 txqueuelen:0 </div><div> RX bytes:4111323287 (3.8 GiB) TX bytes:4111323287 (3.8 GiB)</div><div><br></div>
<div>vif18.0 Link encap:Ethernet HWaddr FE:FF:FF:FF:FF:FF </div><div> UP BROADCAST RUNNING NOARP PROMISC MTU:1500 Metric:1</div><div> RX packets:11497 errors:0 dropped:0 overruns:0 frame:0</div><div>
TX packets:11489 errors:0 dropped:17 overruns:0 carrier:0</div><div> collisions:0 txqueuelen:32 </div><div> RX bytes:2100283 (2.0 MiB) TX bytes:2682900 (2.5 MiB)</div><div><br></div><div>vif19.0 Link encap:Ethernet HWaddr FE:FF:FF:FF:FF:FF </div>
<div> UP BROADCAST RUNNING NOARP PROMISC MTU:1500 Metric:1</div><div> RX packets:10801 errors:0 dropped:0 overruns:0 frame:0</div><div> TX packets:14222 errors:0 dropped:49 overruns:0 carrier:0</div>
<div> collisions:0 txqueuelen:32 </div><div> RX bytes:2537871 (2.4 MiB) TX bytes:2558791 (2.4 MiB)</div><div><br></div><div>xenbr0 Link encap:Ethernet HWaddr 40:6C:8F:24:C0:6F </div><div> inet addr:10.0.0.4 Bcast:10.0.0.255 Mask:255.255.255.0</div>
<div> UP BROADCAST RUNNING PROMISC MULTICAST MTU:1500 Metric:1</div><div> RX packets:311004466 errors:0 dropped:0 overruns:0 frame:0</div><div> TX packets:166306729 errors:0 dropped:0 overruns:0 carrier:0</div>
<div> collisions:0 txqueuelen:0 </div><div> RX bytes:<a href="tel:2092488469" value="+12092488469" target="_blank">2092488469</a> (1.9 GiB) TX bytes:<a href="tel:2485661044" value="+12485661044" target="_blank">2485661044</a> (2.3 GiB)</div>
<div><br></div><div>xenbr1 Link encap:Ethernet HWaddr 7C:C3:A1:87:F1:D9 </div><div> inet addr:192.168.41.24 Bcast:192.168.41.255 Mask:255.255.255.0</div>
<div> UP BROADCAST RUNNING PROMISC MULTICAST MTU:1500 Metric:1</div><div> RX packets:63765301 errors:0 dropped:0 overruns:0 frame:0</div><div> TX packets:31683504 errors:0 dropped:0 overruns:0 carrier:0</div>
<div> collisions:0 txqueuelen:0 </div><div> RX bytes:1320179908 (1.2 GiB) TX bytes:2335142508 (2.1 GiB)</div></div><div><br></div>Hmmmm... in fact I think it does use OVS.<div><br></div><div>I followed the Linbit directions on setting up DRBD with XenServer 6:</div>
<div><a href="http://www.linbit.com/fileadmin/tech-guides/xenserver-drbd.pdf" target="_blank">http://www.linbit.com/fileadmin/tech-guides/xenserver-drbd.pdf</a></div><div><br></div><div>Which means I added the DRBD interface as a XS management interface, which gets setup as a bridge by XS.</div>
<div><br></div><div>I also see that Open vSwitch is now the default with XS6 as well, so yes it must be.</div><div><br></div><div>I can change from open vswitch to normal linux bridging with "xe-switch-network-backend bridge" on XenServer, so what the heck let's see if that does some magic here.</div>
<div class="HOEnZb"><div class="h5">
<div><br></div><div><div><div>
<br><br><div class="gmail_quote">On Sun, Oct 7, 2012 at 12:28 PM, Felix Frank <span dir="ltr"><<a href="mailto:ff@mpexnet.de" target="_blank">ff@mpexnet.de</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br>
On 10/07/2012 05:13 PM, Andrew Eross wrote<div><br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Very strange! Any ideas, guys?<br>
</blockquote>
<br></div>
Yeah. I haven't looked at your config, but if DRBD relies on an OVS port for replication, that may be a Bad Idea.<br>
<br>
HTH,<br>
Felix<br>
</blockquote></div><br></div></div></div>
</div></div></blockquote></div><br></div>