<div dir="ltr">On Mon, Oct 29, 2012 at 5:43 PM, Maurits van de Lande <span dir="ltr"><<a href="mailto:M.vandeLande@vdl-fittings.com" target="_blank">M.vandeLande@vdl-fittings.com</a>></span> wrote:<br><div class="gmail_quote">
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div lang="NL" link="blue" vlink="purple">
<div><div class="im">
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">Hello,<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u> <u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">When one node unexpectedly shuts down, dlm locks down until quorum is regained AND the faulty node is fenced, before it can take over the cluster
resources.<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u> <u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">I assume that you have set the “two_node” flag in cluster.conf<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u></span></p></div></div></div></blockquote><div><br></div><div>yes, I have it set because I want to primary/primary setup</div>
<div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div lang="NL" link="blue" vlink="purple"><div><div class="im"><p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"> <u></u></span></p>
</div><p class="MsoNormal"><span lang="EN-US">>Oct 29 08:05:59 node1 fenced[1401]: fence node2 dev 0.0 agent fence_ack_manual result: error from agent<u></u><u></u></span></p><div class="im">
<p class="MsoNormal"><span lang="EN-US">Oct 29 08:05:59 node1 fenced[1401]: fence node2 failed<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u> <u></u></span></p>
</div><p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">I think that adding the following option to the dlm section in cluster.conf<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">enable_fencing="0"<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">might solve this problem. (but I have not tested this) This will disable fencing.<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u></span></p></div></div></blockquote><div>giving a try</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div lang="NL" link="blue" vlink="purple"><div><p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"> <u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">Or you can setup fencing.<u></u><u></u></span></p><div class="im">
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u> </span></p></div></div></div></blockquote><div>How can I do so? </div><div><br>
</div><div>I am testing this two virtual machines on vmware workstation, do I need fence_vmware for this?</div><div> </div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div lang="NL" link="blue" vlink="purple"><div><div class="im"><p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">Best regards,<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d">Maurits van de Lande
<u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u> <u></u></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1f497d"><u></u> <u></u></span></p>
<div style="border:none;border-top:solid #b5c4df 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"><b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">Van:</span></b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""> <a href="mailto:drbd-user-bounces@lists.linbit.com" target="_blank">drbd-user-bounces@lists.linbit.com</a> [mailto:<a href="mailto:drbd-user-bounces@lists.linbit.com" target="_blank">drbd-user-bounces@lists.linbit.com</a>]
<b>Namens </b>Zohair Raza<br>
<b>Verzonden:</b> maandag 29 oktober 2012 11:03<br>
<b>Aan:</b> <a href="mailto:drbd-user@lists.linbit.com" target="_blank">drbd-user@lists.linbit.com</a><br>
<b>Onderwerp:</b> [DRBD-user] GFS2 freezes<u></u><u></u></span></p>
</div>
<p class="MsoNormal"><u></u> <u></u></p>
</div><div>
<div>
<p class="MsoNormal">Hi, <u></u><u></u></p>
</div><div><div class="h5">
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">I have setup a Primary/Primary cluster with GFS2.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">All works good if I shut down any node regularly, but when I unplug power of any node, GFS freezes and I can not access the device. <u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">Tried to use <a href="http://people.redhat.com/lhh/obliterate" target="_blank">http://people.redhat.com/lhh/obliterate</a> <u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">this is what I see in logs <u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: PingAck did not arrive in time.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: peer( Primary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) susp( 0 -> 1 )<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: asender terminated<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: Terminating asender thread<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: Connection closed<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: conn( NetworkFailure -> Unconnected )<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: receiver terminated<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: Restarting receiver thread<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: receiver (re)started<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: conn( Unconnected -> WFConnection )<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: helper command: /sbin/drbdadm fence-peer res0<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 fence_node[1912]: fence node2 failed<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: helper command: /sbin/drbdadm fence-peer res0 exit code 1 (0x100)<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: fence-peer helper broken, returned 1<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:48 node1 corosync[1346]: [TOTEM ] A processor failed, forming new configuration.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 corosync[1346]: [QUORUM] Members[1]: 1<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 corosync[1346]: [TOTEM ] A processor joined or left the membership and a new membership was formed.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 corosync[1346]: [CPG ] chosen downlist: sender r(0) ip(192.168.23.128) ; members(old:2 left:1)<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 corosync[1346]: [MAIN ] Completed service synchronization, ready to provide service.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 kernel: dlm: closing connection to node 2<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 fenced[1401]: fencing node node2<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 kernel: GFS2: fsid=cluster-setup:res0.0: jid=1: Trying to acquire journal lock...<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 fenced[1401]: fence node2 dev 0.0 agent fence_ack_manual result: error from agent<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 fenced[1401]: fence node2 failed<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:56 node1 fenced[1401]: fencing node node2<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:56 node1 fenced[1401]: fence node2 dev 0.0 agent fence_ack_manual result: error from agent<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:56 node1 fenced[1401]: fence node2 failed<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:59 node1 fenced[1401]: fencing node node2<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:59 node1 fenced[1401]: fence node2 dev 0.0 agent fence_ack_manual result: error from agent<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:59 node1 fenced[1401]: fence node2 failed<u></u><u></u></p>
</div>
</div>
<p class="MsoNormal"><br clear="all">
<u></u><u></u></p>
<div>
<p class="MsoNormal">Regards,<br>
Zohair Raza<u></u><u></u></p>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
</div>
</div></div></div>
</div>
</div>
<br>_______________________________________________<br>
drbd-user mailing list<br>
<a href="mailto:drbd-user@lists.linbit.com">drbd-user@lists.linbit.com</a><br>
<a href="http://lists.linbit.com/mailman/listinfo/drbd-user" target="_blank">http://lists.linbit.com/mailman/listinfo/drbd-user</a><br>
<br></blockquote></div><br></div>