<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=us-ascii">
<meta name="Generator" content="Microsoft Word 12 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
        {font-family:Tahoma;
        panose-1:2 11 6 4 3 5 4 4 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0cm;
        margin-bottom:.0001pt;
        font-size:12.0pt;
        font-family:"Times New Roman","serif";}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
        {mso-style-priority:99;
        color:purple;
        text-decoration:underline;}
span.E-mailStijl17
        {mso-style-type:personal-reply;
        font-family:"Calibri","sans-serif";
        color:#1F497D;}
.MsoChpDefault
        {mso-style-type:export-only;}
@page WordSection1
        {size:612.0pt 792.0pt;
        margin:70.85pt 70.85pt 70.85pt 70.85pt;}
div.WordSection1
        {page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="NL" link="blue" vlink="purple">
<div class="WordSection1">
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">Hello,<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">When one node unexpectedly shuts down, dlm locks down until quorum is regained AND the faulty node is fenced, before it can take over the cluster
resources.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">I assume that you have set the “two_node” flag in cluster.conf<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">>Oct 29 08:05:59 node1 fenced[1401]: fence node2 dev 0.0 agent fence_ack_manual result: error from agent<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">Oct 29 08:05:59 node1 fenced[1401]: fence node2 failed<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">I think that adding the following option to the dlm section in cluster.conf<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">enable_fencing="0"<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">might solve this problem. (but I have not tested this) This will disable fencing.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">Or you can setup fencing.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">Best regards,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">Maurits van de Lande
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"><b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">Van:</span></b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""> drbd-user-bounces@lists.linbit.com [mailto:drbd-user-bounces@lists.linbit.com]
<b>Namens </b>Zohair Raza<br>
<b>Verzonden:</b> maandag 29 oktober 2012 11:03<br>
<b>Aan:</b> drbd-user@lists.linbit.com<br>
<b>Onderwerp:</b> [DRBD-user] GFS2 freezes<o:p></o:p></span></p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div>
<p class="MsoNormal">Hi, <o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">I have setup a Primary/Primary cluster with GFS2.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">All works good if I shut down any node regularly, but when I unplug power of any node, GFS freezes and I can not access the device. <o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">Tried to use <a href="http://people.redhat.com/lhh/obliterate">http://people.redhat.com/lhh/obliterate</a> <o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">this is what I see in logs <o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: PingAck did not arrive in time.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: peer( Primary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) susp( 0 -> 1 )<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: asender terminated<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: Terminating asender thread<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: Connection closed<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: conn( NetworkFailure -> Unconnected )<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: receiver terminated<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: Restarting receiver thread<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: receiver (re)started<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: conn( Unconnected -> WFConnection )<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: helper command: /sbin/drbdadm fence-peer res0<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 fence_node[1912]: fence node2 failed<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: helper command: /sbin/drbdadm fence-peer res0 exit code 1 (0x100)<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:41 node1 kernel: d-con res0: fence-peer helper broken, returned 1<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:48 node1 corosync[1346]: [TOTEM ] A processor failed, forming new configuration.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 corosync[1346]: [QUORUM] Members[1]: 1<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 corosync[1346]: [TOTEM ] A processor joined or left the membership and a new membership was formed.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 corosync[1346]: [CPG ] chosen downlist: sender r(0) ip(192.168.23.128) ; members(old:2 left:1)<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 corosync[1346]: [MAIN ] Completed service synchronization, ready to provide service.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 kernel: dlm: closing connection to node 2<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 fenced[1401]: fencing node node2<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 kernel: GFS2: fsid=cluster-setup:res0.0: jid=1: Trying to acquire journal lock...<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 fenced[1401]: fence node2 dev 0.0 agent fence_ack_manual result: error from agent<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:53 node1 fenced[1401]: fence node2 failed<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:56 node1 fenced[1401]: fencing node node2<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:56 node1 fenced[1401]: fence node2 dev 0.0 agent fence_ack_manual result: error from agent<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:56 node1 fenced[1401]: fence node2 failed<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:59 node1 fenced[1401]: fencing node node2<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:59 node1 fenced[1401]: fence node2 dev 0.0 agent fence_ack_manual result: error from agent<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Oct 29 08:05:59 node1 fenced[1401]: fence node2 failed<o:p></o:p></p>
</div>
</div>
<p class="MsoNormal"><br clear="all">
<o:p></o:p></p>
<div>
<p class="MsoNormal">Regards,<br>
Zohair Raza<o:p></o:p></p>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
</div>
</div>
</div>
</body>
</html>