<div dir="ltr"><div>Hi, </div><div><br></div><div>I have setup a Primary/Primary cluster with GFS2.</div><div><br></div><div>All works good if I shut down any node regularly, but when I unplug power of any node, GFS freezes and I can not access the device. </div>
<div><br></div><div>Tried to use <a href="http://people.redhat.com/lhh/obliterate">http://people.redhat.com/lhh/obliterate</a> </div><div><br></div><div>this is what I see in logs </div><div><br></div><div><div>Oct 29 08:05:41 node1 kernel: d-con res0: PingAck did not arrive in time.</div>
<div>Oct 29 08:05:41 node1 kernel: d-con res0: peer( Primary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) susp( 0 -> 1 )</div><div>Oct 29 08:05:41 node1 kernel: d-con res0: asender terminated</div>
<div>Oct 29 08:05:41 node1 kernel: d-con res0: Terminating asender thread</div><div>Oct 29 08:05:41 node1 kernel: d-con res0: Connection closed</div><div>Oct 29 08:05:41 node1 kernel: d-con res0: conn( NetworkFailure -> Unconnected )</div>
<div>Oct 29 08:05:41 node1 kernel: d-con res0: receiver terminated</div><div>Oct 29 08:05:41 node1 kernel: d-con res0: Restarting receiver thread</div><div>Oct 29 08:05:41 node1 kernel: d-con res0: receiver (re)started</div>
<div>Oct 29 08:05:41 node1 kernel: d-con res0: conn( Unconnected -> WFConnection )</div><div>Oct 29 08:05:41 node1 kernel: d-con res0: helper command: /sbin/drbdadm fence-peer res0</div><div>Oct 29 08:05:41 node1 fence_node[1912]: fence node2 failed</div>
<div>Oct 29 08:05:41 node1 kernel: d-con res0: helper command: /sbin/drbdadm fence-peer res0 exit code 1 (0x100)</div><div>Oct 29 08:05:41 node1 kernel: d-con res0: fence-peer helper broken, returned 1</div><div>Oct 29 08:05:48 node1 corosync[1346]: [TOTEM ] A processor failed, forming new configuration.</div>
<div>Oct 29 08:05:53 node1 corosync[1346]: [QUORUM] Members[1]: 1</div><div>Oct 29 08:05:53 node1 corosync[1346]: [TOTEM ] A processor joined or left the membership and a new membership was formed.</div><div>Oct 29 08:05:53 node1 corosync[1346]: [CPG ] chosen downlist: sender r(0) ip(192.168.23.128) ; members(old:2 left:1)</div>
<div>Oct 29 08:05:53 node1 corosync[1346]: [MAIN ] Completed service synchronization, ready to provide service.</div><div>Oct 29 08:05:53 node1 kernel: dlm: closing connection to node 2</div><div>Oct 29 08:05:53 node1 fenced[1401]: fencing node node2</div>
<div>Oct 29 08:05:53 node1 kernel: GFS2: fsid=cluster-setup:res0.0: jid=1: Trying to acquire journal lock...</div><div>Oct 29 08:05:53 node1 fenced[1401]: fence node2 dev 0.0 agent fence_ack_manual result: error from agent</div>
<div>Oct 29 08:05:53 node1 fenced[1401]: fence node2 failed</div><div>Oct 29 08:05:56 node1 fenced[1401]: fencing node node2</div><div>Oct 29 08:05:56 node1 fenced[1401]: fence node2 dev 0.0 agent fence_ack_manual result: error from agent</div>
<div>Oct 29 08:05:56 node1 fenced[1401]: fence node2 failed</div><div>Oct 29 08:05:59 node1 fenced[1401]: fencing node node2</div><div>Oct 29 08:05:59 node1 fenced[1401]: fence node2 dev 0.0 agent fence_ack_manual result: error from agent</div>
<div>Oct 29 08:05:59 node1 fenced[1401]: fence node2 failed</div></div><br clear="all"><div dir="ltr">Regards,<br>Zohair Raza<div><br></div></div>
</div>