Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Florian and Gordan,
Many thanks for your reply, help me so much!
On Thu, 2008-06-12 at 13:09 +0100, drbd at bobich.net wrote:
> Therein lies your problem. You need a proper fencing device for GFS
> operation to transparently continue. If you are using fence_manual,
you
> have to run fence_ack_manual manually on the remaining node (see man
page
> for details) to get it to allow access to GFS again.
I did this and so I got access again to FS, many thanks! Now I'm
studying one better way to fence my poor nodes (RSA/DRAC).
When I tried to put my node2 again on cluster (I've dropped all using
iptables), I got:
Jun 12 11:33:58 hotsite-bsb-la-2 openais[3062]: [MAIN ] Killing node
drdb_hotsite-1 because it has rejoined the cluster with existing state
Jun 12 11:33:58 hotsite-bsb-la-2 openais[3062]: [CMAN ] cman killed by
node 1 because we rejoined the cluster without a full restart
OK. I've tried to restart and lost the machine:
echo o > /proc/sysrq-trigger ; halt -f
Jun 12 11:35:14 hotsite-bsb-la-1 kernel: drbd0: old =
{ cs:NetworkFailure st:Primary/Unknown ds:UpToDate/DUnknown s--- }
Jun 12 11:35:14 hotsite-bsb-la-1 kernel: drbd0: new = { cs:Unconnected
st:Primary/Unknown ds:UpToDate/DUnknown s--- }
Jun 12 11:35:14 hotsite-bsb-la-1 kernel: drbd0: conn( NetworkFailure ->
Unconnected )
Jun 12 11:35:14 hotsite-bsb-la-1 kernel: drbd0: receiver terminated
Jun 12 11:35:14 hotsite-bsb-la-1 kernel: drbd0: receiver (re)started
Jun 12 11:35:14 hotsite-bsb-la-1 kernel: drbd0: Considering state change
from bad state. Error would be: 'Refusing to be Primary while peer is
not outdated'
Jun 12 11:35:14 hotsite-bsb-la-1 kernel: drbd0: old = { cs:Unconnected
st:Primary/Unknown ds:UpToDate/DUnknown s--- }
Jun 12 11:35:14 hotsite-bsb-la-1 kernel: drbd0: new = { cs:WFConnection
st:Primary/Unknown ds:UpToDate/DUnknown s--- }
Jun 12 11:35:14 hotsite-bsb-la-1 kernel: drbd0: conn( Unconnected ->
WFConnection
So... how can I prevent this? "Refusing to be Primary while peer is not
outdated".
I wanna to mount/use my FS when the second peer its crashed and your
status saw unknown :)
Is it possible?
Many thanks again!
--
Tiago Cruz
http://everlinux.com
Linux User #282636