[DRBD-user] Digest mismatch resulting in "split brain" after (!) automatic reconnect

Lars Ellenberg lars.ellenberg at linbit.com
Mon Feb 21 14:18:30 CET 2011

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


On Mon, Feb 21, 2011 at 01:21:11PM +0100, Raoul Bhatia [IPAX] wrote:
> hi,
> 
> 
> On 02/21/2011 10:36 AM, Lars Ellenberg wrote:
> > Fix your fence-peer helper,
> > that may be the cause of trouble there.
> 
> which actuall is 'your' fence-peer helper, right? :)

Is it.
Well, then fix it, anyways.
Or maybe it does not need fixing after all.

> thus, basically coming back to [1] where florian asks:
> > Look at your paste. You have no node where DRBD is Secondary. What do
> > you expect the agent to do? 
> 
> (i know, i talked about the agent in this email. but the the agent and
> crm-fence-peer.sh are closely tied, aren't they?)

Not that much.  But I got the impression that you are mixing several
issues in those quoted threads.

> looking at crm-fence-peer.sh's source, i see:
> >         Secondary|Primary)
> >                 # WTF? We are supposed to fence the peer,
> >                 # but the replication link is just fine?
> >                 echo WARNING "peer is $DRBD_peer, did not place the constraint!"
> >                 rc=0
> >                 return
> >                 ;;
> >         esac
> 
> so, this should actually be obsoleted by fixing the following bug,
> right?

possibly.

> on the other hand, what's wrong in trying to disconnect and reconnect
> the resources and see what happens? (e.g. via a tiny contraint that is
> only valid for PT1M?

Nothing?
Everything?
I don't know.
You tell me what is wrong.

> > Feb 16 06:25:04 c02n01 kernel: [3687390.947555] block drbd1: pdsk( UpToDate -> DUnknown )
> > 
> > This should not have happened, either:
> > We must not change the pdsk state to DUnknown while keeping conn state at Connected.
> > That's nonsense.
> > 
> > Feb 16 06:25:04 c02n01 kernel: [3687390.947633] block drbd1: new current UUID 89084B22FE454C03:3C1DADF6B38C1AD7:E7E50184F3F3AC0B:E7E40184F3F3AC0B 
> 
> please let me know if you need any further input from my side.

Only if it is easily reproducible, and if so, how.
Sorry, if you wrote that somewhere already, I missed it.
Just write it again.

-- 
: Lars Ellenberg
: LINBIT | Your Way to High Availability
: DRBD/HA support and consulting http://www.linbit.com

DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.
__
please don't Cc me, but send to list   --   I'm subscribed



More information about the drbd-user mailing list