[DRBD-user] drbdsetup primary with timeout [was: To stonith or not to stonith?]

Lars Ellenberg Lars.Ellenberg at linbit.com
Fri Sep 9 00:06:31 CEST 2005

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


/ 2005-09-08 11:39:28 -0500
\ Dave Dykstra:
> [ Even though Alan Cc'ed his message to drbd-user I haven't seen it
>   show up there yet. ]

well, I have.

> > Regarding DRBD not taking over when it hasn't declared the other node 
> > dead, I would think that a good solution might be to have DRBD wait up 
> > to "drbd deadtime" seconds before giving up.
> > 
> > Since Heartbeat happily has no clue about DRBD (or its internal 
> > deadtime), it would seem to be best dealt with by DRBD.
> > 
> > -- 
> >     Alan Robertson <alanr at unix.sh>
> 
> That sounds like a reasonable solution to me.
> 
> In fact, I already see code to try a 'drbdadm primary' command 6
> times with a one second sleep between each try in the 'start' case of
> /etc/heartbeat/resource.d/drbddisk, and a comment saying that it is "in
> case heartbeat deadtime was smaller than drbd ping time".   This is a
> different situation than the comment, in that heartbeat forcibly knocked
> down the remote side and immediately took over, so the timeout probably
> starts ticking when the first 'drbdadm primary' command is executed.
> I'm using the default drbd "timeout" time of six seconds, so presumably
> on the 7th or 8th try it would work.
> 
> I think that doing multiple tries in the drbddisk command is a hack,
> though, especially since it doesn't take into account any change in
> the "timeout" parameter that there may be in drbd.conf. I think the
> 'drbdsetup primary' command (possibly with a new option that drbddisk
> invokes) should try to contact the remote side and wait until there is
> either a positive response or a timeout before it exits with an error.

what is there is a "hack".

it is a misconfiguration, when heartbeat deadtime was
smaller than drbd ping time, though.

still it could be desirable to have an option like that outlined above,
 "drbdsetup /dev/drbd0 primary --I-think-peer-is-dead", and this option
would typically be used by the heartbeat resource script/agent.

this will probably be implemented in 0.8 ...

-- 
: Lars Ellenberg                                  Tel +43-1-8178292-0  :
: LINBIT Information Technologies GmbH            Fax +43-1-8178292-82 :
: Schoenbrunner Str. 244, A-1120 Vienna/Europe   http://www.linbit.com :
__
please use the "List-Reply" function of your email client.



More information about the drbd-user mailing list