[Fwd: [DRBD-user] heartbeat and drbd / Failover / Failback]

Rois Cannon rois at cobiz.com
Tue Dec 4 01:55:03 CET 2007

I'm sure I'm just missing something, probably a timing issue.  I added
the lines to drbd.conf and ha.cf per the instructions on your
blog (see below for the full files).  I brought up the system and made
sure node1 was correctly primary and node2 secondary.  If I run "halt"
on node1 or restart heartbeat there, node2 is correctly promoted to
primary.  If I pull the plug on node1, however, node2 is set to
outdated, so heartbeat can't promote it.  Can you tell me what I'm
missing?  FYI, in case it makes a difference, I'm running this in two
VMServer instances as a test bed.
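
For anyone reproducing this, the disk state on node2 can be read from /proc/drbd after the plug is pulled. The following is a minimal sketch only: it assumes the DRBD 8.0 status line layout (cs:, st:, ds: fields), the sample line is illustrative rather than output from the systems described here, and the function names are hypothetical.

```python
import re

# Assumed DRBD 8.0-style status line, for example:
#  0: cs:WFConnection st:Secondary/Unknown ds:Outdated/DUnknown C r---
STATUS_RE = re.compile(
    r"^\s*(?P<minor>\d+):\s+cs:(?P<cs>\S+)"
    r"\s+st:(?P<st_local>[^/\s]+)/(?P<st_peer>\S+)"
    r"\s+ds:(?P<ds_local>[^/\s]+)/(?P<ds_peer>\S+)"
)


def parse_status_line(line):
    """Return the fields of one device status line, or None if it is not one."""
    match = STATUS_RE.match(line)
    if match is None:
        return None
    fields = match.groupdict()
    fields["minor"] = int(fields["minor"])
    return fields


def is_outdated(status):
    """True if the local disk state is reported as Outdated."""
    return status["ds_local"] == "Outdated"


if __name__ == "__main__":
    # Illustrative line only; on a real node, iterate over open("/proc/drbd").
    sample = " 0: cs:WFConnection st:Secondary/Unknown ds:Outdated/DUnknown C r---"
    status = parse_status_line(sample)
    print(status["cs"], status["st_local"], status["ds_local"], is_outdated(status))
```

Running the same check right after a clean "halt" and right after a power-off would show whether the Outdated state appears only in the second case.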

The sample drbd.conf shows:
#  "resource-only" -> Call the outdate-peer handler if we are primary and
#                     lose the connection to the secondary. As well
#                     when an unconnected secondary wants to become
#                     primary.

If I power off node1, how is node2 getting the command to outdate drbd?


I have two nodes, each with two NICs and a serial link, in a
primary/standby setup.

    /----- eth0 ------\
node1 --- Serial ---- node2
    \----- eth1 ------/

node1 eth0
node1 eth1

node2 eth0
node2 eth1

The virtual IP is on eth0.
I'm using Mandriva 2008.0 with DRBD 8.0.6 and heartbeat 2.0.8.

Here is my ha.cf:
auto_failback off
logfacility     local0
debugfile /var/log/ha-debug
keepalive 2
deadtime 10
deadping 6
initdead 30
baud 460800
serial /dev/ttyS0
ucast eth0
ucast eth1
node svr91 svr92
respawn hacluster /usr/lib/heartbeat/ipfail
respawn hacluster /usr/lib/heartbeat/dopd
apiauth dopd gid=haclient uid=hacluster
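
Since timing is one suspect, the heartbeat timers above can be sanity-checked with a few lines of Python. This is a rough sketch, not a diagnosis: the helper names are hypothetical, and the only rule applied is the guideline from the ha.cf documentation that initdead should be at least twice deadtime.

```python
def to_seconds(value):
    """Convert an ha.cf time value ("10" or "500ms") to seconds."""
    if value.endswith("ms"):
        return float(value[:-2]) / 1000.0
    return float(value)


def parse_timings(ha_cf_text):
    """Pick the keepalive, deadtime and initdead directives out of ha.cf text."""
    wanted = ("keepalive", "deadtime", "initdead")
    timings = {}
    for raw in ha_cf_text.splitlines():
        fields = raw.split("#", 1)[0].split()  # drop trailing comments
        if len(fields) == 2 and fields[0] in wanted:
            timings[fields[0]] = to_seconds(fields[1])
    return timings


def summarize(timings):
    """Relate the three timers to each other."""
    return {
        # How many keepalive intervals fit into one deadtime window.
        "keepalive_intervals_per_deadtime": timings["deadtime"] / timings["keepalive"],
        # Guideline from the ha.cf documentation.
        "initdead_at_least_twice_deadtime": timings["initdead"] >= 2 * timings["deadtime"],
    }


# The three timer values from the ha.cf in this message.
HA_CF = """keepalive 2
deadtime 10
initdead 30
"""

if __name__ == "__main__":
    print(summarize(parse_timings(HA_CF)))
```

With the values shown, a silent node is declared dead after five keepalive intervals, and initdead satisfies the guideline, so these three settings look internally consistent.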

Here is my drbd.conf:
global {
    usage-count no;
}
common {
  handlers {
    pri-on-incon-degr "echo o > /proc/sysrq-trigger ; /usr/bin/halt -p";
    pri-lost-after-sb "echo o > /proc/sysrq-trigger ; /usr/bin/halt -p";
    local-io-error "echo o > /proc/sysrq-trigger ; /usr/bin/halt -p";
    outdate-peer "/usr/lib/heartbeat/drbd-peer-outdater";
  }
  startup {
    degr-wfc-timeout 120;    # 2 minutes.
  }
  disk {
    on-io-error   detach;
    fencing resource-only;
  }
  net {
    cram-hmac-alg "[..removed..]";
    shared-secret "[..removed..]";
    after-sb-0pri disconnect;
    after-sb-1pri disconnect;
    after-sb-2pri disconnect;
    rr-conflict disconnect;
  }
  syncer {
    rate 10M;
    al-extents 257;
  }
}
resource home {
  protocol C;
  on svr91 {
    device     /dev/drbd0;
    disk       /dev/vg0/home;
    meta-disk  internal;
  }
  on svr92 {
    device     /dev/drbd0;
    disk       /dev/vg0/home;
    meta-disk  internal;
  }
}

Here is my haresources:
svr91 IPaddr:: drbddisk::home
