[DRBD-user] fence-peer

cesar brain at click.com.py
Sat Jun 29 06:41:20 CEST 2013

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


Hi Kaloyan Kovachev

I would humbly ask for your help.
I will be very grateful if you can help me 

I have Proxmox VE for HA of VMs (based on KVM) + LVM2 + DRBD 8.4.2. (NIC to
NIC)
And have 1 VM in HA that uses 2 DRBD resources, ie for the VM  are 2 virtual
disks.

My problem is that i lost connection with DRBD:

version: 8.4.2 (api:1/proto:86-101)
GIT-hash: 7ad5f850d711223713d6dcadc3dd48860321070c build by root at kvm5,
2013-06-16 13:44:51
 0: cs:StandAlone ro:Primary/Unknown ds:UpToDate/DUnknown   r-----
    ns:12443007 nr:0 dw:46472892 dr:1719821020 al:3096 bm:1408 lo:1 pe:0
ua:0 ap:1 ep:1 wo:f oos:3319816
 1: cs:StandAlone ro:Primary/Unknown ds:UpToDate/DUnknown   r-----
    ns:4764396 nr:0 dw:3791840 dr:1057033682 al:1164 bm:336 lo:0 pe:0 ua:0
ap:0 ep:1 wo:f oos:1783620

I think that may be for 2 causes:
Cause 1: The NIC is Realtek, as soon as I can i will change it for Intel, 
Cause 2: I use the directive net data-integrity-alg md5; (I would not want
remove it) that can have problems with my PC ASUS P8H77-M PRO , also I don't
use raid.

*And Mr. Lars Ellenberg tell me that:*
"With special purpose built fencing handlers,
we may be able to fix your setup so it will freeze IO during the
disconnected period, reconnect, and replay pending buffers,
without any reset."

But for now i like to know how apply this change?,
And since I am not an experienced in DRBD, I wish someone would show me how
it should be finally the
global_common.conf   or  *.res configuration files.

Note: My Nodes have keys for ssh comunication

This is my actual configuration with DRBD 8.4.2 Version:
global_common.conf file:

global {
        usage-count yes;
        # minor-count dialog-refresh disable-ip-verification
}

common {
        protocol C;

        handlers {
                pri-on-incon-degr
"/usr/lib/drbd/notify-pri-on-incon-degr.sh;
/usr/lib/drbd/notify-emergency-reboot.sh; echo b > /proc/sysrq-trigger ;
reboot -f";
                pri-lost-after-sb
"/usr/lib/drbd/notify-pri-lost-after-sb.sh;
/usr/lib/drbd/notify-emergency-reboot.sh; echo b > /proc/sysrq-trigger ;
reboot -f";
                local-io-error "/usr/lib/drbd/notify-io-error.sh;
/usr/lib/drbd/notify-emergency-shutdown.sh; echo o > /proc/sysrq-trigger ;
halt -f";
                # fence-peer "/usr/lib/drbd/crm-fence-peer.sh";
                split-brain "/usr/lib/drbd/notify-split-brain.sh
some-user at my-domain.com";
                out-of-sync "/usr/lib/drbd/notify-out-of-sync.sh
some-user at my-domain.com";
                # before-resync-target
"/usr/lib/drbd/snapshot-resync-target-lvm.sh -p 15 -- -c 16k";
                # after-resync-target
/usr/lib/drbd/unsnapshot-resync-target-lvm.sh;
        }

        startup {
                # wfc-timeout degr-wfc-timeout outdated-wfc-timeout
wait-after-sb
                wfc-timeout 30; degr-wfc-timeout 20; outdated-wfc-timeout
15;
        }

        options {
                # cpu-mask on-no-data-accessible
                cpu-mask 0;
        }

        disk {
                # size max-bio-bvecs on-io-error fencing disk-barrier
disk-flushes
                # disk-drain md-flushes resync-rate resync-after al-extents
                # c-plan-ahead c-delay-target c-fill-target c-max-rate
                # c-min-rate disk-timeout
                on-io-error detach; al-extents 3389; resync-rate 75M;
        }

        net {
                # protocol timeout max-epoch-size max-buffers
unplug-watermark
                # connect-int ping-int sndbuf-size rcvbuf-size ko-count
                # allow-two-primaries cram-hmac-alg shared-secret
after-sb-0pri
                # after-sb-1pri after-sb-2pri always-asbp rr-conflict
                # ping-timeout data-integrity-alg tcp-cork on-congestion
                # congestion-fill congestion-extents csums-alg verify-alg
                # use-rle
                sndbuf-size 0; no-tcp-cork; unplug-watermark 16; max-buffers
8000; max-epoch-size 8000;
                data-integrity-alg md5;
                verify-alg sha1;
        }
}

r0.res file:
resource r0 {

  protocol C;

  startup {
    #wfc-timeout  15;
    #degr-wfc-timeout 60;
    become-primary-on both;
  }

  net {
    #cram-hmac-alg sha1;
    #shared-secret "my-secret";
    allow-two-primaries;
    after-sb-0pri discard-zero-changes;
    after-sb-1pri discard-secondary;
    after-sb-2pri disconnect;
  }

  on kvm5 {
    device /dev/drbd0;
    disk /dev/sda3;
    address 10.2.2.50:7788;
    meta-disk internal;
  }

  on kvm6 {
    device /dev/drbd0;
    disk /dev/sda3;
    address 10.2.2.51:7788;
    meta-disk internal;
  }
}

r1.res file:
resource r1 {
  protocol C;
  startup {
    #wfc-timeout  15;
    #degr-wfc-timeout 60;
    become-primary-on both;
  }
  net {
    #cram-hmac-alg sha1;
    #shared-secret "my-secret";
    allow-two-primaries;
    after-sb-0pri discard-zero-changes;
    after-sb-1pri discard-secondary;
    after-sb-2pri disconnect;
  }
  on kvm5 {
    device /dev/drbd1;
    disk /dev/sdb3;
    address 10.2.2.50:7789;
    meta-disk internal;
  }
  on kvm6 {
    device /dev/drbd1;
    disk /dev/sdb3;
    address 10.2.2.51:7789;
    meta-disk internal;
  }
}

Best regards
Cesar 





--
View this message in context: http://drbd.10923.n7.nabble.com/fence-peer-tp3298p17982.html
Sent from the DRBD - User mailing list archive at Nabble.com.



More information about the drbd-user mailing list