[DRBD-user] disconnecting hangs after ko-count failure

Lars Ellenberg lars.ellenberg at linbit.com
Tue Jan 22 17:42:28 CET 2008

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


On Mon, Jan 21, 2008 at 11:15:39AM +0100, Walter Haidinger wrote:
> > > How can I further diagnose this? Do you need more information, like the
> > > drbd.conf setup?
> > 
> > Well how did you guess that? ;-)
> 
> Don't know, just a wild guess.. ;-)
> 
> drbd.conf:
> global {
>     minor-count 16;
>     usage-count yes;
> }
> common {
>   syncer { rate 10M; }
> }
> resource west {
>   protocol C;
>   handlers {
>     pri-on-incon-degr "echo o > /proc/sysrq-trigger ; halt -f";
>     pri-lost-after-sb "echo o > /proc/sysrq-trigger ; halt -f";
>     local-io-error "echo o > /proc/sysrq-trigger ; halt -f";
>     outdate-peer "/usr/sbin/drbd-peer-outdater";
>   }
>   startup {
>     wfc-timeout 5;
>     degr-wfc-timeout 5;
>   }
>   disk {
>     on-io-error   detach;
>     fencing dont-care;
>   }
>   net {
>     timeout       100;    # < (connect-int|ping-int), (unit = 0.1 seconds)
>     connect-int   30;    # (unit = 1 second)
>     ping-int      60;    # (unit = 1 second)
>     ping-timeout  50;    # (unit = 0.1 seconds)

interessting timout settings, but drbd over openvpn over 11MBit WLAN
is strange enough in itself, so if it works for you...

>     max-buffers         256;

I seriously recommend a much higher setting here.  do at least 2000.
high latency "flaky" network with low max-buffers setting...
may even be the root cause of the symptoms you are seeing.


-- 
: Lars Ellenberg                           http://www.linbit.com :
: DRBD/HA support and consulting             sales at linbit.com :
: LINBIT Information Technologies GmbH      Tel +43-1-8178292-0  :
: Vivenotgasse 48, A-1120 Vienna/Europe     Fax +43-1-8178292-82 :
__
please use the "List-Reply" function of your email client.



More information about the drbd-user mailing list