Hello,
I am experimenting with DRBD and it is not stable enough to be usable. I saw
this in the dmesg log:
block drbd1: md_sync_timer expired! Worker calls drbd_md_sync().
After the first restart it works for a while, and then all of a sudden cat
/proc/drbd shows ProtocolError and the system hangs (mysql and any other
process that reads from or writes to the DRBD partitions).
It is repeatable, and when it happens the network is not busy, the machine
load is nearly 0, and all other network connectivity is normal.
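To catch the moment the connection state changes, /proc/drbd could be polled by a small script. The following is only a rough sketch: the sample text is hypothetical (it is not copied from the affected machines), it assumes the usual 8.x layout where each device line starts with the minor number followed by a `cs:` field, and the set of states treated as healthy is my own choice.

```python
import re

# Matches per-device status lines such as
# " 1: cs:ProtocolError ro:Primary/Unknown ds:UpToDate/DUnknown C r-----"
STATUS_LINE = re.compile(r"^\s*(\d+):\s+cs:(\S+)")


def connection_states(proc_drbd_text):
    """Return {minor: connection state} parsed from /proc/drbd text."""
    states = {}
    for line in proc_drbd_text.splitlines():
        match = STATUS_LINE.match(line)
        if match:
            states[int(match.group(1))] = match.group(2)
    return states


def unhealthy(states, healthy=("Connected", "SyncSource", "SyncTarget")):
    """Return the minors whose connection state is not in the healthy set."""
    return {minor: cs for minor, cs in states.items() if cs not in healthy}


# Hypothetical sample, assumed to resemble /proc/drbd on DRBD 8.4.
SAMPLE = """version: 8.4.0 (api:1/proto:86-100)
 0: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:0
 1: cs:ProtocolError ro:Primary/Unknown ds:UpToDate/DUnknown C r-----
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:0
"""

if __name__ == "__main__":
    print(unhealthy(connection_states(SAMPLE)))
```

On a real node the text would come from reading /proc/drbd in a loop, with a timestamp logged whenever the result is non-empty, so the event can be lined up against dmesg.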
Googling shows that many users have the same problem, and one suggested
lowering the resync and sync rates. I did that (for 100Mbit Ethernet I set
resync to 3M and the syncer rate to 40M; I set up two volumes). The problem persists.
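As a sanity check on those numbers, the link capacity can be worked out in the same units. This sketch assumes that the "M" suffix in the DRBD rate settings means MiB/s and uses the commonly cited rule of thumb from the DRBD User's Guide of about 30% of the available replication bandwidth; the function names are made up for illustration.

```python
def link_capacity_mib_per_s(link_mbit_per_s):
    """Convert a nominal link speed in Mbit/s to MiB/s (ignores protocol overhead)."""
    bytes_per_s = link_mbit_per_s * 1_000_000 / 8
    return bytes_per_s / (1024 * 1024)


def suggested_resync_rate(link_mbit_per_s, fraction=0.3):
    """Rule-of-thumb resync rate: a fraction of the raw link capacity, in MiB/s."""
    return fraction * link_capacity_mib_per_s(link_mbit_per_s)


if __name__ == "__main__":
    capacity = link_capacity_mib_per_s(100)
    print(f"100 Mbit/s link: about {capacity:.1f} MiB/s")
    print(f"30% rule of thumb: about {suggested_resync_rate(100):.1f} MiB/s")
    for configured in (3, 40):  # the two rates mentioned above
        verdict = "exceeds" if configured > capacity else "fits within"
        print(f"rate {configured}M {verdict} the link capacity")
```

Under these assumptions a 100Mbit link carries roughly 11.9 MiB/s, so 3M is in line with the rule of thumb while 40M is more than the link can deliver.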
Here is the short description of the system:
* CentOS 6 x86_64
* Kernel 2.6.32.43-vs2.3.0.36.29.8-h1-32cpu-noselinux, which is the vanilla
kernel 2.6.32.43 with the vserver patch vs2.3.0.36.29.8, compiled with HZ = 100
and SMP for 32 CPUs
* DRBD compiled from source, version 8.4.0 (including the kernel module)
* DRBD is built on top of LVM; here is the config:
resource r0 {
    on cosmos {
        volume 0 {
            #device minor 0;
            device /dev/drbd0;
            meta-disk internal;
            disk /dev/vs-resource1/mysqldata;
        }
        volume 1 {
            device /dev/drbd1;
            meta-disk internal;
            disk /dev/vs-resource1/pgsqldata;
        }
        address 10.200.11.4:7789;
    }
    on seaspray {
        volume 0 {
            # device minor 0;
            device /dev/drbd0;
            meta-disk internal;
            disk /dev/vg_seaspray/mysqldata;
        }
        volume 1 {
            device /dev/drbd1;
            meta-disk internal;
            disk /dev/vg_seaspray/pgsqldata;
        }
        address 10.200.11.3:7789;
    }
    startup {
        #become-primary-on both;
    }
    net {
        #allow-two-primaries;
        protocol C;
        after-sb-0pri discard-zero-changes;
        after-sb-1pri discard-secondary;
        after-sb-2pri disconnect;
        #cram-hmac-alg sha1;
        #shared-secret "FooFunFactory";
    }
}
* DRBD runs in Primary/Secondary mode for now. The device is mounted into a
vserver instance, and mysql and postgres run from the vserver
* iptables is set up to allow DRBD traffic; the problem happens even when iptables is off
* Network route
route
Kernel IP routing table
Destination     Gateway         Genmask         Flags Metric Ref    Use Iface
10.200.11.0     *               255.255.255.224 U     0      0        0 eth0
10.200.11.128   *               255.255.255.192 U     0      0        0 eth1.503
192.168.100.0   *               255.255.255.0   U     0      0        0 dummy0
1.1.1.0         *               255.255.255.0   U     0      0        0 vmbr0
link-local      *               255.255.0.0     U     1002   0        0 eth0
link-local      *               255.255.0.0     U     1003   0        0 eth1
link-local      *               255.255.0.0     U     1004   0        0 eth1.503
default         10.200.11.1     0.0.0.0         UG    0      0        0 eth0
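To rule out a routing mix-up, the two DRBD peer addresses from the config can be checked against the eth0 route above. This is a small sketch using Python's standard ipaddress module; the helper name on_network is just for illustration.

```python
import ipaddress

# The eth0 route from the table above: 10.200.11.0 with mask 255.255.255.224.
replication_net = ipaddress.ip_network("10.200.11.0/255.255.255.224")

# Peer addresses taken from the "address" lines of resource r0.
peers = {"cosmos": "10.200.11.4", "seaspray": "10.200.11.3"}


def on_network(address, network):
    """Return True if the address falls inside the given network."""
    return ipaddress.ip_address(address) in network


if __name__ == "__main__":
    print("prefix length:", replication_net.prefixlen)
    for host, addr in peers.items():
        print(host, addr, "on eth0 subnet:", on_network(addr, replication_net))
```

Both addresses fall inside 10.200.11.0/27, so DRBD traffic between the nodes should go out directly via eth0 rather than through the default gateway.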
I have attached the dmesg output as well in case it helps with debugging. I
would like to get this fixed, so please help.
Many thanks,
--
Steve Kieu
-------------- next part --------------
A non-text attachment was scrubbed...
Name: dmesg-drbd-error.gz
Type: application/x-gzip
Size: 22562 bytes
Desc: not available
URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20111116/c649e398/attachment.bin>