Hello,<br><br>I am experimenting with DRBD and it is not stable enough to be usable. I saw this in the dmesg log:<br><br>block drbd1: md_sync_timer expired! Worker calls drbd_md_sync().<br><br>After the first restart it works for a while, and then all of a sudden cat /proc/drbd shows ProtocolError and the system hangs (mysql or any other process that reads from or writes to the DRBD partitions).<br>
<br>It is repeatable, and when it happens the network is not busy, the machine load is nearly 0 and all other network connectivity is normal.<br><br>Googling shows that many users have the same problem, and one suggested lowering the resync and sync rates. I did that (for 100Mbit Ethernet I set resync to 3M and the syncer rate to 40M; I set up two volumes). The problem persists.<br>
<br><br>Here is a short description of the system:<br><br>* CentOS 6 x86_64<br>* Kernel 2.6.32.43-vs2.3.0.36.29.8-h1-32cpu-noselinux, which is vanilla kernel 2.6.32.43 with the vserver patch vs2.3.0.36.29.8, compiled with HZ = 100 and SMP for 32 CPUs<br>
* DRBD compiled from source, version 8.4.0 (including kernel module)<br>* DRBD is built on top of LVM; here is the config:<br><br>resource r0 {<br><br> on cosmos {<br> volume 0 {<br> #device minor 0;<br>
device /dev/drbd0;<br> meta-disk internal;<br> disk /dev/vs-resource1/mysqldata;<br> }<br><br> volume 1 {<br> device /dev/drbd1;<br>
meta-disk internal;<br> disk /dev/vs-resource1/pgsqldata;<br> }<br><br> address 10.200.11.4:7789;<br> }<br>
<br> on seaspray {<br> volume 0 {<br> # device minor 0;<br> device /dev/drbd0;<br> meta-disk internal;<br> disk /dev/vg_seaspray/mysqldata;<br>
}<br><br> volume 1 {<br> device /dev/drbd1;<br> meta-disk internal;<br> disk /dev/vg_seaspray/pgsqldata;<br> }<br>
<br> address 10.200.11.3:7789;<br> }<br><br> startup {<br> #become-primary-on both;<br><br> }<br> net {<br> #allow-two-primaries;<br>
protocol C;<br> after-sb-0pri discard-zero-changes;<br> after-sb-1pri discard-secondary;<br> after-sb-2pri disconnect;<br> #cram-hmac-alg sha1;<br>
#shared-secret "FooFunFactory";<br><br> }<br><br><br>}<br><br>* DRBD runs in Primary/Secondary mode for now. The device is mounted into a vserver instance, and MySQL and PostgreSQL run from the vserver.<br>
* iptables is set up to allow DRBD traffic; the problem happens even when iptables is off<br><br>* Network routes:<br>route<br>Kernel IP routing table<br>Destination Gateway Genmask Flags Metric Ref Use Iface<br>10.200.11.0 * 255.255.255.224 U 0 0 0 eth0<br>
10.200.11.128 * 255.255.255.192 U 0 0 0 eth1.503<br>192.168.100.0 * 255.255.255.0 U 0 0 0 dummy0<br>1.1.1.0 * 255.255.255.0 U 0 0 0 vmbr0<br>
link-local * 255.255.0.0 U 1002 0 0 eth0<br>link-local * 255.255.0.0 U 1003 0 0 eth1<br>link-local * 255.255.0.0 U 1004 0 0 eth1.503<br>
default 10.200.11.1 0.0.0.0 UG 0 0 0 eth0<br><br>I am attaching the dmesg output as well in case it helps with debugging. I would like to get this fixed, so any help is appreciated.<br><br>Many thanks,<br><br><br><br><br>
-- <br>Steve Kieu<br>