<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META content="text/html; charset=us-ascii" http-equiv=Content-Type>
<META name=GENERATOR content="MSHTML 8.00.6001.19120"></HEAD>
<BODY>
<DIV><SPAN class=087551922-15082011><FONT size=2
face=Arial>Hi,</FONT></SPAN></DIV>
<DIV><SPAN class=087551922-15082011><FONT size=2
face=Arial></FONT></SPAN> </DIV>
<DIV><SPAN class=087551922-15082011><FONT size=2 face=Arial>We have two servers
built, different hardware and capacity, on 64 bit centos 5.6 with drbd 8.3.8,
the backing devices are gfs2 devices layer on top of LVM, because different
capacity on both nodes and that limits our ability to play with the barriers
effectively. The server has two Gb ports, one connect to the network and the
other connected with a crossover cable for dedicated drbd sync. We are running a
number of tests to see how well they perform before putting into production.
During the weeks long testing, one thing doesn't seem right and we've tried to
play with the config to tweak it, but not gaining anything, so we go back to
default for most settings as shown in the drbd show below. </FONT></SPAN></DIV>
<DIV><SPAN class=087551922-15082011><FONT size=2 face=Arial>It is a duo primary
setup on a 10TB device. The sync/resync operations between nodes, without other
activities on the drbd devices, is 63MB (sustained) on both way. However, adding
the write to either node during the sync can cause the both write and sync speed
to drop to 2KB. The situation can happen on any new write operation as
well, initially the writing, first 10 seconds, are at the
expected speed, then drbd sync kicks in, can see it from the /proc/drbd, it
slows to almost stall, once the sync stopped, observed from from both ifstat and
/proc/drbd, the writing to the node resume to expected speed, but once the sync
start again the writing slows down. The pattern repeats until the entire
write operation is complete. We have tested with small size files, the effect is
minimum so no problems in there, but we intent to use these serves to store
large files, which can be few GByte each. Initially, we though
that high IO might be the culprit, then we took drbd out of the picture and
just runs simultaneous read and write tests and they were fine with large and
small files. Now we think that drbd might need to locked the file to
perform the sync, during locked time the continues write stream on
the file is not permitted, then once the sync is done drbd releases it
for writing again. Then we tried tweaking the buffer and bio, no help. Of
course, this just a guess, but if it is true, is that any ways to tweak
drbd to perform better with big files.</FONT></SPAN></DIV>
<DIV><SPAN class=087551922-15082011><FONT size=2 face=Arial>We also tried
swapping a cross over with a straight cable no improvements,
</FONT></SPAN></DIV>
<DIV><SPAN class=087551922-15082011><FONT size=2
face=Arial></FONT></SPAN> </DIV>
<DIV><SPAN class=087551922-15082011><FONT size=2 face=Arial>Thanks in
advance,</FONT></SPAN></DIV>
<DIV><SPAN class=087551922-15082011><FONT size=2
face=Arial>Dennis</FONT></SPAN></DIV>
<DIV><SPAN class=087551922-15082011><FONT size=2
face=Arial>###################drbdsetup /dev/drbd0 show
##############</FONT></SPAN></DIV>
<DIV><SPAN class=087551922-15082011><FONT size=2 face=Arial>disk
{<BR>
size
0s _is_default; # bytes<BR>
on-io-error
detach;<BR>
fencing
dont-care _is_default;<BR>
max-bio-bvecs 0
_is_default;<BR>}<BR>net {<BR>
timeout
60 _is_default; # 1/10 seconds<BR>
max-epoch-size 2048
_is_default;<BR>
max-buffers
2048 _is_default;<BR>
unplug-watermark
256;<BR>
connect-int
10 _is_default; # seconds<BR>
ping-int
10 _is_default; # seconds<BR>
sndbuf-size
0 _is_default; # bytes<BR>
rcvbuf-size
0 _is_default; # bytes<BR>
ko-count
0 _is_default;<BR>
allow-two-primaries;<BR>
cram-hmac-alg
"sha1";<BR>
shared-secret "<A
href="mailto:U$eP@sswd">U$eP@sswd</A>";<BR>
after-sb-0pri
discard-zero-changes;<BR>
after-sb-1pri
discard-secondary;<BR>
after-sb-2pri
disconnect _is_default;<BR>
rr-conflict
disconnect _is_default;<BR>
ping-timeout 5
_is_default; # 1/10 seconds<BR>}<BR>syncer
{<BR>
rate
112640k; # bytes/second<BR>
after
-1 _is_default;<BR>
al-extents
257;<BR>
delay-probe-volume 16384k _is_default; #
bytes<BR>
delay-probe-interval 5 _is_default; # 1/10
seconds<BR>
throttle-threshold 20 _is_default; # 1/10
seconds<BR>
hold-off-threshold 100 _is_default; # 1/10
seconds<BR>}<BR>protocol C;<BR>_this_host
{<BR>
device
minor 0;<BR>
disk
"/dev/mapper/vg0-r0";<BR>
meta-disk
internal;<BR>
address
ipv4 10.1.1.35:7788;<BR>}<BR>_remote_host
{<BR>
address
ipv4
10.1.1.29:7788;<BR>}<BR>##############################</FONT></SPAN></DIV><P><FONT color=#0000ff size=2 face=Tahoma>Hemlock Printers, Ltd.<BR>(604) 439-5075</FONT></P><A title="gfidisc.c8ee20078354b54fb5fd4f35cb32bcdd" href="#"> </A></BODY></HTML>