Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
On Tuesday 21 October 2008 21:33:21 Robinson, Eric wrote: > Can someone help please me diagnose a DRBD slow sync problem? > > The servers have been syncing for 3-4 days and are now only 12.5% > sync'd. > > I'm getting conflicting results from benchmark tests. > > Summary: > > netperf is fast > iperf is fast > bonnie++ is fast > dd|nc is extremely slow > drbd sync is extremely slow > > Details: > > > Here's iperf... very fast... > > > [root at ha03 download]# iperf -t TCP_STREAM -c 2.1.1.2 > ------------------------------------------------------------ > Client connecting to 2.1.1.2, TCP port 5001 > TCP window size: 16.0 KByte (default) > ------------------------------------------------------------ > [ 3] local 2.1.1.1 port 59472 connected with 2.1.1.2 port 5001 > [ ID] Interval Transfer Bandwidth > [ 3] 0.0-127.4 sec 13.9 GBytes 939 Mbits/sec > > > > Now netperf... very fast... > > > > [root at ha03 download]# netperf -H 2.1.1.2 > TCP STREAM TEST from 0.0.0.0 (0.0.0.0) port 0 AF_INET to 2.1.1.2 > (2.1.1.2) port 0 AF_INET > Recv Send Send > Socket Socket Message Elapsed > Size Size Size Time Throughput > bytes bytes bytes secs. 10^6bits/sec > 87380 16384 16384 10.03 938.80 > > > > > bonnie++ on one server... fast... > > Version 1.03 ------Sequential Output------ --Sequential Input- > --Random- > -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- > --Seeks-- > Machine Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP > /sec %CP > ha03.mycharts.md 6G 68212 90 187783 31 91553 17 73379 90 271470 28 > 1099 1 > ------Sequential Create------ --------Random > Create-------- > -Create-- --Read--- -Delete-- -Create-- --Read--- > -Delete-- > files /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP > /sec %CP > 16 +++++ +++ +++++ +++ +++++ +++ +++++ +++ +++++ +++ > +++++ +++ > ha03.mycharts.md,6G,68212,90,187783,31,91553,17,73379,90,271470,28,1098. > 6,1,16,+++++,+++,+++++,+++,+++++,+++,+++++,+++,+++++,+++,+++ > ++,+++ > > > bonnie++ on the other server... fast... > > Version 1.03 ------Sequential Output------ --Sequential Input- > --Random- > -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- > --Seeks-- > Machine Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP > /sec %CP > ha04.mycharts.md 6G 69619 91 191459 31 83074 16 72740 89 265057 27 > 964.9 1 > ------Sequential Create------ --------Random > Create-------- > -Create-- --Read--- -Delete-- -Create-- --Read--- > -Delete-- > files /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP > /sec %CP > 16 +++++ +++ +++++ +++ +++++ +++ +++++ +++ +++++ +++ > +++++ +++ > ha04.mycharts.md,6G,69619,91,191459,31,83074,16,72740,89,265057,27,964.9 > ,1,16,+++++,+++,+++++,+++,+++++,+++,+++++,+++,+++++,+++,++++ > > > > Now the dd command... very slow... > > > [root at ha03 download]# dd if=/dev/zero bs=1M count=100 | nc 2.1.1.2 10000 > 2+0 records in > 1+0 records out > 1048576 bytes (1.0 MB) copied, 12.7668 seconds, 82.1 kB/s Maybe I'm missing something, but why does dd report only 2 blocks in and 1 out when there should be 100? peter. > > > drbd stats... very slow... > > [root at ha04 download]# cat /proc/drbd > version: 8.0.13 (api:86/proto:86) > GIT-hash: ee3ad77563d2e87171a3da17cc002ddfd1677dbe build by > buildsvn at c5-i386-build, 2008-08-07 13:42:56 > 0: cs:Connected st:Secondary/Primary ds:UpToDate/UpToDate C r--- > ns:0 nr:196 dw:196 dr:0 al:0 bm:1 lo:0 pe:0 ua:0 ap:0 > resync: used:0/61 hits:9 misses:1 starving:0 dirty:0 changed:1 > act_log: used:0/257 hits:0 misses:0 starving:0 dirty:0 changed:0 > 1: cs:SyncTarget st:Secondary/Primary ds:Inconsistent/UpToDate C r--- > ns:0 nr:89232960 dw:89232960 dr:0 al:0 bm:5446 lo:0 pe:0 ua:0 ap:0 > [=>..................] sync'ed: 13.8% (547719/634860)M > finish: 389:29:20 speed: 320 (316) K/sec > resync: used:0/61 hits:5571613 misses:5447 starving:0 dirty:0 > changed:5447 > act_log: used:0/257 hits:0 misses:0 starving:0 dirty:0 changed:0 > > > drbd.conf... > > > [root at ha04 download]# cat /etc/drbd.conf > # drbd.conf > > global { > usage-count no; > } > > > common { > syncer { > rate 33M; > al-extents 257; > } > } > > resource mysql_ha { > protocol C; > handlers { > pri-on-incon-degr "echo o > /proc/sysrq-trigger ; halt -f"; > pri-lost-after-sb "echo o > /proc/sysrq-trigger ; halt -f"; > local-io-error "echo o > /proc/sysrq-trigger ; halt -f"; > outdate-peer "/usr/lib/heartbeat/drbd-peer-outdater -t 5"; > #pri-lost "echo pri-lost. Have a look at the log files. | mail -s > 'DRBD Alert' root"; > # split-brain "echo split-brain. drbdadm -- --discard-my-data > connect $DRBD_RESOURCE ? | mail -s 'DRBD Alert' admin at pmcipa.com"; > #out-of-sync "echo out-of-sync. drbdadm down $DRBD_RESOURCE. drbdadm > > ::::0 set-gi $DRBD_RESOURCE. drbdadm up $DRBD_RESOURCE. | mail -s 'DRBD > > Alert' root"; > } > startup { > wfc-timeout 0; # infinite > degr-wfc-timeout 120; # 2 minutes. > } > disk { > on-io-error detach; > } > net { > cram-hmac-alg "sha1"; > shared-secret "NerfBurger23!"; > after-sb-0pri disconnect; > after-sb-1pri disconnect; > after-sb-2pri disconnect; > rr-conflict disconnect; > # data-integrity-alg "md5"; > } > on ha03.mycharts.md { > device /dev/drbd0; > disk /dev/vg01/lv01; > address 2.1.1.1:7788; > meta-disk internal; > } > on ha04.mycharts.md { > device /dev/drbd0; > disk /dev/vg01/lv01; > address 2.1.1.2:7788; > meta-disk internal; > } > } > > resource ftp_ha { > protocol C; > handlers { > pri-on-incon-degr "echo o > /proc/sysrq-trigger ; halt -f"; > pri-lost-after-sb "echo o > /proc/sysrq-trigger ; halt -f"; > local-io-error "echo o > /proc/sysrq-trigger ; halt -f"; > outdate-peer "/usr/lib/heartbeat/drbd-peer-outdater -t 5"; > #pri-lost "echo pri-lost. Have a look at the log files. | mail -s > 'DRBD Alert' root"; > # split-brain "echo split-brain. drbdadm -- --discard-my-data > connect $DRBD_RESOURCE ? | mail -s 'DRBD Alert' admin at pmcipa.com"; > #out-of-sync "echo out-of-sync. drbdadm down $DRBD_RESOURCE. drbdadm > > ::::0 set-gi $DRBD_RESOURCE. drbdadm up $DRBD_RESOURCE. | mail -s 'DRBD > > Alert' root"; > } > startup { > wfc-timeout 0; # infinite > degr-wfc-timeout 120; # 2 minutes. > } > disk { > on-io-error detach; > } > net { > cram-hmac-alg "sha1"; > shared-secret "NerfBurger23!"; > after-sb-0pri disconnect; > after-sb-1pri disconnect; > after-sb-2pri disconnect; > rr-conflict disconnect; > # data-integrity-alg "md5"; > } > on ha03.mycharts.md { > device /dev/drbd1; > disk /dev/vg01/lv02; > address 2.1.1.1:7789; > meta-disk internal; > } > on ha04.mycharts.md { > device /dev/drbd1; > disk /dev/vg01/lv02; > address 2.1.1.2:7789; > meta-disk internal; > } > } > > > > > > -- > Eric Robinson > > > > > Disclaimer - October 21, 2008 > This email and any files transmitted with it are confidential and intended > solely for drbd-user at lists.linbit.com. If you are not the named addressee > you should not disseminate, distribute, copy or alter this email. Any views > or opinions presented in this email are solely those of the author and > might not represent those of . Warning: Although has taken reasonable > precautions to ensure no viruses are present in this email, the company > cannot accept responsibility for any loss or damage arising from the use of > this email or attachments. This disclaimer was added by Policy Patrol: > http://www.policypatrol.com/ -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part. URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20081021/2eb62e8d/attachment.pgp>