Hi guys,

We've got a bit of a problem at a customer site, and I was wondering if anybody had any suggestions.

With DRBD up and running on both the primary and the backup, massive amounts of data were copied to the relevant mount points on the primary. Apparently this slowed the machine down so much that users were getting kicked off (I haven't yet been able to find out from them whether the bottleneck was CPU usage or memory). When DRBD was taken down on the backup, everything was OK.

They started with DRBD version 0.7.13. I perused the archives of this mailing list and found something suggesting that this was a problem fixed after 0.7.13, so they upgraded to 0.7.19, but they are still having the problem.

Parameters in the drbd.conf file that we've thought might be relevant are:

- protocol (we're using protocol A; I'm sure this is fine)
- sndbuf-size (there are warnings about using large values like 1M)
- max-buffers (this looks promising to me)
- max-epoch-size
- maybe rate

I'm a bit nervous about changing anything, so does anybody have some good ideas?
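For reference, here is a rough sketch of where I understand these options to sit in a 0.7-style drbd.conf. The resource name "r0" and every value shown are placeholders for illustration only; they are not the customer's actual settings and not recommendations.

```
# Sketch only: resource name "r0" and all values are placeholders.
resource r0 {
  # Protocol A = asynchronous replication, which is what we use today.
  protocol A;

  net {
    # Size of the TCP send buffer.
    sndbuf-size    512k;
    # Number of buffers DRBD may allocate for handling requests.
    max-buffers    2048;
    # Maximum number of write requests between two write barriers.
    max-epoch-size 2048;
  }

  syncer {
    # Bandwidth cap for background resynchronization.
    rate 10M;
  }

  # disk, startup and the per-host "on <hostname> { ... }" sections omitted.
}
```

As far as I can tell, rate in the syncer section only limits background resync traffic rather than normal replication of new writes, so it may not be the relevant knob for the slowdown described above.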
Appropriate environmental information such as /proc/drbd and system info is below:

Thanks,
Tim

-----------------------------------------------------------------
Output from /proc/drbd

smb-mtl01:~ # cat /proc/drbd
version: 0.7.19 (api:78/proto:74)
SVN Revision: 2212 build by root at smb-mtl01, 2006-06-13 10:56:44
 0: cs:Connected st:Primary/Secondary ld:Consistent
    ns:0 nr:0 dw:2112 dr:209304 al:522 bm:0 lo:0 pe:0 ua:0 ap:0
 1: cs:Connected st:Primary/Secondary ld:Consistent
    ns:32 nr:0 dw:352 dr:86220 al:3 bm:2 lo:0 pe:0 ua:0 ap:0
 2: cs:Connected st:Primary/Secondary ld:Consistent
    ns:32 nr:0 dw:1632 dr:123328 al:393 bm:3 lo:0 pe:0 ua:0 ap:0

smb-sjo01:~ # cat /proc/drbd
version: 0.7.19 (api:78/proto:74)
SVN Revision: 2212 build by root at smb-sjo01, 2006-06-12 14:55:54
 0: cs:Connected st:Secondary/Primary ld:Consistent
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0
 1: cs:Connected st:Secondary/Primary ld:Consistent
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0
 2: cs:Connected st:Secondary/Primary ld:Consistent
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0
------------------------------------------------------------------

Machine 1: (primary, I believe)
  Link speed: 10 MBps (T1 line)
  1 NIC 100 MB/s.
  SuSE 9
  Hardware Configuration: Manufactured by IBM (iSeries) running on top of AS/400 810.
  Has 1 CPU at 1 GHz.
  Memory is 1024 MB.
  200 GB disk

Machine 2:
  Also SuSE 9.
  Same IBM running on iSeries.
  1 CPU at 1 GHz.
  512 MB memory.
  150 GB disk space.
  1 NIC at 10 MB/s

Replicating about 100 GB of data.

Tim Johnson
Senior Software Engineer
Vision Solutions, Inc.
17911 Von Karman Ave, 5th Floor
Irvine, CA 92614
UNITED STATES
Tel: +1 (949) 253-6528
Fax: +1 (949) 225-0287
Email: tjohnson at visionsolutions.com
<http://www.visionsolutions.com/>

Disclaimer - 6/21/2006
The contents of this e-mail (and any attachments) are confidential, may be privileged, and may contain copyright material of Vision Solutions, Inc. or third parties.