<div dir="ltr">Hi,<div><br></div><div>I'm using 8.4.4 on CentOS 6.5<br></div><div>I've created a drbd partition on two nodes (see pg.res below).</div><div>The machines are actually VMs running on Xen hosts.</div><div>
The drbd partition is 800Go large.<br></div><div>(There isn't a dedicated channel channel for drbd but I'll have that soon)</div><div><br></div><div>Suppose I make VMNode1 primary and mount the drbd drive (ext4) I can consistently crash the Xen host running VMNode1 by copying a large file, call it F (~200Go) to VMNode1.</div>
<div>What happens is that Dom0 runs out of memory and starts killing processes eventually making the VMNode1 inaccessible.</div><div><br></div><div>If I take the secondary node offline, I can copy F to node1 without any issues then bring up node2 and it will synch.</div>
<div><br></div><div>I've tried copying huge files between the VMNode1 and VMNode2 to standard partitions (not drbd managed) and that worked as expected.</div><div><br></div><div>I'm wondering if anyone's ever run across a similar issue and if there are any recommendations for or against running DRBD in VMs running on Xen.</div>
<div><br></div><div>Thanks,</div><div>Raymond</div><div><br></div><div>### pg.res</div><div><div>resource pg {</div><div> protocol C;</div><div> startup {</div><div> wfc-timeout 15;</div>
<div> degr-wfc-timeout 60;</div><div> }</div><div> net {</div><div> cram-hmac-alg sha1;</div><div> shared-secret "mybigsecret";</div><div> after-sb-0pri discard-least-changes;</div>
<div> after-sb-1pri discard-secondary;</div><div> after-sb-2pri call-pri-lost-after-sb;</div><div> }</div><div> syncer {</div><div> csums-alg sha1;</div><div> }</div>
<div> disk {</div><div> fencing resource-only;</div><div> }</div><div> handlers {</div><div> fence-peer "/usr/lib/drbd/crm-fence-peer.sh";</div><div>
after-resync-target "/usr/lib/drbd/crm-unfence-peer.sh";</div><div> }</div><div> on hapostgresdev01 {</div><div> address <a href="http://192.168.0.10:7788">192.168.0.10:7788</a>;</div>
<div> device /dev/drbd0;</div><div> disk /dev/xvdg;</div><div> meta-disk internal;</div><div> }</div><div><br></div><div> on hapostgresdev02 {</div>
<div> address <a href="http://192.168.0.11:7788">192.168.0.11:7788</a>;</div><div> device /dev/drbd0;</div><div> disk /dev/xvdg;</div><div> meta-disk internal;</div>
<div> }</div><div><br></div><div>}</div></div><div><br></div></div>