<div dir="ltr">Hi,<br><br>With the async congestion mode, local disk I/O performance is too slow than sync replication mode.<br><br><br>1.version<div>Â  - Â V9.0.1-1, GIT-hash: f57acfc22d29a95697e683fb6bbacd9a1ad4348e <br>Â  - VM: CentOS 7</div><div><br>2. conf</div><div>Â  Â  protocol A;<br>Â  Â  sndbuf-size 256K;<br>Â  Â  on-congestion pull-ahead;<br>Â  Â  congestion-fill 128K;<div><br></div><div>3. test</div>Â  Â  [root@drbd9-01 drbd.d]# time cp /3GB.dataÂ  /mnt<div><br></div><div>4. result</div><div><br></div><div>1) Async(congestion mode)<br></div><div><br></div><div>(1) test1</div><div>realÂ Â Â  2m50.570s<br>userÂ Â Â  0m0.015s<br>sysÂ Â Â  0m10.086s<div><br>(2) test2<br>realÂ Â Â  2m24.809s<br>userÂ Â Â  0m0.022s<br>sysÂ Â Â  0m10.415s<br></div><div><br><br>2) Sync</div><div><br></div><div>(1) test1<br>realÂ Â Â  0m46.559s<br>userÂ Â Â  0m0.026s<br>sysÂ Â Â  0m9.863s<br><br>(2) test2<br>realÂ Â Â  0m58.031s<br>userÂ Â Â  0m0.035s<br>sys Â  Â 0m10.451s<br><br></div><div><br></div><div>I think there seems to be a problem in the following areas:<br>Â - Before congestion, completion for local disk I/O is treated at complete_master_bio function in drbd_sender thread.<br>Â - But even if the congestion occured, I think, it may be treated at the same position.<br>Â - In other words although local disk write it is already finished, the copy application is not receiving this completion signal and pending.</div><div>Â - This applicationÂ waits for this completionÂ until got_BarrierAck receives just requested-block from the peer.Â </div><div>Â - I think the local I/O completion should be done as soon as detecting congestion without waiting peer ack.</div><div><br></div><div>Is there any my misunderstand about drbd congestion mechanism?</div><div><br></div><div>Thanks.</div><div><div><div dir="ltr"><div dir="ltr"><p style="margin:0cm 0cm 0pt"><br></p></div></div></div>

</div></div></div></div>