<div class="gmail_quote">On Tue, Apr 21, 2009 at 4:37 AM, Lars Ellenberg <span dir="ltr"><<a href="mailto:lars.ellenberg@linbit.com">lars.ellenberg@linbit.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
<div class="im">On Mon, Apr 20, 2009 at 07:05:31PM -0400, Gennadiy Nerubayev wrote:<br>
> On Mon, Apr 20, 2009 at 6:28 AM, Lars Ellenberg<br>
> <<a href="mailto:lars.ellenberg@linbit.com">lars.ellenberg@linbit.com</a>>wrote:<br>
><br>
> > On Mon, Apr 20, 2009 at 12:13:58PM +0200, Lars Ellenberg wrote:<br>
> > ><br></div>
possibly the target "announces" the equivalent of "tagged command<br>
queueing" in iSCSI, and the initiator tries to take advantage of that,<br>
but either target or initiator implement that incorrectly.<br>
not sure how to verify this assumption, maybe using wireshark on the<br>
iSCSI layer (which would also be a way to get to the actual data<br>
of the overlapping requests).</blockquote><div><br>I'll try to get some more info about this, but I'm currently completely out of ideas :( <br><br></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
data divergence due to conflicting (overlapping)<br>
writes cannot happen when DRBD is not connected.<br>
so in this case DRBD does not care.</blockquote><div><br>Gah. But wait, you mentioned in the first email that "submitting a new write request overlapping with an in flight write
request is bad practice on any IO subsystem, as it may violate write
ordering, and the result in general is undefined". So why don't we care about it in the standalone mode? Why can't it happen when DRBD is disconnected? And if it can, why doesn't it cause data corruption? I'm still trying to understand why this is not causing issues for so many people that are running IET in blockio mode on standalone targets (including those built on IET such as openfiler), yet when DRBD is introduced, we run into this situation. <br>
<br>Sorry if it seems like I'm trying to single out DRBD as the culprit, but I can't quite grasp why this only appears to be a problem on DRBD (paranoia checking for the condition aside), and that the problem is big enough to discard writes.<br>
<br>Thanks,<br><br>-Gennadiy<br></div></div>