<div dir="ltr">Many Thanks for valuable suggestions and sharing your experiences. <br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Mon, 1 Aug 2022 at 22:26, GM &lt;<a href="mailto:gianni.milo22@gmail.com">gianni.milo22@gmail.com</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><p style="margin-bottom:0cm;line-height:100%;background:transparent none repeat scroll 0% 0%"><span style="background-color:transparent">But Why do we need to

preserve the metadata via snapshot at the first place as it is

believed that once you rollback the using the snapshot drbd would get

confused and would attempt resynchronisation of the entire device

again any way</span></p></div></blockquote><div>Â </div><div>Consider the following scenario, Two nodes A and B. A is the Primary and B is the Secondary. You create a zfs snapshot (both data and drbd metadata) at 08:00 am on both nodes. At 08:30 am you realise that a serious corruption has taken place and you urgently need to rollback *both* nodes from the snapshot created at 08:00 am. You execute a zfs rollback on both nodes while the drbd resource is down of course. Before bringing the drbd resource up on both nodes, you must decide which way the replication must take place (e.g A -&gt; B or B -&gt; A). Once you decide, bring the resource up. If all goes well, drbd should bring up the resource on both nodes *without* needing to do a full sync but rather just a small increment instead, as the metadata is consistent on both nodes (as it was at the time the snapshot was taken). So it&#39;s important to snapshot the drbd metadata on both nodes, if you want to prevent a full sync.</div><div>Â </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">

<p style="margin-bottom:0cm;line-height:100%;background:transparent none repeat scroll 0% 0%">3) Do i need to

suspend-io  first before taking the snapshot and then check up to

date Status mandatorily ?</p></div></blockquote><div><br>Ideally yes but that depends if the layers above drbd supports that functionality. For example, I&#39;m using qemu VMs on top of drbd/zvol. QEMU can suspend i/o before issuing a qemu based snapshot (via guest tools)Â  which then it will propagateÂ at the layers below (e.g drbd -&gt; zfs). If the layers above drbd cannot handle this, and you could simply take a snapshot at the layer below drbd (zfs in this case), then that would have the same effect as when removing the power from the physical machine (e.g the data would still be consistent due to zfs transaction based nature, but you may or may have have not lost the last few writes issued by the layers above).</div><div>Â </div></div></div>

_______________________________________________<br>

Star us on GITHUB: <a href="https://github.com/LINBIT" rel="noreferrer" target="_blank">https://github.com/LINBIT</a><br>

drbd-user mailing list<br>

<a href="mailto:drbd-user@lists.linbit.com" target="_blank">drbd-user@lists.linbit.com</a><br>

<a href="https://lists.linbit.com/mailman/listinfo/drbd-user" rel="noreferrer" target="_blank">https://lists.linbit.com/mailman/listinfo/drbd-user</a><br>

</blockquote></div>