<div dir="ltr">Thanks Lars for the detailed answer and explanation.... Â  and thanks Trevor</div><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Oct 15, 2015 at 4:34 PM, Lars Ellenberg <span dir="ltr">&lt;<a href="mailto:lars.ellenberg@linbit.com" target="_blank">lars.ellenberg@linbit.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">On Thu, Oct 15, 2015 at 03:02:18PM +0300, Ammar Sheikh Saleh wrote:<br>

&gt; &gt; &quot;JBD: Spotted dirty metadata buffer&quot; turns up several bugzillas from<br>

&gt; &gt; 2010 and 2012, suggesting kernel upgrade would help.<br>

&gt;<br>

</span><span class="">&gt; When ==&gt;Â  During normal operation<br>

&gt;<br>

&gt; What&#39;s the IO stack below DRBDÂ  ==&gt;Â  4 Disks in JBOD<br>

&gt;<br>

&gt; Kernel upgrade is not possible nowÂ  ... but at the same time its not too<br>

&gt; oldÂ  ( v.2.6.32-431.20.3)<br>

<br>

</span>What the message is complaining about is that JBD (the journaling<br>

component of ext*) detects a buffer that is &quot;dirty&quot; (under write-out),<br>

when it did not expect it to be.<br>

<br>

This all happens before any IO request related to that buffer even<br>

reaches the block layer or DRBD.<br>

<br>

Maybe you have some &quot;live dump&quot; going on or do some other &quot;unexpected&quot;<br>

(by JBD) things. Or your kernel simply still has some bug/race in the<br>

related IO paths.<br>

<br>

In any case, this has nothing to do with DRBD, as the error message<br>

complains about something happening in the buffer/page cache,<br>

which is *above* DRBD.<br>

<br>

So whatever happens there, would happen just the same, even without DRBD.<br>

(Though, of course, DRBD changes timings and memory pressure, so it may<br>

make it more or less likely to trigger this).<br>

<span class=""><br>

&gt; &gt; On Thu, Oct 15, 2015 at 12:32:11PM +0300, Ammar Sheikh Saleh wrote:<br>

&gt; &gt; &gt; Hi all,<br>

&gt; &gt; &gt;<br>

&gt; &gt; &gt; I have one cluster with 2 nodes in active-active - master/slave setup<br>

&gt; &gt; &gt;<br>

&gt; &gt; &gt; I see the following error messages in the system logs:<br>

&gt; &gt;<br>

&gt; &gt; *when*.<br>

&gt; &gt;<br>

&gt; &gt; After a hard crash?<br>

&gt; &gt; After a simulated Crash?<br>

&gt; &gt; After a switchover?<br>

&gt; &gt; After a failover?<br>

&gt; &gt; During normal operation?<br>

&gt; &gt;<br>

&gt; &gt; What&#39;s the IO stack below DRBD?<br>

&gt; &gt;<br>

</span>&gt; &gt; &gt; JanÂ  5 22:02:18 lws1h1 kernel: EXT4-fs error (device drbd1): ext4_mb_generate_buddy: EXT4-fs: group 3795: 32532 blocks in bitmap, 29780 in gd<br>

<div class="HOEnZb"><div class="h5">&gt; &gt; &gt; JanÂ  5 22:02:19 lws1h1 kernel: JBD: Spotted dirty metadata buffer (dev = drbd1, blocknr = 0). There&#39;s a risk of filesystem corruption in case of system crash.<br>

&gt; &gt; &gt;<br>

&gt; &gt; &gt; What is the appropriate action to fix this issue<br>

&gt; &gt;<br>

&gt; &gt; Did you even google for the message?<br>

&gt; &gt; &quot;JBD: Spotted dirty metadata buffer&quot; turns up several bugzillas from<br>

&gt; &gt; 2010 and 2012, suggesting kernel upgrade would help.<br>

<br>

<br>

--<br>

: Lars Ellenberg<br>

: <a href="http://www.LINBIT.com" rel="noreferrer" target="_blank">http://www.LINBIT.com</a> | Your Way to High Availability<br>

: DRBD, Linux-HAÂ  andÂ  Pacemaker support and consulting<br>

<br>

DRBDÂ® and LINBITÂ® are registered trademarks of LINBIT, Austria.<br>

__<br>

please don&#39;t Cc me, but send to listÂ  Â --Â  Â I&#39;m subscribed<br>

_______________________________________________<br>

drbd-user mailing list<br>

<a href="mailto:drbd-user@lists.linbit.com">drbd-user@lists.linbit.com</a><br>

<a href="http://lists.linbit.com/mailman/listinfo/drbd-user" rel="noreferrer" target="_blank">http://lists.linbit.com/mailman/listinfo/drbd-user</a><br>

</div></div></blockquote></div><br></div>