[DRBD-user] BUG alert: protocol A and B broken

Wed Oct 18 15:39:36 CEST 2006

due to the serious problems that may result from this bug,
this mail is sent to drbd-user as well as drbd-announce.
sorry for duplicates.

while rewriting the request handling code in drbd 8 we noticed that
drbd network protocols B and especially A have been broken ever since:
uppon connection loss, they could "forget" to set certain areas
as "out of sync".
if there is no later write to those areas,
they will never be in sync again!

	you have diverging data sets,
	and you will likely see
	file system corruption
	once you do a failover.

  if you are using

     *  protocol A,
	it is very likely that you are affected.

      *	protocol B,
	it is unlikely that you are affected.

	you could be affected, if there has been
	a crash of the Secondary directly
	after connection loss, after the RecvAck has been sent out
	(and been correctly received by the Primary), but before the
	data was actually written on the Secondary disk.

      *	protocol C,
	you are not affected.

if you have been using protocol A or B,
we recommend this workaround for now:

use protocol C, and start a full sync.
 * edit drbd.conf to have "protocol C;" everywhere.
 * do "drbdadm disconnect all" on both nodes
 * do "drbdadm connect all" on both nodes
 * for all drbd devices,
   make sure the "last known good" (current Primary) is ok,
   if neccessary do a fsck on it.
 * for all resources in Secondary state,
   do "drbdadm invalidate <resourcename>"

we believe that the problem has been fixed already in svn,
but we want to be very sure about the fix,
and about some other pending things.
thus we will delay the release of drbd 0.7.22,
which will contain those fixes, for some more days.

again:
	if you do use protocol C, you are NOT affected.

thanks,

-- 
: Lars Ellenberg                                  Tel +43-1-8178292-0  :
: LINBIT Information Technologies GmbH            Fax +43-1-8178292-82 :
: Schoenbrunner Str. 244, A-1120 Vienna/Europe   http://www.linbit.com :
__
please use the "List-Reply" function of your email client.