[DRBD-user] Digest mismatch resulting in "split brain" after (!) automatic reconnect

Lars Ellenberg lars.ellenberg at linbit.com
Wed Feb 16 16:49:53 CET 2011

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


On Wed, Feb 16, 2011 at 03:49:34PM +0100, Raoul Bhatia [IPAX] wrote:
> hi,
> 
> debian lenny,
> pacemaker 1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b,
> drbd 8.3.10 5c0b0469666682443d4785d90a2c603378f9017b,
> ocf ra 1.3 shipped with (self-compiled drbd debian package)
> kernel 2.6.27.57+ipax
> 
> 
> every couple of hours, i encounter a digest mismatch:
> > Digest mismatch, buffer modified by upper layers during write: 0s +4096
> 
> leading ro a disconnect and reconnect (by pacemaker+drbd) and
> a split view after the resync, e.g.:
> 
> node1:
> > version: 8.3.10 (api:88/proto:86-96)
> > GIT-hash: 5c0b0469666682443d4785d90a2c603378f9017b build by root at ipax.at, 2011-02-03 14:58:22
> >  0: cs:Connected ro:Primary/Secondary ds:UpToDate/DUnknown C r-----


> as you can see, node1 reports ds: "UpToDate/DUnknown" whereas

conn == Connected with pdsk == DUnknown is an invalid state.

So yes, that looks like a bug.

Grep for state changes in your kernel logs, and find the place where it
changes to Connected while not changing pdsk to something != DUnknown.

-- 
: Lars Ellenberg
: LINBIT | Your Way to High Availability
: DRBD/HA support and consulting http://www.linbit.com

DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.
__
please don't Cc me, but send to list   --   I'm subscribed



More information about the drbd-user mailing list