[DRBD-user] Digest mismatch resulting in "split brain" after (!) automatic reconnect

Raoul Bhatia [IPAX] r.bhatia at ipax.at
Wed Feb 16 15:49:34 CET 2011

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


hi,

debian lenny,
pacemaker 1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b,
drbd 8.3.10 5c0b0469666682443d4785d90a2c603378f9017b,
ocf ra 1.3 shipped with (self-compiled drbd debian package)
kernel 2.6.27.57+ipax


every couple of hours, i encounter a digest mismatch:
> Digest mismatch, buffer modified by upper layers during write: 0s +4096

leading ro a disconnect and reconnect (by pacemaker+drbd) and
a split view after the resync, e.g.:

node1:
> version: 8.3.10 (api:88/proto:86-96)
> GIT-hash: 5c0b0469666682443d4785d90a2c603378f9017b build by root at ipax.at, 2011-02-03 14:58:22
>  0: cs:Connected ro:Primary/Secondary ds:UpToDate/DUnknown C r-----
>     ns:88040564 nr:0 dw:89438380 dr:199396053 al:787279 bm:9 lo:1 pe:0 ua:0 ap:1 ep:1 wo:b oos:343052

node2:
> version: 8.3.10 (api:88/proto:86-96)
> GIT-hash: 5c0b0469666682443d4785d90a2c603378f9017b build by root at ipax.at, 2011-02-03 14:58:22
>  0: cs:Connected ro:Secondary/Primary ds:UpToDate/UpToDate C r-----
>     ns:0 nr:87855316 dw:87855316 dr:0 al:0 bm:9 lo:0 pe:0 ua:0 ap:0 ep:1 wo:d oos:0


as you can see, node1 reports ds: "UpToDate/DUnknown" whereas
node2 reports "UpToDate/UpToDate"


config and dmesg logs attached. for your information:

Feb 16 06:25:03: devices get out of sync.
Feb 16 13:34:32: i manually disconnect and reconnect from node01 to
                 start resync.


looks like a bug to me, doesn't it?

i have a couple of 2 node clusters running this setup.
for a test, i will upgrade one of them to a more recent kernel from
squeeze and thus will downgrade drbd to squezze's drbd 8.3.7.


cheers,
raoul

ps. some of my previous posts are, quite possibly, related to this:
http://www.gossamer-threads.com/lists/drbd/users/20717#20717
http://www.gossamer-threads.com/lists/drbd/users/20605#20605
+ talks via irc
-- 
____________________________________________________________________
DI (FH) Raoul Bhatia M.Sc.          email.          r.bhatia at ipax.at
Technischer Leiter

IPAX - Aloy Bhatia Hava OG          web.          http://www.ipax.at
Barawitzkagasse 10/2/2/11           email.            office at ipax.at
1190 Wien                           tel.               +43 1 3670030
FN 277995t HG Wien                  fax.            +43 1 3670030 15
____________________________________________________________________
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: drbd.conf
URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20110216/46288e2e/attachment.asc>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: node01_drbd1.log
Type: text/x-log
Size: 8439 bytes
Desc: not available
URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20110216/46288e2e/attachment.bin>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: node02_drbd1.log
Type: text/x-log
Size: 7831 bytes
Desc: not available
URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20110216/46288e2e/attachment-0001.bin>


More information about the drbd-user mailing list