[DRBD-user] error receiving CsumRSRequest, l: 44!

Alessandro Bono alessandro.bono at gmail.com
Mon Aug 8 00:23:34 CEST 2011


Hi all,

I'm having a problem connecting two DRBD machines.
This is an old cluster that was migrated from DRBD 8.0/heartbeat 2 to DRBD 8.3/corosync some months ago.
This combination has worked until now, but after a primary/secondary swap the nodes can no longer reconnect.

Any ideas?

ga2-srv is a 64-bit KVM VM, ga1-srv is a 64-bit Xen VM.
Kernel 2.6.32-33 (Ubuntu Lucid), DRBD 8.3.11 from git.

Kernel log on ga2-srv:

[  118.275923] drbd: initialized. Version: 8.3.11 (api:88/proto:86-96)
[  118.275927] drbd: GIT-hash: 0de839cee13a4160eed6037c4bddd066645e23c5 debian/changelog debian/control build by root@nebbiolo-dev, 2011-07-21 12:29:37
[  118.275930] drbd: registered as block device major 147
[  118.275932] drbd: minor_table @ 0xffff880119989a00
[  118.512597] block drbd0: Starting worker thread (from cqueue [1343])
[  118.516728] block drbd0: disk( Diskless -> Attaching ) 
[  118.517163] block drbd0: ASSERT( from_tnr - cnr + i - from == mx+1 ) in /usr/src/modules/drbd/drbd/drbd_actlog.c:514
[  118.541669] block drbd0: ASSERT( from_tnr - cnr + i - from == mx+1 ) in /usr/src/modules/drbd/drbd/drbd_actlog.c:514
[  118.556393] block drbd0: Found 3 transactions (72 active extents) in activity log.
[  118.556417] block drbd0: Method to ensure write ordering: barrier
[  118.556425] block drbd0: max BIO size = 4294966784
[  118.556445] block drbd0: drbd_bm_resize called with capacity == 3221127096
[  118.660476] block drbd0: resync bitmap: bits=402640887 words=6291264 pages=12288
[  118.660483] block drbd0: size = 1536 GB (1610563548 KB)
[  118.705459] block drbd0: bitmap READ of 12288 pages took 4 jiffies
[  118.776528] block drbd0: recounting of set bits took additional 7 jiffies
[  118.776544] block drbd0: 78 MB (20094 bits) marked out-of-sync by on disk bit-map.
[  118.776567] block drbd0: disk( Attaching -> UpToDate ) pdsk( DUnknown -> Outdated ) 
[  118.776571] block drbd0: attached to UUIDs B567B75768F63E0C:1F2B1CCF41A2DA74:1F2A1CCF41A2DA74:1F291CCF41A2DA74
[  118.874708] block drbd0: conn( StandAlone -> Unconnected ) 
[  118.876675] block drbd0: Starting receiver thread (from drbd0_worker [1373])
[  118.886220] block drbd0: receiver (re)started
[  118.886228] block drbd0: conn( Unconnected -> WFConnection ) 
[  119.520504] block drbd0: role( Secondary -> Primary ) 
[  119.994679] XFS mounting filesystem drbd0
[  120.038783] Ending clean XFS mount for filesystem: drbd0
[ 2615.120092] block drbd0: Handshake successful: Agreed network protocol version 96
[ 2615.120105] block drbd0: conn( WFConnection -> WFReportParams ) 
[ 2615.120139] block drbd0: Starting asender thread (from drbd0_receiver [1394])
[ 2615.120729] block drbd0: data-integrity-alg: sha1
[ 2615.121283] block drbd0: drbd_sync_handshake:
[ 2615.121287] block drbd0: self B567B75768F63E0D:1F2B1CCF41A2DA74:1F2A1CCF41A2DA74:1F291CCF41A2DA74 bits:20525 flags:0
[ 2615.121296] block drbd0: peer 1F2B1CCF41A2DA74:0000000000000000:EEC3A25A1800DCBC:EEC2A25A1800DCBC bits:20094 flags:0
[ 2615.121326] block drbd0: uuid_compare()=1 by rule 70
[ 2615.121328] block drbd0: Becoming sync source due to disk states.
[ 2615.121334] block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( Outdated -> Inconsistent ) 
[ 2616.515283] block drbd0: helper command: /sbin/drbdadm before-resync-source minor-0
[ 2616.522098] block drbd0: helper command: /sbin/drbdadm before-resync-source minor-0 exit code 0 (0x0)
[ 2616.522107] block drbd0: conn( WFBitMapS -> SyncSource ) 
[ 2616.522113] block drbd0: Began resync as SyncSource (will sync 82100 KB [20525 bits set]).
[ 2616.522128] block drbd0: updated sync UUID B567B75768F63E0D:1F2C1CCF41A2DA74:1F2B1CCF41A2DA74:1F2A1CCF41A2DA74
[ 2616.629374] block drbd0: /usr/src/modules/drbd/drbd/drbd_receiver.c:2204: sector: 1023242240s, size: 262144
[ 2616.655139] block drbd0: error receiving CsumRSRequest, l: 44!
[ 2616.662718] block drbd0: peer( Secondary -> Unknown ) conn( SyncSource -> ProtocolError ) 
[ 2616.699875] block drbd0: asender terminated
[ 2616.699892] block drbd0: Terminating asender thread
[ 2616.701435] block drbd0: bitmap WRITE of 12284 pages took 4 jiffies
[ 2616.701440] block drbd0: 80 MB (20514 bits) marked out-of-sync by on disk bit-map.
[ 2616.701461] block drbd0: Connection closed
[ 2616.701470] block drbd0: conn( ProtocolError -> Unconnected ) 
[ 2616.701481] block drbd0: receiver terminated
[ 2616.701483] block drbd0: Restarting receiver thread
[ 2616.701486] block drbd0: receiver (re)started
[ 2616.701490] block drbd0: conn( Unconnected -> WFConnection ) 

and so on
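
To make the sequence above easier to follow, the state changes can be pulled out of the kernel log with a small script. This is only a sketch: the `transitions` helper and its regular expression are illustrative, not part of DRBD, and they assume the `name( Old -> New )` layout seen in these log lines.

```python
import re

# Matches DRBD state changes such as "conn( SyncSource -> ProtocolError )".
# The field names are the ones that appear in the log above.
TRANSITION = re.compile(r"\b(conn|disk|pdsk|peer|role)\( (\w+) -> (\w+) \)")


def transitions(log_text, field="conn"):
    """Return (old, new) state pairs for one field, in log order."""
    found = []
    for line in log_text.splitlines():
        for name, old, new in TRANSITION.findall(line):
            if name == field:
                found.append((old, new))
    return found


# Three lines copied from the ga2-srv log.
sample = """\
[ 2616.522107] block drbd0: conn( WFBitMapS -> SyncSource )
[ 2616.662718] block drbd0: peer( Secondary -> Unknown ) conn( SyncSource -> ProtocolError )
[ 2616.701470] block drbd0: conn( ProtocolError -> Unconnected )
"""

for old, new in transitions(sample):
    print(f"{old} -> {new}")
```

On the ga2-srv excerpt this lists WFBitMapS -> SyncSource, SyncSource -> ProtocolError, and ProtocolError -> Unconnected for the connection state.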


Kernel log on ga1-srv:

[   11.390918] drbd: initialized. Version: 8.3.11 (api:88/proto:86-96)
[   11.390923] drbd: GIT-hash: 0de839cee13a4160eed6037c4bddd066645e23c5 debian/changelog debian/control build by root@nebbiolo-dev, 2011-07-21 12:29:37
[   11.390926] drbd: registered as block device major 147
[   11.390929] drbd: minor_table @ 0xffff880001e75b00
[   11.642435] block drbd0: Starting worker thread (from cqueue [988])
[   11.644385] block drbd0: disk( Diskless -> Attaching ) 
[   11.652930] block drbd0: Found 6 transactions (324 active extents) in activity log.
[   11.652935] block drbd0: Method to ensure write ordering: barrier
[   11.652939] block drbd0: max BIO size = 4096
[   11.652947] block drbd0: drbd_bm_resize called with capacity == 3221127096
[   11.675027] block drbd0: resync bitmap: bits=402640887 words=6291264 pages=12288
[   11.675035] block drbd0: size = 1536 GB (1610563548 KB)
[   12.067368] block drbd0: bitmap READ of 12288 pages took 39 jiffies
[   12.116378] block drbd0: recounting of set bits took additional 5 jiffies
[   12.116386] block drbd0: 39 MB (9988 bits) marked out-of-sync by on disk bit-map.
[   12.116397] block drbd0: disk( Attaching -> Inconsistent ) 
[   12.116402] block drbd0: attached to UUIDs 1EE21CCF41A2DA74:0000000000000000:EEC3A25A1800DCBC:EEC2A25A1800DCBC
[   12.152591] block drbd0: conn( StandAlone -> Unconnected ) 
[   12.155071] block drbd0: Starting receiver thread (from drbd0_worker [1025])
[   12.157579] block drbd0: receiver (re)started
[   12.157587] block drbd0: conn( Unconnected -> WFConnection ) 
[   15.161152] block drbd0: Handshake successful: Agreed network protocol version 96
[   15.162211] block drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[   15.162223] block drbd0: conn( WFConnection -> WFReportParams ) 
[   15.162251] block drbd0: Starting asender thread (from drbd0_receiver [1071])
[   15.163958] block drbd0: data-integrity-alg: sha1
[   15.163973] block drbd0: max BIO size = 4294966784
[   15.163986] block drbd0: drbd_sync_handshake:
[   15.163991] block drbd0: self 1EE21CCF41A2DA74:0000000000000000:EEC3A25A1800DCBC:EEC2A25A1800DCBC bits:9988 flags:0
[   15.163995] block drbd0: peer B567B75768F63E0D:1EE21CCF41A2DA74:1EE11CCF41A2DA74:1EE01CCF41A2DA74 bits:10588 flags:0
[   15.164000] block drbd0: uuid_compare()=-1 by rule 50
[   15.164003] block drbd0: Becoming sync target due to disk states.
[   15.164011] block drbd0: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate ) 
[   17.826566] block drbd0: conn( WFBitMapT -> WFSyncUUID ) 
[   17.898960] block drbd0: updated sync uuid 1EE31CCF41A2DA74:0000000000000000:EEC3A25A1800DCBC:EEC2A25A1800DCBC
[   17.899256] block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0
[   17.903674] block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0 exit code 0 (0x0)
[   17.903683] block drbd0: conn( WFSyncUUID -> SyncTarget ) 
[   17.903691] block drbd0: Began resync as SyncTarget (will sync 42356 KB [10589 bits set]).
[   18.737711] block drbd0: sock was shut down by peer
[   18.737723] block drbd0: peer( Primary -> Unknown ) conn( SyncTarget -> BrokenPipe ) pdsk( UpToDate -> DUnknown ) 
[   18.737735] block drbd0: short read expecting header on sock: r=0
[   18.737986] block drbd0: asender terminated
[   18.737991] block drbd0: Terminating asender thread
[   19.456024] block drbd0: bitmap WRITE of 12285 pages took 72 jiffies
[   19.456030] block drbd0: 41 MB (10586 bits) marked out-of-sync by on disk bit-map.
[   19.456047] block drbd0: Connection closed
[   19.456056] block drbd0: conn( BrokenPipe -> Unconnected ) 
[   19.456064] block drbd0: receiver terminated
[   19.456067] block drbd0: Restarting receiver thread
[   19.456071] block drbd0: receiver (re)started
[   19.456077] block drbd0: conn( Unconnected -> WFConnection ) 
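
The sizes reported in both logs can be cross-checked with a little arithmetic. The sketch below assumes 512-byte sectors and 4 KiB of data per bitmap bit (DRBD's bitmap granularity); the constant and function names are made up for this illustration.

```python
SECTOR_BYTES = 512      # sector size used in the kernel log
BM_BLOCK_BYTES = 4096   # assumed amount of data covered by one bitmap bit


def sectors_to_kb(sectors):
    """Convert a sector count to KB."""
    return sectors * SECTOR_BYTES // 1024


def bits_to_kb(bits):
    """Convert out-of-sync bitmap bits to KB of data to resync."""
    return bits * BM_BLOCK_BYTES // 1024


def kb_to_bits(kb):
    """Number of bitmap bits needed to cover kb KB, rounded up."""
    return (kb * 1024 + BM_BLOCK_BYTES - 1) // BM_BLOCK_BYTES


capacity_sectors = 3221127096            # from "drbd_bm_resize called with capacity"
size_kb = sectors_to_kb(capacity_sectors)
print("device size in KB:", size_kb)
print("bitmap bits:", kb_to_bits(size_kb))
print("ga2-srv resync KB:", bits_to_kb(20525))
print("ga1-srv resync KB:", bits_to_kb(10589))

request_bytes = 262144                   # size logged at drbd_receiver.c:2204
print("request KiB:", request_bytes // 1024)
print("request sectors:", request_bytes // SECTOR_BYTES)
print("max BIO size is 2**32 - 512:", 4294966784 == 2**32 - 512)
```

The figures agree with the logs: 20525 bits is 82100 KB and 10589 bits is 42356 KB. Note that 4294966784 is exactly 2**32 - 512, and the 262144-byte request logged on ga2-srv is 256 KiB, far above the 4096-byte max BIO size that ga1-srv reported before connecting. Whether that is related to the CsumRSRequest error is only a guess.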


Configuration:

resource r0 {
  syncer {
    rate 25M;
    csums-alg sha1;
    verify-alg sha1;
  }

  net {
    data-integrity-alg sha1;
    cram-hmac-alg "sha1";
    shared-secret "xxxxxxxxxx";
  }

  disk {
    no-disk-flushes;
    no-md-flushes;
  }

  on ga1-srv {
    device     /dev/drbd0;
    disk       /dev/xvda4;
    address    10.12.24.206:7788;
    meta-disk  internal;
  }

  on ga2-srv {
    device    /dev/drbd0;
    disk      /dev/vdd;
    address   10.12.24.207:7788;
    meta-disk internal;
  }

}


-- 
Best regards

Alessandro Bono



