[DRBD-user] error receiving CsumRSRequest, l: 44!

Alessandro Bono alessandro.bono at gmail.com
Mon Aug 8 14:47:06 CEST 2011

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


On Sun, 07 Aug 2011 22:23:34 +0000, Alessandro Bono wrote:

> Hi all
> 
> I'm having problem attaching two drbd machine this is an old cluster
> migrated from drbd8.0/heartbeat2 to drbd8.3/corosync some months ago
> this combination worked till now but after a swap primary/secondary it's
> not possible to reconnect
> 
> any idea?

switching xen vm to kvm solved the problem
mixing two different type of vm it's not a good idea 

> 
> ga2-srv is a kvm 64bit vm, ga1-srv is a xen 64bit vm kernel 2.6.32-33
> ubuntu lucid, drbd 8.3.11 from git
> 
> ga2-srv machine
> 
> [  118.275923] drbd: initialized. Version: 8.3.11 (api:88/proto:86-96) [
>  118.275927] drbd: GIT-hash: 0de839cee13a4160eed6037c4bddd066645e23c5
> debian/changelog debian/control build by root at nebbiolo-dev, 2011-07-21
> 12:29:37 [  118.275930] drbd: registered as block device major 147 [ 
> 118.275932] drbd: minor_table @ 0xffff880119989a00 [  118.512597] block
> drbd0: Starting worker thread (from cqueue [1343]) [  118.516728] block
> drbd0: disk( Diskless -> Attaching ) [  118.517163] block drbd0: ASSERT(
> from_tnr - cnr + i - from == mx+1 ) in
> /usr/src/modules/drbd/drbd/drbd_actlog.c:514 [  118.541669] block drbd0:
> ASSERT( from_tnr - cnr + i - from == mx+1 ) in
> /usr/src/modules/drbd/drbd/drbd_actlog.c:514 [  118.556393] block drbd0:
> Found 3 transactions (72 active extents) in activity log. [  118.556417]
> block drbd0: Method to ensure write ordering: barrier [  118.556425]
> block drbd0: max BIO size = 4294966784 [  118.556445] block drbd0:
> drbd_bm_resize called with capacity == 3221127096 [  118.660476] block
> drbd0: resync bitmap: bits=402640887 words=6291264 pages=12288 [ 
> 118.660483] block drbd0: size = 1536 GB (1610563548 KB) [  118.705459]
> block drbd0: bitmap READ of 12288 pages took 4 jiffies [  118.776528]
> block drbd0: recounting of set bits took additional 7 jiffies [ 
> 118.776544] block drbd0: 78 MB (20094 bits) marked out-of-sync by on
> disk bit-map. [  118.776567] block drbd0: disk( Attaching -> UpToDate )
> pdsk( DUnknown -> Outdated ) [  118.776571] block drbd0: attached to
> UUIDs
> B567B75768F63E0C:1F2B1CCF41A2DA74:1F2A1CCF41A2DA74:1F291CCF41A2DA74 [ 
> 118.874708] block drbd0: conn( StandAlone -> Unconnected ) [ 
> 118.876675] block drbd0: Starting receiver thread (from drbd0_worker
> [1373]) [  118.886220] block drbd0: receiver (re)started [  118.886228]
> block drbd0: conn( Unconnected -> WFConnection ) [  119.520504] block
> drbd0: role( Secondary -> Primary ) [  119.994679] XFS mounting
> filesystem drbd0 [  120.038783] Ending clean XFS mount for filesystem:
> drbd0 [ 2615.120092] block drbd0: Handshake successful: Agreed network
> protocol version 96 [ 2615.120105] block drbd0: conn( WFConnection ->
> WFReportParams ) [ 2615.120139] block drbd0: Starting asender thread
> (from drbd0_receiver [1394]) [ 2615.120729] block drbd0:
> data-integrity-alg: sha1 [ 2615.121283] block drbd0:
> drbd_sync_handshake: [ 2615.121287] block drbd0: self
> B567B75768F63E0D:1F2B1CCF41A2DA74:1F2A1CCF41A2DA74:1F291CCF41A2DA74
> bits:20525 flags:0 [ 2615.121296] block drbd0: peer
> 1F2B1CCF41A2DA74:0000000000000000:EEC3A25A1800DCBC:EEC2A25A1800DCBC
> bits:20094 flags:0 [ 2615.121326] block drbd0: uuid_compare()=1 by rule
> 70 [ 2615.121328] block drbd0: Becoming sync source due to disk states.
> [ 2615.121334] block drbd0: peer( Unknown -> Secondary ) conn(
> WFReportParams -> WFBitMapS ) pdsk( Outdated -> Inconsistent ) [
> 2616.515283] block drbd0: helper command: /sbin/drbdadm
> before-resync-source minor-0 [ 2616.522098] block drbd0: helper command:
> /sbin/drbdadm before-resync-source minor-0 exit code 0 (0x0) [
> 2616.522107] block drbd0: conn( WFBitMapS -> SyncSource ) [ 2616.522113]
> block drbd0: Began resync as SyncSource (will sync 82100 KB [20525 bits
> set]). [ 2616.522128] block drbd0: updated sync UUID
> B567B75768F63E0D:1F2C1CCF41A2DA74:1F2B1CCF41A2DA74:1F2A1CCF41A2DA74 [
> 2616.629374] block drbd0:
> /usr/src/modules/drbd/drbd/drbd_receiver.c:2204: sector: 1023242240s,
> size: 262144 [ 2616.655139] block drbd0: error receiving CsumRSRequest,
> l: 44! [ 2616.662718] block drbd0: peer( Secondary -> Unknown ) conn(
> SyncSource -> ProtocolError ) [ 2616.699875] block drbd0: asender
> terminated [ 2616.699892] block drbd0: Terminating asender thread [
> 2616.701435] block drbd0: bitmap WRITE of 12284 pages took 4 jiffies [
> 2616.701440] block drbd0: 80 MB (20514 bits) marked out-of-sync by on
> disk bit-map. [ 2616.701461] block drbd0: Connection closed [
> 2616.701470] block drbd0: conn( ProtocolError -> Unconnected ) [
> 2616.701481] block drbd0: receiver terminated [ 2616.701483] block
> drbd0: Restarting receiver thread [ 2616.701486] block drbd0: receiver
> (re)started [ 2616.701490] block drbd0: conn( Unconnected ->
> WFConnection )
> 
> and so on
> 
> 
> on ga1-srv machine
> 
> [   11.390918] drbd: initialized. Version: 8.3.11 (api:88/proto:86-96) [
>   11.390923] drbd: GIT-hash: 0de839cee13a4160eed6037c4bddd066645e23c5
> debian/changelog debian/control build by root at nebbiolo-dev, 2011-07-21
> 12:29:37 [   11.390926] drbd: registered as block device major 147 [  
> 11.390929] drbd: minor_table @ 0xffff880001e75b00 [   11.642435] block
> drbd0: Starting worker thread (from cqueue [988]) [   11.644385] block
> drbd0: disk( Diskless -> Attaching ) [   11.652930] block drbd0: Found 6
> transactions (324 active extents) in activity log. [   11.652935] block
> drbd0: Method to ensure write ordering: barrier [   11.652939] block
> drbd0: max BIO size = 4096 [   11.652947] block drbd0: drbd_bm_resize
> called with capacity == 3221127096 [   11.675027] block drbd0: resync
> bitmap: bits=402640887 words=6291264 pages=12288 [   11.675035] block
> drbd0: size = 1536 GB (1610563548 KB) [   12.067368] block drbd0: bitmap
> READ of 12288 pages took 39 jiffies [   12.116378] block drbd0:
> recounting of set bits took additional 5 jiffies [   12.116386] block
> drbd0: 39 MB (9988 bits) marked out-of-sync by on disk bit-map. [  
> 12.116397] block drbd0: disk( Attaching -> Inconsistent ) [   12.116402]
> block drbd0: attached to UUIDs
> 1EE21CCF41A2DA74:0000000000000000:EEC3A25A1800DCBC:EEC2A25A1800DCBC [  
> 12.152591] block drbd0: conn( StandAlone -> Unconnected ) [   12.155071]
> block drbd0: Starting receiver thread (from drbd0_worker [1025]) [  
> 12.157579] block drbd0: receiver (re)started [   12.157587] block drbd0:
> conn( Unconnected -> WFConnection ) [   15.161152] block drbd0:
> Handshake successful: Agreed network protocol version 96 [   15.162211]
> block drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC [  
> 15.162223] block drbd0: conn( WFConnection -> WFReportParams ) [  
> 15.162251] block drbd0: Starting asender thread (from drbd0_receiver
> [1071]) [   15.163958] block drbd0: data-integrity-alg: sha1 [  
> 15.163973] block drbd0: max BIO size = 4294966784 [   15.163986] block
> drbd0: drbd_sync_handshake: [   15.163991] block drbd0: self
> 1EE21CCF41A2DA74:0000000000000000:EEC3A25A1800DCBC:EEC2A25A1800DCBC
> bits:9988 flags:0 [   15.163995] block drbd0: peer
> B567B75768F63E0D:1EE21CCF41A2DA74:1EE11CCF41A2DA74:1EE01CCF41A2DA74
> bits:10588 flags:0 [   15.164000] block drbd0: uuid_compare()=-1 by rule
> 50 [   15.164003] block drbd0: Becoming sync target due to disk states.
> [   15.164011] block drbd0: peer( Unknown -> Primary ) conn(
> WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate ) [  
> 17.826566] block drbd0: conn( WFBitMapT -> WFSyncUUID ) [   17.898960]
> block drbd0: updated sync uuid
> 1EE31CCF41A2DA74:0000000000000000:EEC3A25A1800DCBC:EEC2A25A1800DCBC [  
> 17.899256] block drbd0: helper command: /sbin/drbdadm
> before-resync-target minor-0 [   17.903674] block drbd0: helper command:
> /sbin/drbdadm before-resync-target minor-0 exit code 0 (0x0) [  
> 17.903683] block drbd0: conn( WFSyncUUID -> SyncTarget ) [   17.903691]
> block drbd0: Began resync as SyncTarget (will sync 42356 KB [10589 bits
> set]). [   18.737711] block drbd0: sock was shut down by peer [  
> 18.737723] block drbd0: peer( Primary -> Unknown ) conn( SyncTarget ->
> BrokenPipe ) pdsk( UpToDate -> DUnknown ) [   18.737735] block drbd0:
> short read expecting header on sock: r=0 [   18.737986] block drbd0:
> asender terminated [   18.737991] block drbd0: Terminating asender
> thread [   19.456024] block drbd0: bitmap WRITE of 12285 pages took 72
> jiffies [   19.456030] block drbd0: 41 MB (10586 bits) marked
> out-of-sync by on disk bit-map. [   19.456047] block drbd0: Connection
> closed [   19.456056] block drbd0: conn( BrokenPipe -> Unconnected ) [  
> 19.456064] block drbd0: receiver terminated [   19.456067] block drbd0:
> Restarting receiver thread [   19.456071] block drbd0: receiver
> (re)started [   19.456077] block drbd0: conn( Unconnected ->
> WFConnection )
> 
> 
> configuration
> 
> resource r0 {
>  syncer {
>     rate 25M;
>     csums-alg sha1;
>     verify-alg sha1;
>   }
> 
>   net {
>     data-integrity-alg sha1;
>     cram-hmac-alg "sha1";
>     shared-secret "xxxxxxxxxx";
>   }
> 
>   disk {
>     no-disk-flushes;
>     no-md-flushes;
>   }
> 
>   on ga1-srv {
>     device     /dev/drbd0;
>     disk       /dev/xvda4;
>     address    10.12.24.206:7788;
>     meta-disk  internal;
>   }
> 
>   on ga2-srv {
>     device    /dev/drbd0;
>     disk      /dev/vdd;
>     address   10.12.24.207:7788;
>     meta-disk internal;
>   }
> 
> }





-- 
Cordiali saluti

Alessandro Bono




More information about the drbd-user mailing list