Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Hi all
I'm having problem attaching two drbd machine
this is an old cluster migrated from drbd8.0/heartbeat2 to drbd8.3/corosync some months ago
this combination worked till now but after a swap primary/secondary it's not possible to reconnect
any idea?
ga2-srv is a kvm 64bit vm, ga1-srv is a xen 64bit vm
kernel 2.6.32-33 ubuntu lucid, drbd 8.3.11 from git
ga2-srv machine
[ 118.275923] drbd: initialized. Version: 8.3.11 (api:88/proto:86-96)
[ 118.275927] drbd: GIT-hash: 0de839cee13a4160eed6037c4bddd066645e23c5 debian/changelog debian/control build by root at nebbiolo-dev, 2011-07-21 12:29:37
[ 118.275930] drbd: registered as block device major 147
[ 118.275932] drbd: minor_table @ 0xffff880119989a00
[ 118.512597] block drbd0: Starting worker thread (from cqueue [1343])
[ 118.516728] block drbd0: disk( Diskless -> Attaching )
[ 118.517163] block drbd0: ASSERT( from_tnr - cnr + i - from == mx+1 ) in /usr/src/modules/drbd/drbd/drbd_actlog.c:514
[ 118.541669] block drbd0: ASSERT( from_tnr - cnr + i - from == mx+1 ) in /usr/src/modules/drbd/drbd/drbd_actlog.c:514
[ 118.556393] block drbd0: Found 3 transactions (72 active extents) in activity log.
[ 118.556417] block drbd0: Method to ensure write ordering: barrier
[ 118.556425] block drbd0: max BIO size = 4294966784
[ 118.556445] block drbd0: drbd_bm_resize called with capacity == 3221127096
[ 118.660476] block drbd0: resync bitmap: bits=402640887 words=6291264 pages=12288
[ 118.660483] block drbd0: size = 1536 GB (1610563548 KB)
[ 118.705459] block drbd0: bitmap READ of 12288 pages took 4 jiffies
[ 118.776528] block drbd0: recounting of set bits took additional 7 jiffies
[ 118.776544] block drbd0: 78 MB (20094 bits) marked out-of-sync by on disk bit-map.
[ 118.776567] block drbd0: disk( Attaching -> UpToDate ) pdsk( DUnknown -> Outdated )
[ 118.776571] block drbd0: attached to UUIDs B567B75768F63E0C:1F2B1CCF41A2DA74:1F2A1CCF41A2DA74:1F291CCF41A2DA74
[ 118.874708] block drbd0: conn( StandAlone -> Unconnected )
[ 118.876675] block drbd0: Starting receiver thread (from drbd0_worker [1373])
[ 118.886220] block drbd0: receiver (re)started
[ 118.886228] block drbd0: conn( Unconnected -> WFConnection )
[ 119.520504] block drbd0: role( Secondary -> Primary )
[ 119.994679] XFS mounting filesystem drbd0
[ 120.038783] Ending clean XFS mount for filesystem: drbd0
[ 2615.120092] block drbd0: Handshake successful: Agreed network protocol version 96
[ 2615.120105] block drbd0: conn( WFConnection -> WFReportParams )
[ 2615.120139] block drbd0: Starting asender thread (from drbd0_receiver [1394])
[ 2615.120729] block drbd0: data-integrity-alg: sha1
[ 2615.121283] block drbd0: drbd_sync_handshake:
[ 2615.121287] block drbd0: self B567B75768F63E0D:1F2B1CCF41A2DA74:1F2A1CCF41A2DA74:1F291CCF41A2DA74 bits:20525 flags:0
[ 2615.121296] block drbd0: peer 1F2B1CCF41A2DA74:0000000000000000:EEC3A25A1800DCBC:EEC2A25A1800DCBC bits:20094 flags:0
[ 2615.121326] block drbd0: uuid_compare()=1 by rule 70
[ 2615.121328] block drbd0: Becoming sync source due to disk states.
[ 2615.121334] block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( Outdated -> Inconsistent )
[ 2616.515283] block drbd0: helper command: /sbin/drbdadm before-resync-source minor-0
[ 2616.522098] block drbd0: helper command: /sbin/drbdadm before-resync-source minor-0 exit code 0 (0x0)
[ 2616.522107] block drbd0: conn( WFBitMapS -> SyncSource )
[ 2616.522113] block drbd0: Began resync as SyncSource (will sync 82100 KB [20525 bits set]).
[ 2616.522128] block drbd0: updated sync UUID B567B75768F63E0D:1F2C1CCF41A2DA74:1F2B1CCF41A2DA74:1F2A1CCF41A2DA74
[ 2616.629374] block drbd0: /usr/src/modules/drbd/drbd/drbd_receiver.c:2204: sector: 1023242240s, size: 262144
[ 2616.655139] block drbd0: error receiving CsumRSRequest, l: 44!
[ 2616.662718] block drbd0: peer( Secondary -> Unknown ) conn( SyncSource -> ProtocolError )
[ 2616.699875] block drbd0: asender terminated
[ 2616.699892] block drbd0: Terminating asender thread
[ 2616.701435] block drbd0: bitmap WRITE of 12284 pages took 4 jiffies
[ 2616.701440] block drbd0: 80 MB (20514 bits) marked out-of-sync by on disk bit-map.
[ 2616.701461] block drbd0: Connection closed
[ 2616.701470] block drbd0: conn( ProtocolError -> Unconnected )
[ 2616.701481] block drbd0: receiver terminated
[ 2616.701483] block drbd0: Restarting receiver thread
[ 2616.701486] block drbd0: receiver (re)started
[ 2616.701490] block drbd0: conn( Unconnected -> WFConnection )
and so on
on ga1-srv machine
[ 11.390918] drbd: initialized. Version: 8.3.11 (api:88/proto:86-96)
[ 11.390923] drbd: GIT-hash: 0de839cee13a4160eed6037c4bddd066645e23c5 debian/changelog debian/control build by root at nebbiolo-dev, 2011-07-21 12:29:37
[ 11.390926] drbd: registered as block device major 147
[ 11.390929] drbd: minor_table @ 0xffff880001e75b00
[ 11.642435] block drbd0: Starting worker thread (from cqueue [988])
[ 11.644385] block drbd0: disk( Diskless -> Attaching )
[ 11.652930] block drbd0: Found 6 transactions (324 active extents) in activity log.
[ 11.652935] block drbd0: Method to ensure write ordering: barrier
[ 11.652939] block drbd0: max BIO size = 4096
[ 11.652947] block drbd0: drbd_bm_resize called with capacity == 3221127096
[ 11.675027] block drbd0: resync bitmap: bits=402640887 words=6291264 pages=12288
[ 11.675035] block drbd0: size = 1536 GB (1610563548 KB)
[ 12.067368] block drbd0: bitmap READ of 12288 pages took 39 jiffies
[ 12.116378] block drbd0: recounting of set bits took additional 5 jiffies
[ 12.116386] block drbd0: 39 MB (9988 bits) marked out-of-sync by on disk bit-map.
[ 12.116397] block drbd0: disk( Attaching -> Inconsistent )
[ 12.116402] block drbd0: attached to UUIDs 1EE21CCF41A2DA74:0000000000000000:EEC3A25A1800DCBC:EEC2A25A1800DCBC
[ 12.152591] block drbd0: conn( StandAlone -> Unconnected )
[ 12.155071] block drbd0: Starting receiver thread (from drbd0_worker [1025])
[ 12.157579] block drbd0: receiver (re)started
[ 12.157587] block drbd0: conn( Unconnected -> WFConnection )
[ 15.161152] block drbd0: Handshake successful: Agreed network protocol version 96
[ 15.162211] block drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 15.162223] block drbd0: conn( WFConnection -> WFReportParams )
[ 15.162251] block drbd0: Starting asender thread (from drbd0_receiver [1071])
[ 15.163958] block drbd0: data-integrity-alg: sha1
[ 15.163973] block drbd0: max BIO size = 4294966784
[ 15.163986] block drbd0: drbd_sync_handshake:
[ 15.163991] block drbd0: self 1EE21CCF41A2DA74:0000000000000000:EEC3A25A1800DCBC:EEC2A25A1800DCBC bits:9988 flags:0
[ 15.163995] block drbd0: peer B567B75768F63E0D:1EE21CCF41A2DA74:1EE11CCF41A2DA74:1EE01CCF41A2DA74 bits:10588 flags:0
[ 15.164000] block drbd0: uuid_compare()=-1 by rule 50
[ 15.164003] block drbd0: Becoming sync target due to disk states.
[ 15.164011] block drbd0: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate )
[ 17.826566] block drbd0: conn( WFBitMapT -> WFSyncUUID )
[ 17.898960] block drbd0: updated sync uuid 1EE31CCF41A2DA74:0000000000000000:EEC3A25A1800DCBC:EEC2A25A1800DCBC
[ 17.899256] block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0
[ 17.903674] block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0 exit code 0 (0x0)
[ 17.903683] block drbd0: conn( WFSyncUUID -> SyncTarget )
[ 17.903691] block drbd0: Began resync as SyncTarget (will sync 42356 KB [10589 bits set]).
[ 18.737711] block drbd0: sock was shut down by peer
[ 18.737723] block drbd0: peer( Primary -> Unknown ) conn( SyncTarget -> BrokenPipe ) pdsk( UpToDate -> DUnknown )
[ 18.737735] block drbd0: short read expecting header on sock: r=0
[ 18.737986] block drbd0: asender terminated
[ 18.737991] block drbd0: Terminating asender thread
[ 19.456024] block drbd0: bitmap WRITE of 12285 pages took 72 jiffies
[ 19.456030] block drbd0: 41 MB (10586 bits) marked out-of-sync by on disk bit-map.
[ 19.456047] block drbd0: Connection closed
[ 19.456056] block drbd0: conn( BrokenPipe -> Unconnected )
[ 19.456064] block drbd0: receiver terminated
[ 19.456067] block drbd0: Restarting receiver thread
[ 19.456071] block drbd0: receiver (re)started
[ 19.456077] block drbd0: conn( Unconnected -> WFConnection )
configuration
resource r0 {
syncer {
rate 25M;
csums-alg sha1;
verify-alg sha1;
}
net {
data-integrity-alg sha1;
cram-hmac-alg "sha1";
shared-secret "xxxxxxxxxx";
}
disk {
no-disk-flushes;
no-md-flushes;
}
on ga1-srv {
device /dev/drbd0;
disk /dev/xvda4;
address 10.12.24.206:7788;
meta-disk internal;
}
on ga2-srv {
device /dev/drbd0;
disk /dev/vdd;
address 10.12.24.207:7788;
meta-disk internal;
}
}
--
Cordiali saluti
Alessandro Bono