[DRBD-user] Unable to reconnect the secondary after shrinking the primary

Cyril Bouthors cyril at bouthors.org
Fri Oct 13 08:58:20 CEST 2006

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


I've shrinked the primary when it was Primary/Unknown with

drbdsetup /dev/drbd0 resize -d 140G

Worked fine but the secondary does not see the size changement and is
unable to reconnect. I'm getting continuously one error and 2 asserts:

Oct 13 07:45:23 sqlb1 kernel: drbd: initialised. Version: 0.7.21 (api:79/proto:74)
Oct 13 07:45:23 sqlb1 kernel: drbd: SVN Revision: 2326 build by root at nfsb4, 2006-08-03 09:36:16
Oct 13 07:45:23 sqlb1 kernel: drbd: registered as block device major 147
Oct 13 07:45:23 sqlb1 kernel: drbd0: Adjusting my ra_pages to backing device's (32 -> 64)
Oct 13 07:45:23 sqlb1 kernel: klogd 1.4.1, ---------- state change ---------- 
Oct 13 07:45:23 sqlb1 kernel: No module symbols loaded - kernel modules not enabled. 
Oct 13 07:45:23 sqlb1 kernel: drbd0: resync bitmap: bits=39033104 words=1219786
Oct 13 07:45:23 sqlb1 kernel: drbd0: size = 148 GB (156132416 KB)
                                            ^^^^^^
Oct 13 07:45:24 sqlb1 kernel: drbd0: 0 KB marked out-of-sync by on disk bit-map.
Oct 13 07:45:24 sqlb1 kernel: drbd0: Found 4 transactions (136 active extents) in activity log.
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbdsetup [27933]: cstate Unconfigured --> StandAlone
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbdsetup [27946]: cstate StandAlone --> Unconnected
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate Unconnected --> WFConnection
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate WFConnection --> WFReportParams
Oct 13 07:45:24 sqlb1 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Oct 13 07:45:24 sqlb1 kernel: drbd0: Connection established.
Oct 13 07:45:24 sqlb1 kernel: drbd0: I am(S): 1:00000003:00000006:00000292:00000008:01
Oct 13 07:45:24 sqlb1 kernel: drbd0: Peer(P): 1:00000003:00000006:00000293:0000000c:10
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate WFReportParams --> WFBitMapT
Oct 13 07:45:24 sqlb1 kernel: drbd0: Secondary/Unknown --> Secondary/Primary
Oct 13 07:45:24 sqlb1 kernel: drbd0: 0 KB marked out-of-sync by on disk bit-map.
Oct 13 07:45:24 sqlb1 kernel: drbd0: Found 4 transactions (136 active extents) in activity log.
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbdsetup [27933]: cstate Unconfigured --> StandAlone
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbdsetup [27946]: cstate StandAlone --> Unconnected
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate Unconnected --> WFConnection
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate WFConnection --> WFReportParams
Oct 13 07:45:24 sqlb1 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Oct 13 07:45:24 sqlb1 kernel: drbd0: Connection established.
Oct 13 07:45:24 sqlb1 kernel: drbd0: I am(S): 1:00000003:00000006:00000292:00000008:01
Oct 13 07:45:24 sqlb1 kernel: drbd0: Peer(P): 1:00000003:00000006:00000293:0000000c:10
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate WFReportParams --> WFBitMapT
Oct 13 07:45:24 sqlb1 kernel: drbd0: Secondary/Unknown --> Secondary/Primary
Oct 13 07:45:24 sqlb1 kernel: drbd0: 0 KB marked out-of-sync by on disk bit-map.
Oct 13 07:45:24 sqlb1 kernel: drbd0: Found 4 transactions (136 active extents) in activity log.
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbdsetup [27933]: cstate Unconfigured --> StandAlone
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbdsetup [27946]: cstate StandAlone --> Unconnected
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate Unconnected --> WFConnection
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate WFConnection --> WFReportParams
Oct 13 07:45:24 sqlb1 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Oct 13 07:45:24 sqlb1 kernel: drbd0: Connection established.
Oct 13 07:45:24 sqlb1 kernel: drbd0: I am(S): 1:00000003:00000006:00000292:00000008:01
Oct 13 07:45:24 sqlb1 kernel: drbd0: Peer(P): 1:00000003:00000006:00000293:0000000c:10
Oct 13 07:45:24 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate WFReportParams --> WFBitMapT
Oct 13 07:45:24 sqlb1 kernel: drbd0: Secondary/Unknown --> Secondary/Primary
Oct 13 07:45:25 sqlb1 kernel: drbd0: receive_bitmap: (want != h->length) in /usr/src/modules/drbd/drbd/drbd_receiver.c:1637
Oct 13 07:45:25 sqlb1 kernel: drbd0: error receiving ReportBitMap, l: 784!
Oct 13 07:45:25 sqlb1 kernel: drbd0: ASSERT( mdev->cstate < Connected ) in /usr/src/modules/drbd/drbd/drbd_receiver.c:1836
Oct 13 07:45:25 sqlb1 kernel: drbd0: asender terminated
Oct 13 07:45:25 sqlb1 kernel: drbd0: worker terminated
Oct 13 07:45:25 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate WFBitMapT --> Unconnected
Oct 13 07:45:25 sqlb1 kernel: drbd0: Connection lost.
Oct 13 07:45:25 sqlb1 kernel: drbd0: ASSERT( thi->t_state == Restarting ) in /usr/src/modules/drbd/drbd/drbd_receiver.c:2103
Oct 13 07:45:25 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate Unconnected --> WFConnection
Oct 13 07:45:25 sqlb1 kernel: drbd0: asender terminated
Oct 13 07:45:25 sqlb1 kernel: drbd0: worker terminated
Oct 13 07:45:25 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate WFBitMapT --> Unconnected
Oct 13 07:45:25 sqlb1 kernel: drbd0: Connection lost.
Oct 13 07:45:25 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate Unconnected --> WFConnection
Oct 13 07:45:25 sqlb1 kernel: drbd0: receive_bitmap: (want != h->length) in /usr/src/modules/drbd/drbd/drbd_receiver.c:1637
Oct 13 07:45:25 sqlb1 kernel: drbd0: error receiving ReportBitMap, l: 784!
Oct 13 07:45:25 sqlb1 kernel: drbd0: ASSERT( mdev->cstate < Connected ) in /usr/src/modules/drbd/drbd/drbd_receiver.c:1836
Oct 13 07:45:25 sqlb1 kernel: drbd0: asender terminated
Oct 13 07:45:25 sqlb1 kernel: drbd0: worker terminated
Oct 13 07:45:25 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate WFBitMapT --> Unconnected
Oct 13 07:45:25 sqlb1 kernel: drbd0: Connection lost.
Oct 13 07:45:25 sqlb1 kernel: drbd0: ASSERT( thi->t_state == Restarting ) in /usr/src/modules/drbd/drbd/drbd_receiver.c:2103
Oct 13 07:45:25 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate Unconnected --> WFConnection
Oct 13 07:45:27 sqlb1 kernel: drbd0: drbd0_receiver [27947]: cstate WFConnection --> WFReportParams
Oct 13 07:45:27 sqlb1 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
etc. etc.

I'm running DRBD 0.7.21 with Linux 2.6.16 on both primary and
secondary, Debian flavor.

On the primary, here are the logs:

Oct 13 07:45:24 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate WFConnection --> WFReportParams
Oct 13 07:45:24 sqla1 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Oct 13 07:45:24 sqla1 kernel: drbd0: Connection established.
Oct 13 07:45:24 sqla1 kernel: drbd0: I am(P): 1:00000003:00000006:00000293:0000000c:10
Oct 13 07:45:24 sqla1 kernel: drbd0: Peer(S): 1:00000003:00000006:00000292:00000008:01
Oct 13 07:45:24 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate WFReportParams --> WFBitMapS
Oct 13 07:45:25 sqla1 kernel: drbd0: Primary/Unknown --> Primary/Secondary
Oct 13 07:45:25 sqla1 kernel: drbd0: sock was reset by peer
Oct 13 07:45:25 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate WFBitMapS --> BrokenPipe
Oct 13 07:45:25 sqla1 kernel: drbd0: short read expecting header on sock: r=-104
Oct 13 07:45:25 sqla1 kernel: drbd0: meta connection shut down by peer.
Oct 13 07:45:25 sqla1 kernel: drbd0: asender terminated
Oct 13 07:45:25 sqla1 kernel: drbd0: worker terminated
Oct 13 07:45:25 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate BrokenPipe --> Unconnected
Oct 13 07:45:25 sqla1 kernel: drbd0: Connection lost.
Oct 13 07:45:25 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate Unconnected --> WFConnection
Oct 13 07:45:27 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate WFConnection --> WFReportParams
Oct 13 07:45:27 sqla1 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Oct 13 07:45:27 sqla1 kernel: drbd0: Connection established.
Oct 13 07:45:27 sqla1 kernel: drbd0: I am(P): 1:00000003:00000006:00000294:0000000c:10
Oct 13 07:45:27 sqla1 kernel: drbd0: Peer(S): 0:00000003:00000006:00000293:0000000c:01
Oct 13 07:45:27 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate WFReportParams --> WFBitMapS
Oct 13 07:45:28 sqla1 kernel: drbd0: Primary/Unknown --> Primary/Secondary
Oct 13 07:45:28 sqla1 kernel: drbd0: sock was reset by peer
Oct 13 07:45:28 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate WFBitMapS --> BrokenPipe
Oct 13 07:45:28 sqla1 kernel: drbd0: short read expecting header on sock: r=-104
Oct 13 07:45:28 sqla1 kernel: drbd0: meta connection shut down by peer.
Oct 13 07:45:28 sqla1 kernel: drbd0: asender terminated
Oct 13 07:45:28 sqla1 kernel: drbd0: worker terminated
Oct 13 07:45:28 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate BrokenPipe --> Unconnected
Oct 13 07:45:28 sqla1 kernel: drbd0: Connection lost.
Oct 13 07:45:28 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate Unconnected --> WFConnection
Oct 13 07:45:30 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate WFConnection --> WFReportParams
Oct 13 07:45:30 sqla1 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Oct 13 07:45:30 sqla1 kernel: drbd0: Connection established.
Oct 13 07:45:30 sqla1 kernel: drbd0: I am(P): 1:00000003:00000006:00000295:0000000c:10
Oct 13 07:45:30 sqla1 kernel: drbd0: Peer(S): 0:00000003:00000006:00000294:0000000c:01
Oct 13 07:45:30 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate WFReportParams --> WFBitMapS
Oct 13 07:45:31 sqla1 kernel: drbd0: Primary/Unknown --> Primary/Secondary
Oct 13 07:45:31 sqla1 kernel: drbd0: sock was reset by peer
Oct 13 07:45:31 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate WFBitMapS --> BrokenPipe
Oct 13 07:45:31 sqla1 kernel: drbd0: short read expecting header on sock: r=-104
Oct 13 07:45:31 sqla1 kernel: drbd0: meta connection shut down by peer.
Oct 13 07:45:31 sqla1 kernel: drbd0: asender terminated
Oct 13 07:45:31 sqla1 kernel: drbd0: worker terminated
Oct 13 07:45:31 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate BrokenPipe --> Unconnected
Oct 13 07:45:31 sqla1 kernel: drbd0: Connection lost.
Oct 13 07:45:31 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate Unconnected --> WFConnection
Oct 13 07:45:33 sqla1 kernel: drbd0: drbd0_receiver [8418]: cstate WFConnection --> WFReportParams
Oct 13 07:45:33 sqla1 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Oct 13 07:45:33 sqla1 kernel: drbd0: Connection established.

I even tried zeroing the backend storage and restarting drbd, it's
still the same.

:(
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 188 bytes
Desc: not available
URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20061013/81b72217/attachment.pgp>


More information about the drbd-user mailing list