[DRBD-user] drbd doesn't synchronize. Perhaps a bug?

Regis Gras Regis.Gras at ujf-grenoble.fr
Thu Feb 2 09:37:59 CET 2006



 I want to simulate a crash of the main server.
 To do that, I stop the network and then stop drbd
on the main server.
 At this point, the Secondary server becomes Primary without any
problem.
 Later, I reconnect the network on the main server and start drbd.
 At this point, the two computers do not synchronize.
 The old main server becomes Secondary, waiting for connection (WFConnection),
and the current Primary becomes Primary StandAlone.

 The only way I have found to synchronize the two computers is to run
the following command twice on the current Primary:

drbdsetup /dev/drbd0 net IP_actual_Primary IP_actual_Secondary  C -d reconnect

In the logs, I find a large number of error messages.

Could someone help me solve this problem?

 Best regards.


The main server is named "pcvalid", and the other is "bruegel".
I am using drbd-0.7.15-1 and kernel-2.6.14-1.1653_FC4 on Fedora Core 4.


===========================================================================
Reconnect network and start drbd on pcvalid
(At this time, bruegel is Primary, Waiting for connection)
===========================================================================
Logs on pcvalid:

Feb  1 17:34:25 pcvalid kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Feb  1 17:34:25 pcvalid kernel: drbd0: Connection established.
Feb  1 17:34:25 pcvalid kernel: drbd0: I am(S): 1:00000003:00000001:000001d9:0000002a:00
Feb  1 17:34:25 pcvalid kernel: drbd0: Peer(P): 1:00000003:00000001:000001d7:0000002c:10
Feb  1 17:34:25 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFReportParams --> WFBitMapS
Feb  1 17:34:25 pcvalid kernel: drbd0: sock_sendmsg returned -32
Feb  1 17:34:25 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFBitMapS --> BrokenPipe
Feb  1 17:34:25 pcvalid kernel: drbd0: short sent ReportBitMap size=4096 sent=160
Feb  1 17:34:25 pcvalid kernel: drbd0: Secondary/Unknown --> Secondary/Primary
Feb  1 17:34:25 pcvalid kernel: drbd0: meta connection shut down by peer.
Feb  1 17:34:25 pcvalid kernel: drbd0: asender terminated
Feb  1 17:34:25 pcvalid kernel: drbd0: sock was shut down by peer
Feb  1 17:34:25 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate BrokenPipe --> BrokenPipe
Feb  1 17:34:26 pcvalid kernel: drbd0: short read expecting header on sock: r=0
Feb  1 17:34:26 pcvalid kernel: drbd0: worker terminated
Feb  1 17:34:26 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate BrokenPipe --> Unconnected
Feb  1 17:34:26 pcvalid kernel: drbd0: Connection lost.
Feb  1 17:34:26 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate Unconnected --> WFConnection
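
To make the sequence above easier to follow, here is a minimal Python sketch that pulls the "cstate A --> B" transitions out of a few of the pcvalid lines and decodes the number in "sock_sendmsg returned -32" as a Linux errno. The helper names (state_path, send_errors) are hypothetical and only for illustration; the log lines are copied from above with the syslog prefix removed.

```python
import errno
import re

# A few pcvalid lines from the log above, without the syslog prefix.
LOG = """\
drbd0: drbd0_receiver [8478]: cstate WFReportParams --> WFBitMapS
drbd0: sock_sendmsg returned -32
drbd0: drbd0_receiver [8478]: cstate WFBitMapS --> BrokenPipe
drbd0: drbd0_receiver [8478]: cstate BrokenPipe --> BrokenPipe
drbd0: drbd0_receiver [8478]: cstate BrokenPipe --> Unconnected
drbd0: drbd0_receiver [8478]: cstate Unconnected --> WFConnection
"""

CSTATE = re.compile(r"cstate (\w+) --> (\w+)")
RETURNED = re.compile(r"sock_sendmsg returned -(\d+)")


def state_path(log):
    """Return the connection states visited, skipping self-transitions."""
    pairs = CSTATE.findall(log)
    path = [pairs[0][0]] if pairs else []
    for _old, new in pairs:
        if new != path[-1]:
            path.append(new)
    return path


def send_errors(log):
    """Map each negative return value to its errno name on this platform."""
    return [errno.errorcode.get(int(n), "unknown") for n in RETURNED.findall(log)]


print(" -> ".join(state_path(LOG)))
print(send_errors(LOG))
```

On Linux, errno 32 is EPIPE, which fits the picture of the peer closing the socket while pcvalid was still sending its bitmap.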


Logs on bruegel


Feb  1 17:34:25 bruegel kernel: drbd0: drbd0_receiver [32666]: cstate WFConnection --> WFReportParams
Feb  1 17:34:25 bruegel kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Feb  1 17:34:25 bruegel kernel: drbd0: Connection established.
Feb  1 17:34:25 bruegel kernel: drbd0: I am(P): 1:00000003:00000001:000001d7:0000002c:10
Feb  1 17:34:25 bruegel kernel: drbd0: Peer(S): 1:00000003:00000001:000001d9:0000002a:00
Feb  1 17:34:25 bruegel kernel: drbd0: Current Primary shall become sync TARGET! Aborting to prevent data corruption.
Feb  1 17:34:25 bruegel kernel: drbd0: drbd0_receiver [32666]: cstate WFReportParams --> StandAlone
Feb  1 17:34:25 bruegel kernel: drbd0: error receiving ReportParams, l: 72!
Feb  1 17:34:25 bruegel kernel: drbd0: worker terminated
Feb  1 17:34:25 bruegel kernel: drbd0: asender terminated
Feb  1 17:34:25 bruegel kernel: drbd0: drbd0_receiver [32666]: cstate StandAlone --> StandAlone
Feb  1 17:34:25 bruegel kernel: drbd0: Connection lost.
Feb  1 17:34:25 bruegel kernel: drbd0: receiver terminated

At this point, the statuses are:
bruegel cs:StandAlone st:Primary/Unknown ld:Consistent
pcvalid cs:WFConnection st:Secondary/Unknown ld:Consistent
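
The two status lines above can be split into fields with a short Python sketch. It assumes that a node in cs:StandAlone has given up on the connection and will not retry by itself, which is consistent with bruegel staying in that state until drbdsetup was run by hand. The function names are hypothetical.

```python
# The two status lines quoted above.
STATUS = [
    "bruegel cs:StandAlone st:Primary/Unknown ld:Consistent",
    "pcvalid cs:WFConnection st:Secondary/Unknown ld:Consistent",
]


def parse_status(line):
    """Split 'host cs:X st:Local/Peer ld:Y' into a dictionary."""
    host, *pairs = line.split()
    fields = dict(pair.split(":", 1) for pair in pairs)
    local_role, peer_role = fields["st"].split("/")
    return {
        "host": host,
        "cs": fields["cs"],
        "local_role": local_role,
        "peer_role": peer_role,
        "ld": fields["ld"],
    }


def standalone_hosts(lines):
    """Hosts that (by assumption) need a manual reconnect."""
    return [n["host"] for n in map(parse_status, lines) if n["cs"] == "StandAlone"]


for node in map(parse_status, STATUS):
    print(node)
print("needs manual reconnect:", standalone_hosts(STATUS))
```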


===========================================================================
First drbdsetup command on bruegel
drbdsetup /dev/drbd0 net IP_bruegel IP_pcvalid  C -d reconnect
===========================================================================

Logs on bruegel

Feb  1 18:20:24 bruegel kernel: drbd0: drbdsetup [897]: cstate StandAlone --> Unconnected
Feb  1 18:20:24 bruegel kernel: drbd0: drbd0_receiver [898]: cstate Unconnected --> WFConnection
Feb  1 18:20:24 bruegel kernel: drbd0: drbd0_receiver [898]: cstate WFConnection --> WFReportParams
Feb  1 18:20:24 bruegel kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Feb  1 18:20:24 bruegel kernel: drbd0: Connection established.
Feb  1 18:20:24 bruegel kernel: drbd0: I am(P): 1:00000003:00000001:000001d8:0000002c:10
Feb  1 18:20:24 bruegel kernel: drbd0: Peer(S): 1:00000003:00000001:000001d9:0000002a:00
Feb  1 18:20:24 bruegel kernel: drbd0: Current Primary shall become sync TARGET! Aborting to prevent data corruption.
Feb  1 18:20:24 bruegel kernel: drbd0: drbd0_receiver [898]: cstate WFReportParams --> StandAlone
Feb  1 18:20:24 bruegel kernel: drbd0: error receiving ReportParams, l: 72!
Feb  1 18:20:24 bruegel kernel: drbd0: worker terminated
Feb  1 18:20:24 bruegel kernel: drbd0: asender terminated
Feb  1 18:20:24 bruegel kernel: drbd0: drbd0_receiver [898]: cstate StandAlone --> StandAlone
Feb  1 18:20:24 bruegel kernel: drbd0: Connection lost.
Feb  1 18:20:24 bruegel kernel: drbd0: receiver terminated


Logs on pcvalid

Feb  1 18:20:24 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFConnection --> WFReportParams
Feb  1 18:20:24 pcvalid kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Feb  1 18:20:24 pcvalid kernel: drbd0: Connection established.
Feb  1 18:20:24 pcvalid kernel: drbd0: I am(S): 1:00000003:00000001:000001d9:0000002a:00
Feb  1 18:20:24 pcvalid kernel: drbd0: Peer(P): 1:00000003:00000001:000001d8:0000002c:10
Feb  1 18:20:24 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFReportParams --> WFBitMapS
Feb  1 18:20:24 pcvalid kernel: drbd0: meta connection shut down by peer.
Feb  1 18:20:24 pcvalid kernel: drbd0: drbd0_asender [8949]: cstate WFBitMapS --> NetworkFailure
Feb  1 18:20:24 pcvalid kernel: drbd0: asender terminated
Feb  1 18:20:24 pcvalid kernel: drbd0: sock_sendmsg returned -32
Feb  1 18:20:24 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate NetworkFailure --> BrokenPipe
Feb  1 18:20:24 pcvalid kernel: drbd0: short sent ReportBitMap size=4096 sent=160
Feb  1 18:20:24 pcvalid kernel: drbd0: Secondary/Unknown --> Secondary/Primary
Feb  1 18:20:24 pcvalid kernel: drbd0: sock was shut down by peer
Feb  1 18:20:24 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate BrokenPipe --> BrokenPipe
Feb  1 18:20:24 pcvalid kernel: drbd0: short read expecting header on sock: r=0
Feb  1 18:20:24 pcvalid kernel: drbd0: worker terminated
Feb  1 18:20:24 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate BrokenPipe --> Unconnected
Feb  1 18:20:24 pcvalid kernel: drbd0: Connection lost.
Feb  1 18:20:24 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate Unconnected --> WFConnection


At this point, the statuses are:
bruegel cs:StandAlone st:Primary/Unknown ld:Consistent
pcvalid cs:WFConnection st:Secondary/Unknown ld:Consistent


===========================================================================
Second drbdsetup command on bruegel
drbdsetup /dev/drbd0 net IP_bruegel IP_pcvalid  C -d reconnect
===========================================================================

Logs on bruegel

Feb  1 18:24:39 bruegel kernel: drbd0: drbdsetup [938]: cstate StandAlone --> Unconnected
Feb  1 18:24:39 bruegel kernel: drbd0: drbd0_receiver [939]: cstate Unconnected --> WFConnection
Feb  1 18:24:39 bruegel kernel: drbd0: drbd0_receiver [939]: cstate WFConnection --> WFReportParams
Feb  1 18:24:39 bruegel kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Feb  1 18:24:39 bruegel kernel: drbd0: Connection established.
Feb  1 18:24:39 bruegel kernel: drbd0: I am(P): 1:00000003:00000001:000001d9:0000002c:10
Feb  1 18:24:39 bruegel kernel: drbd0: Peer(S): 1:00000003:00000001:000001d9:0000002a:00
Feb  1 18:24:39 bruegel kernel: drbd0: drbd0_receiver [939]: cstate WFReportParams --> WFBitMapS
Feb  1 18:24:39 bruegel kernel: drbd0: Primary/Unknown --> Primary/Secondary
Feb  1 18:24:40 bruegel kernel: drbd0: drbd0_receiver [939]: cstate WFBitMapS --> SyncSource
Feb  1 18:24:40 bruegel kernel: drbd0: Resync started as SyncSource (need to sync 42016 KB [10504 bits set]).
Feb  1 18:25:27 bruegel kernel: drbd0: Resync done (total 47 sec; paused 0 sec; 892 K/sec)
Feb  1 18:25:27 bruegel kernel: drbd0: drbd0_worker [900]: cstate SyncSource --> Connected
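
As a quick sanity check on the resync figures above, the following Python sketch recomputes them from the two log lines. It only does arithmetic on the numbers printed in the log; the 4 KB per bitmap bit falls out of dividing 42016 KB by 10504 bits and is not stated anywhere in the message. The function name is hypothetical.

```python
import re

# The two resync lines from the bruegel log above, without the syslog prefix.
STARTED_LINE = "drbd0: Resync started as SyncSource (need to sync 42016 KB [10504 bits set])."
DONE_LINE = "drbd0: Resync done (total 47 sec; paused 0 sec; 892 K/sec)"

STARTED = re.compile(r"need to sync (\d+) KB \[(\d+) bits set\]")
DONE = re.compile(r"total (\d+) sec; paused (\d+) sec; (\d+) K/sec")


def resync_summary(started_line, done_line):
    """Derive KB per bitmap bit and the average rate from the log lines."""
    kb, bits = map(int, STARTED.search(started_line).groups())
    total, paused, reported = map(int, DONE.search(done_line).groups())
    return {
        "kb_per_bit": kb / bits,
        "computed_rate": kb / (total - paused),
        "reported_rate": reported,
    }


summary = resync_summary(STARTED_LINE, DONE_LINE)
print(summary)
```

The computed average comes out close to, but not exactly, the reported 892 K/sec; the difference is presumably due to the rounded 47 second total.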


Logs on pcvalid

Feb  1 18:24:39 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFConnection --> WFReportParams
Feb  1 18:24:39 pcvalid kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Feb  1 18:24:39 pcvalid kernel: drbd0: Connection established.
Feb  1 18:24:39 pcvalid kernel: drbd0: I am(S): 1:00000003:00000001:000001d9:0000002a:00
Feb  1 18:24:39 pcvalid kernel: drbd0: Peer(P): 1:00000003:00000001:000001d9:0000002c:10
Feb  1 18:24:39 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFReportParams --> WFBitMapT
Feb  1 18:24:39 pcvalid kernel: drbd0: Secondary/Unknown --> Secondary/Primary
Feb  1 18:24:40 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFBitMapT --> SyncTarget
Feb  1 18:24:40 pcvalid kernel: drbd0: Resync started as SyncTarget (need to sync 42016 KB [10504 bits set]).
Feb  1 18:25:27 pcvalid kernel: drbd0: Resync done (total 47 sec; paused 0 sec; 892 K/sec)
Feb  1 18:25:27 pcvalid kernel: drbd0: drbd0_worker [8950]: cstate SyncTarget --> Connected


At this point, the statuses are:
bruegel cs:Connected st:Primary/Secondary ld:Consistent
pcvalid cs:Connected st:Secondary/Primary ld:Consistent
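
One possible reading of the "I am"/"Peer" lines in the three attempts, offered as an assumption rather than something confirmed in this message: the first five colon-separated fields are compared left to right as hexadecimal numbers, and the node with the larger tuple becomes the sync source. Under that assumption, the fourth field on bruegel goes 1d7, 1d8, 1d9 across the attempts, and bruegel only wins the comparison (on the fifth field) at the third attempt. The Python sketch below replays that comparison; the function names are hypothetical, and the last field is left out because it looks like a pair of flags.

```python
# (time, counters reported by the Primary bruegel, counters reported by the
# Secondary pcvalid), copied from the "I am"/"Peer" log lines above.
ATTEMPTS = [
    ("17:34:25", "1:00000003:00000001:000001d7:0000002c:10",
                 "1:00000003:00000001:000001d9:0000002a:00"),
    ("18:20:24", "1:00000003:00000001:000001d8:0000002c:10",
                 "1:00000003:00000001:000001d9:0000002a:00"),
    ("18:24:39", "1:00000003:00000001:000001d9:0000002c:10",
                 "1:00000003:00000001:000001d9:0000002a:00"),
]


def counters(generation):
    """First five fields as integers; the trailing flags field is ignored."""
    return tuple(int(field, 16) for field in generation.split(":")[:5])


def sync_source(primary, secondary):
    """Assumed rule: the lexicographically larger counter tuple is the source."""
    p, s = counters(primary), counters(secondary)
    if p > s:
        return "primary"
    if s > p:
        return "secondary"
    return "tie"


results = [sync_source(p, s) for _time, p, s in ATTEMPTS]
for (time, _p, _s), winner in zip(ATTEMPTS, results):
    print(time, "sync source would be the", winner)
```

If this reading is right, the first two attempts would have made the current Primary the sync target, which matches the "Aborting to prevent data corruption" message, and the third attempt matches bruegel becoming SyncSource.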

-- 
==================================================================
| Régis Gras                 | http://www-ledss.ujf-grenoble.fr  |
| L.E.D.S.S.                 | mailto:Regis.Gras at ujf-grenoble.fr |
| 301, rue de la chimie      |         --------------            |
| DU BP 53                   | Tel 04 76 51 41 76                |
| 38041 Grenoble Cedex 9     | Fax 04 76 51 40 89                |
==================================================================



