I want to simulate a crash on the main server. To do that, I disconnect the
network and then stop drbd on the main server.
At this point, the Secondary server becomes Primary without any problem.

Later, I reconnect the network on the main server and start drbd.
At this point, the two computers do not synchronize.
The old main server becomes Secondary, Waiting for connection,
and the current Primary becomes Primary, StandAlone.

The only way I have found to synchronize the two computers is to run
the following command twice on the current Primary:

drbdsetup /dev/drbd0 net IP_actual_Primary IP_actual_Secondary C -d reconnect

In the logs, I find a great number of error messages.
Could someone help me solve this problem?

Best regards.

The main server is named "pcvalid", and the other is "bruegel".
I am using drbd-0.7.15-1 and kernel-2.6.14-1.1653_FC4 on Fedora Core 4.

===========================================================================
Reconnect network and start drbd on pcvalid
(At this time, bruegel is Primary, Waiting for connection)
===========================================================================
Logs on pcvalid:
Feb 1 17:34:25 pcvalid kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Feb 1 17:34:25 pcvalid kernel: drbd0: Connection established.
Feb 1 17:34:25 pcvalid kernel: drbd0: I am(S): 1:00000003:00000001:000001d9:0000002a:00
Feb 1 17:34:25 pcvalid kernel: drbd0: Peer(P): 1:00000003:00000001:000001d7:0000002c:10
Feb 1 17:34:25 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFReportParams --> WFBitMapS
Feb 1 17:34:25 pcvalid kernel: drbd0: sock_sendmsg returned -32
Feb 1 17:34:25 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFBitMapS --> BrokenPipe
Feb 1 17:34:25 pcvalid kernel: drbd0: short sent ReportBitMap size=4096 sent=160
Feb 1 17:34:25 pcvalid kernel: drbd0: Secondary/Unknown --> Secondary/Primary
Feb 1 17:34:25 pcvalid kernel: drbd0: meta connection shut down by peer.
Feb 1 17:34:25 pcvalid kernel: drbd0: asender terminated
Feb 1 17:34:25 pcvalid kernel: drbd0: sock was shut down by peer
Feb 1 17:34:25 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate BrokenPipe --> BrokenPipe
Feb 1 17:34:26 pcvalid kernel: drbd0: short read expecting header on sock: r=0
Feb 1 17:34:26 pcvalid kernel: drbd0: worker terminated
Feb 1 17:34:26 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate BrokenPipe --> Unconnected
Feb 1 17:34:26 pcvalid kernel: drbd0: Connection lost.
Feb 1 17:34:26 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate Unconnected --> WFConnection

Logs on bruegel:
Feb 1 17:34:25 bruegel kernel: drbd0: drbd0_receiver [32666]: cstate WFConnection --> WFReportParams
Feb 1 17:34:25 bruegel kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Feb 1 17:34:25 bruegel kernel: drbd0: Connection established.
Feb 1 17:34:25 bruegel kernel: drbd0: I am(P): 1:00000003:00000001:000001d7:0000002c:10
Feb 1 17:34:25 bruegel kernel: drbd0: Peer(S): 1:00000003:00000001:000001d9:0000002a:00
Feb 1 17:34:25 bruegel kernel: drbd0: Current Primary shall become sync TARGET! Aborting to prevent data corruption.
Feb 1 17:34:25 bruegel kernel: drbd0: drbd0_receiver [32666]: cstate WFReportParams --> StandAlone
Feb 1 17:34:25 bruegel kernel: drbd0: error receiving ReportParams, l: 72!
Feb 1 17:34:25 bruegel kernel: drbd0: worker terminated
Feb 1 17:34:25 bruegel kernel: drbd0: asender terminated
Feb 1 17:34:25 bruegel kernel: drbd0: drbd0_receiver [32666]: cstate StandAlone --> StandAlone
Feb 1 17:34:25 bruegel kernel: drbd0: Connection lost.
Feb 1 17:34:25 bruegel kernel: drbd0: receiver terminated

At this point, the statuses are:
bruegel  cs:StandAlone    st:Primary/Unknown    ld:Consistent
pcvalid  cs:WFConnection  st:Secondary/Unknown  ld:Consistent

===========================================================================
First command drbdsetup on bruegel
drbdsetup /dev/drbd0 net IP_bruegel IP_pcvalid C -d reconnect
===========================================================================
Logs on bruegel:
Feb 1 18:20:24 bruegel kernel: drbd0: drbdsetup [897]: cstate StandAlone --> Unconnected
Feb 1 18:20:24 bruegel kernel: drbd0: drbd0_receiver [898]: cstate Unconnected --> WFConnection
Feb 1 18:20:24 bruegel kernel: drbd0: drbd0_receiver [898]: cstate WFConnection --> WFReportParams
Feb 1 18:20:24 bruegel kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Feb 1 18:20:24 bruegel kernel: drbd0: Connection established.
Feb 1 18:20:24 bruegel kernel: drbd0: I am(P): 1:00000003:00000001:000001d8:0000002c:10
Feb 1 18:20:24 bruegel kernel: drbd0: Peer(S): 1:00000003:00000001:000001d9:0000002a:00
Feb 1 18:20:24 bruegel kernel: drbd0: Current Primary shall become sync TARGET! Aborting to prevent data corruption.
Feb 1 18:20:24 bruegel kernel: drbd0: drbd0_receiver [898]: cstate WFReportParams --> StandAlone
Feb 1 18:20:24 bruegel kernel: drbd0: error receiving ReportParams, l: 72!
Feb 1 18:20:24 bruegel kernel: drbd0: worker terminated
Feb 1 18:20:24 bruegel kernel: drbd0: asender terminated
Feb 1 18:20:24 bruegel kernel: drbd0: drbd0_receiver [898]: cstate StandAlone --> StandAlone
Feb 1 18:20:24 bruegel kernel: drbd0: Connection lost.
Feb 1 18:20:24 bruegel kernel: drbd0: receiver terminated

Logs on pcvalid:
Feb 1 18:20:24 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFConnection --> WFReportParams
Feb 1 18:20:24 pcvalid kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Feb 1 18:20:24 pcvalid kernel: drbd0: Connection established.
Feb 1 18:20:24 pcvalid kernel: drbd0: I am(S): 1:00000003:00000001:000001d9:0000002a:00
Feb 1 18:20:24 pcvalid kernel: drbd0: Peer(P): 1:00000003:00000001:000001d8:0000002c:10
Feb 1 18:20:24 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFReportParams --> WFBitMapS
Feb 1 18:20:24 pcvalid kernel: drbd0: meta connection shut down by peer.
Feb 1 18:20:24 pcvalid kernel: drbd0: drbd0_asender [8949]: cstate WFBitMapS --> NetworkFailure
Feb 1 18:20:24 pcvalid kernel: drbd0: asender terminated
Feb 1 18:20:24 pcvalid kernel: drbd0: sock_sendmsg returned -32
Feb 1 18:20:24 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate NetworkFailure --> BrokenPipe
Feb 1 18:20:24 pcvalid kernel: drbd0: short sent ReportBitMap size=4096 sent=160
Feb 1 18:20:24 pcvalid kernel: drbd0: Secondary/Unknown --> Secondary/Primary
Feb 1 18:20:24 pcvalid kernel: drbd0: sock was shut down by peer
Feb 1 18:20:24 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate BrokenPipe --> BrokenPipe
Feb 1 18:20:24 pcvalid kernel: drbd0: short read expecting header on sock: r=0
Feb 1 18:20:24 pcvalid kernel: drbd0: worker terminated
Feb 1 18:20:24 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate BrokenPipe --> Unconnected
Feb 1 18:20:24 pcvalid kernel: drbd0: Connection lost.
Feb 1 18:20:24 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate Unconnected --> WFConnection

At this point, the statuses are:
bruegel  cs:StandAlone    st:Primary/Unknown    ld:Consistent
pcvalid  cs:WFConnection  st:Secondary/Unknown  ld:Consistent

===========================================================================
Second command drbdsetup on bruegel
drbdsetup /dev/drbd0 net IP_bruegel IP_pcvalid C -d reconnect
===========================================================================
Logs on bruegel:
Feb 1 18:24:39 bruegel kernel: drbd0: drbdsetup [938]: cstate StandAlone --> Unconnected
Feb 1 18:24:39 bruegel kernel: drbd0: drbd0_receiver [939]: cstate Unconnected --> WFConnection
Feb 1 18:24:39 bruegel kernel: drbd0: drbd0_receiver [939]: cstate WFConnection --> WFReportParams
Feb 1 18:24:39 bruegel kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Feb 1 18:24:39 bruegel kernel: drbd0: Connection established.
Feb 1 18:24:39 bruegel kernel: drbd0: I am(P): 1:00000003:00000001:000001d9:0000002c:10
Feb 1 18:24:39 bruegel kernel: drbd0: Peer(S): 1:00000003:00000001:000001d9:0000002a:00
Feb 1 18:24:39 bruegel kernel: drbd0: drbd0_receiver [939]: cstate WFReportParams --> WFBitMapS
Feb 1 18:24:39 bruegel kernel: drbd0: Primary/Unknown --> Primary/Secondary
Feb 1 18:24:40 bruegel kernel: drbd0: drbd0_receiver [939]: cstate WFBitMapS --> SyncSource
Feb 1 18:24:40 bruegel kernel: drbd0: Resync started as SyncSource (need to sync 42016 KB [10504 bits set]).
Feb 1 18:25:27 bruegel kernel: drbd0: Resync done (total 47 sec; paused 0 sec; 892 K/sec)
Feb 1 18:25:27 bruegel kernel: drbd0: drbd0_worker [900]: cstate SyncSource --> Connected

Logs on pcvalid:
Feb 1 18:24:39 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFConnection --> WFReportParams
Feb 1 18:24:39 pcvalid kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Feb 1 18:24:39 pcvalid kernel: drbd0: Connection established.
Feb 1 18:24:39 pcvalid kernel: drbd0: I am(S): 1:00000003:00000001:000001d9:0000002a:00
Feb 1 18:24:39 pcvalid kernel: drbd0: Peer(P): 1:00000003:00000001:000001d9:0000002c:10
Feb 1 18:24:39 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFReportParams --> WFBitMapT
Feb 1 18:24:39 pcvalid kernel: drbd0: Secondary/Unknown --> Secondary/Primary
Feb 1 18:24:40 pcvalid kernel: drbd0: drbd0_receiver [8478]: cstate WFBitMapT --> SyncTarget
Feb 1 18:24:40 pcvalid kernel: drbd0: Resync started as SyncTarget (need to sync 42016 KB [10504 bits set]).
Feb 1 18:25:27 pcvalid kernel: drbd0: Resync done (total 47 sec; paused 0 sec; 892 K/sec)
Feb 1 18:25:27 pcvalid kernel: drbd0: drbd0_worker [8950]: cstate SyncTarget --> Connected

At this point, the statuses are:
bruegel  cs:Connected  st:Primary/Secondary    ld:Consistent
pcvalid  cs:Connected  st:Secondary/Secondary  ld:Consistent

--
==================================================================
| Régis Gras             | http://www-ledss.ujf-grenoble.fr      |
| L.E.D.S.S.             | mailto:Regis.Gras at ujf-grenoble.fr  |
| 301, rue de la chimie  | --------------                        |
| DU BP 53               | Tel 04 76 51 41 76                    |
| 38041 Grenoble Cedex 9 | Fax 04 76 51 40 89                    |
==================================================================
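
The "I am(...)" and "Peer(...)" lines in the logs above carry six colon-separated fields per node. As a reading aid, here is a minimal Python sketch that extracts those fields and reports the positions at which the two nodes disagree. The logs do not name the fields, so the sketch addresses them by position only; the helper names (`parse_counters`, `differing_positions`) are hypothetical and not part of DRBD.

```python
import re

# Matches the "I am(P): ..." and "Peer(S): ..." lines from the kernel log.
# Field meanings are NOT given in the logs, so fields are kept as raw strings.
LINE_RE = re.compile(r"drbd\d+: (I am|Peer)\(([PS])\): ([0-9a-fA-F:]+)")


def parse_counters(line):
    """Return (who, role, fields) for a counter line, or None for other lines."""
    m = LINE_RE.search(line)
    if m is None:
        return None
    who, role, raw = m.groups()
    return who, role, raw.split(":")


def differing_positions(local, peer):
    """0-based positions at which the two field lists differ."""
    return [i for i, (a, b) in enumerate(zip(local, peer)) if a != b]


# Lines copied from the bruegel logs: first (failed) attempt and final (successful) one.
failed = [
    "Feb 1 17:34:25 bruegel kernel: drbd0: I am(P): 1:00000003:00000001:000001d7:0000002c:10",
    "Feb 1 17:34:25 bruegel kernel: drbd0: Peer(S): 1:00000003:00000001:000001d9:0000002a:00",
]
ok = [
    "Feb 1 18:24:39 bruegel kernel: drbd0: I am(P): 1:00000003:00000001:000001d9:0000002c:10",
    "Feb 1 18:24:39 bruegel kernel: drbd0: Peer(S): 1:00000003:00000001:000001d9:0000002a:00",
]

for label, (mine, theirs) in (("failed", failed), ("ok", ok)):
    local_fields = parse_counters(mine)[2]
    peer_fields = parse_counters(theirs)[2]
    print(label, differing_positions(local_fields, peer_fields))
```

Run on these lines, the failed attempt differs at positions 3, 4 and 5, and the successful one only at positions 4 and 5. In both failed attempts bruegel's value at position 3 (000001d7, then 000001d8) is lower than pcvalid's (000001d9); the resync starts once both show 000001d9. That fits the "Current Primary shall become sync TARGET" abort, but nothing in this message confirms it as the cause.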