I am using drbd version 0.7.21 (api:79/proto:74) and heartbeat-2.0.7-1 on CentOS4 with kernel-2.6.9-42. drbd alone works fine.

"haresources" contains:

    node1.ujf-grenoble.fr IPaddr::152.77.14.84 \
        drbddisk::r0 Filesystem::/dev/drbd0::/drbd::ext3

(Actually, no service is started.)

node1 is normally the master. When node1 stops, node2 becomes master, and the partition /drbd is mounted. The trouble appears when node1 restarts. On node2, in /var/log/ha-log, I found these messages:

    ERROR: Both machines own our resources!
    ERROR: Both machines own foreign resources!

And on each node, drbd becomes StandAlone Primary.

With Google, I saw that this problem may be related to the firewall. I removed iptables, and now, when node1 restarts, drbd is not loaded ... In fact, on node1, drbd is started and dies immediately. In /var/log/messages, I found:

    node1 drbd: Starting DRBD resources:
    node1 kernel: drbd: initialised. Version: 0.7.21 (api:79/proto:74)
    node1 kernel: drbd0: resync bitmap: bits=2411112 words=75348
    node1 kernel: drbd0: size = 9418 MB (9644448 KB)
    ...........................................................
    node1 kernel: drbd0: 0 KB marked out-of-sync by on disk bit-map.
    ...........................................................
    node1 kernel: drbd0: drbd0_receiver [3574]: cstate Unconnected --> WFConnection
    ...........................................................
    node1 drbd: WARN: stdin/stdout is not a TTY; using /dev/console
    node1 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
    node1 kernel: drbd0: Connection established.
    node1 kernel: drbd0: I am(S): 1:00000003:00000030:0000008e:00000007:00
    node1 kernel: drbd0: Peer(P): 1:00000003:00000030:0000008f:00000007:10
    node1 kernel: drbd0: drbd0_receiver [3574]: cstate WFReportParams --> WFBitMapT
    node1 kernel: drbd0: Secondary/Unknown --> Secondary/Primary
    node1 rc: Starting drbd: succeeded
    ...........................................................
    node1 kernel: drbd0: drbd0_receiver [3574]: cstate WFBitMapT --> SyncTarget
    node1 kernel: drbd0: Resync started as SyncTarget (need to sync 0 KB [0 bits set]).
    node1 kernel: drbd0: Resync done (total 1 sec; paused 0 sec; 0 K/sec)
    node1 kernel: drbd0: drbd0_receiver [3574]: cstate SyncTarget --> Connected
    ...........................................................
    node1 kernel: drbd0: drbdsetup [3669]: cstate Connected --> Unconnected
    node1 kernel: drbd0: drbd0_receiver [3574]: cstate Unconnected --> BrokenPipe
    node1 kernel: drbd0: short read expecting header on sock: r=-512
    node1 kernel: drbd0: worker terminated
    node1 kernel: drbd0: asender terminated
    node1 kernel: drbd0: drbd0_receiver [3574]: cstate BrokenPipe --> StandAlone
    node1 kernel: drbd0: drbdsetup [3669]: cstate StandAlone --> Unconfigured
    node1 kernel: drbd0: worker terminated
    node1 kernel: drbd: module cleanup done.

On node2 ....

    node2 heartbeat: [3225]: info: remote resource transition completed.
    node2 heartbeat: [3225]: info: remote resource transition completed.
    node2 kernel: drbd0: drbd0_receiver [29210]: cstate WFConnection --> WFReportParams
    node2 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
    node2 kernel: drbd0: Connection established.
    node2 kernel: drbd0: I am(P): 1:00000003:00000030:0000008f:00000007:10
    node2 kernel: drbd0: Peer(S): 1:00000003:00000030:0000008e:00000007:00
    node2 kernel: drbd0: drbd0_receiver [29210]: cstate WFReportParams --> WFBitMapS
    node2 kernel: drbd0: Primary/Unknown --> Primary/Secondary
    node2 kernel: drbd0: drbd0_receiver [29210]: cstate WFBitMapS --> SyncSource
    node2 kernel: drbd0: Resync started as SyncSource (need to sync 0 KB [0 bits set]).
    node2 kernel: drbd0: Resync done (total 1 sec; paused 0 sec; 0 K/sec)
    node2 kernel: drbd0: drbd0_receiver [29210]: cstate SyncSource --> Connected
    node2 kernel: drbd0: sock was shut down by peer
    node2 kernel: drbd0: drbd0_receiver [29210]: cstate Connected --> BrokenPipe
    node2 kernel: drbd0: short read expecting header on sock: r=0
    node2 kernel: drbd0: meta connection shut down by peer.
    node2 kernel: drbd0: asender terminated
    node2 kernel: drbd0: worker terminated
    node2 kernel: drbd0: drbd0_receiver [29210]: cstate BrokenPipe --> Unconnected
    node2 kernel: drbd0: Connection lost.
    node2 kernel: drbd0: drbd0_receiver [29210]: cstate Unconnected --> WFConnection

Could someone help me?

Sincerely

--
==================================================================
| Régis Gras             | http://www-ledss.ujf-grenoble.fr      |
| D.C.M.                 | mailto:Regis.Gras at ujf-grenoble.fr |
| 301, rue de la chimie  | --------------                        |
| DU BP 53               | Tel 04 76 51 41 76                    |
| 38041 Grenoble Cedex 9 | Fax 04 76 51 40 89                    |
==================================================================
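
For readers following the kernel log excerpts in this message, the connection-state history is easier to see once the "cstate A --> B" lines are pulled out on their own. The following is only a minimal Python sketch, not part of DRBD or heartbeat. The function name cstate_transitions and the regular expression are illustrative assumptions, and the pattern assumes lines in the trimmed form quoted in this message (host name first, no syslog timestamp). The sample lines are copied from the node1 excerpt.

```python
import re

# Assumed line shape, as quoted in this message (timestamp already stripped):
# "node1 kernel: drbd0: drbd0_receiver [3574]: cstate Unconnected --> WFConnection"
CSTATE_RE = re.compile(
    r"^(?P<host>\S+) kernel: (?P<dev>drbd\d+): (?P<proc>\S+) \[(?P<pid>\d+)\]: "
    r"cstate (?P<old>\w+) --> (?P<new>\w+)"
)


def cstate_transitions(lines):
    """Yield (host, process, old_state, new_state) for each cstate line."""
    for line in lines:
        m = CSTATE_RE.match(line.strip())
        if m:
            yield m["host"], m["proc"], m["old"], m["new"]


# Sample taken from the node1 log excerpt.
SAMPLE = """\
node1 kernel: drbd0: drbd0_receiver [3574]: cstate SyncTarget --> Connected
node1 kernel: drbd0: drbdsetup [3669]: cstate Connected --> Unconnected
node1 kernel: drbd0: drbd0_receiver [3574]: cstate Unconnected --> BrokenPipe
node1 kernel: drbd0: short read expecting header on sock: r=-512
node1 kernel: drbd0: drbd0_receiver [3574]: cstate BrokenPipe --> StandAlone
node1 kernel: drbd0: drbdsetup [3669]: cstate StandAlone --> Unconfigured
""".splitlines()

if __name__ == "__main__":
    for host, proc, old, new in cstate_transitions(SAMPLE):
        print(f"{host} {proc:15s} {old} -> {new}")
```

Run over the node1 excerpt, this lists which process logged each transition. In the lines quoted here, the step from Connected to Unconnected is logged by drbdsetup [3669] rather than by the receiver thread, which may help narrow down what is tearing the connection down on node1.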