[DRBD-user] heartbeat doesn't work with drbd
Regis Gras
Regis.Gras at ujf-grenoble.fr
Tue Mar 13 14:54:26 CET 2007
I am using drbd version: 0.7.21 (api:79/proto:74) and heartbeat-2.0.7-1
on CentOS4 with kernel-2.6.9-42
drbd alone works fine.
"haresources" contains
node1.ujf-grenoble.fr IPaddr::152.77.14.84 \
drbddisk::r0 Filesystem::/dev/drbd0::/drbd::ext3
(Currently, no service is started.)
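To make the structure of that haresources entry explicit, here is a minimal sketch in Python that splits it into the preferred node and its resources. The function name `parse_haresources_line` is hypothetical and not part of heartbeat; the sketch only assumes the usual haresources conventions, namely that a trailing backslash continues the entry and that `::` separates a resource script from its arguments.

```python
def parse_haresources_line(entry):
    """Split one haresources entry into (node, [(script, [args]), ...]).

    Hypothetical helper for illustration only; heartbeat does its own parsing.
    """
    # A backslash at the end of a line continues the entry on the next line.
    logical = entry.replace("\\\n", " ")
    tokens = logical.split()
    node, resources = tokens[0], tokens[1:]
    parsed = []
    for res in resources:
        # "script::arg1::arg2" -> script name plus its argument list
        script, *args = res.split("::")
        parsed.append((script, args))
    return node, parsed


entry = (
    "node1.ujf-grenoble.fr IPaddr::152.77.14.84 \\\n"
    "    drbddisk::r0 Filesystem::/dev/drbd0::/drbd::ext3"
)
node, resources = parse_haresources_line(entry)
print(node)
for script, args in resources:
    print(script, args)
```

Read this way, the entry names node1 as the preferred owner of three resources: the service IP address, the drbd resource r0 (via drbddisk), and the ext3 filesystem on /dev/drbd0 mounted at /drbd.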
node1 is normally the master.
When node1 stops, node2 becomes master, and the /drbd partition is mounted.
The trouble appears when node1 restarts.
On node2, in /var/log/ha-log, I found these messages:
ERROR: Both machines own our resources!
ERROR: Both machines own foreign resources!
And on each node, drbd becomes StandAlone Primary.
From a Google search, I saw that this problem may be related to the firewall.
I disabled iptables, and now, when node1 restarts, drbd is not loaded ...
In fact, on node1, drbd is started and dies immediately.
In /var/log/messages, I found:
node1 drbd: Starting DRBD resources:
node1 kernel: drbd: initialised. Version: 0.7.21 (api:79/proto:74)
node1 kernel: drbd0: resync bitmap: bits=2411112 words=75348
node1 kernel: drbd0: size = 9418 MB (9644448 KB)
...........................................................
node1 kernel: drbd0: 0 KB marked out-of-sync by on disk bit-map.
...........................................................
node1 kernel: drbd0: drbd0_receiver [3574]: cstate Unconnected --> WFConnection
...........................................................
node1 drbd: WARN: stdin/stdout is not a TTY; using /dev/console
node1 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
node1 kernel: drbd0: Connection established.
node1 kernel: drbd0: I am(S): 1:00000003:00000030:0000008e:00000007:00
node1 kernel: drbd0: Peer(P): 1:00000003:00000030:0000008f:00000007:10
node1 kernel: drbd0: drbd0_receiver [3574]: cstate WFReportParams --> WFBitMapT
node1 kernel: drbd0: Secondary/Unknown --> Secondary/Primary
node1 rc: Starting drbd: succeeded
...........................................................
node1 kernel: drbd0: drbd0_receiver [3574]: cstate WFBitMapT --> SyncTarget
node1 kernel: drbd0: Resync started as SyncTarget (need to sync 0 KB [0 bits set]).
node1 kernel: drbd0: Resync done (total 1 sec; paused 0 sec; 0 K/sec)
node1 kernel: drbd0: drbd0_receiver [3574]: cstate SyncTarget --> Connected
...........................................................
node1 kernel: drbd0: drbdsetup [3669]: cstate Connected --> Unconnected
node1 kernel: drbd0: drbd0_receiver [3574]: cstate Unconnected --> BrokenPipe
node1 kernel: drbd0: short read expecting header on sock: r=-512
node1 kernel: drbd0: worker terminated
node1 kernel: drbd0: asender terminated
node1 kernel: drbd0: drbd0_receiver [3574]: cstate BrokenPipe --> StandAlone
node1 kernel: drbd0: drbdsetup [3669]: cstate StandAlone --> Unconfigured
node1 kernel: drbd0: worker terminated
node1 kernel: drbd: module cleanup done.
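The connection-state changes in a log like this are easier to follow when they are pulled out together with the process that triggered each one. The following is a minimal sketch in Python, assuming the kernel message layout shown above (`host kernel: drbdN: process [pid]: cstate Old --> New`) and that lines wrapped by the mail client have been rejoined. The names `CSTATE_RE` and `cstate_transitions` are hypothetical, not part of any drbd tool.

```python
import re

# Matches lines such as:
#   node1 kernel: drbd0: drbdsetup [3669]: cstate Connected --> Unconnected
CSTATE_RE = re.compile(
    r"(?P<host>\S+) kernel: (?P<dev>drbd\d+): (?P<actor>\S+) \[(?P<pid>\d+)\]: "
    r"cstate (?P<old>\w+) --> (?P<new>\w+)"
)


def cstate_transitions(lines):
    """Return (actor, old_state, new_state) for every cstate line, in order."""
    found = []
    for line in lines:
        m = CSTATE_RE.search(line)
        if m:
            found.append((m["actor"], m["old"], m["new"]))
    return found


# Sample lines copied from the end of the node1 log, with wrapped lines rejoined.
sample = [
    "node1 kernel: drbd0: drbd0_receiver [3574]: cstate SyncTarget --> Connected",
    "node1 kernel: drbd0: drbdsetup [3669]: cstate Connected --> Unconnected",
    "node1 kernel: drbd0: drbd0_receiver [3574]: cstate Unconnected --> BrokenPipe",
    "node1 kernel: drbd0: short read expecting header on sock: r=-512",
    "node1 kernel: drbd0: drbd0_receiver [3574]: cstate BrokenPipe --> StandAlone",
    "node1 kernel: drbd0: drbdsetup [3669]: cstate StandAlone --> Unconfigured",
]

transitions = cstate_transitions(sample)
# Keep only the transitions initiated by the drbdsetup command.
by_drbdsetup = [t for t in transitions if t[0] == "drbdsetup"]
for actor, old, new in transitions:
    print(f"{actor}: {old} --> {new}")
```

On these sample lines, the filter shows that the Connected --> Unconnected and StandAlone --> Unconfigured changes are logged against drbdsetup [3669] rather than against the receiver thread.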
On node2 ....
node2 heartbeat: [3225]: info: remote resource transition completed.
node2 heartbeat: [3225]: info: remote resource transition completed.
node2 kernel: drbd0: drbd0_receiver [29210]: cstate WFConnection --> WFReportParams
node2 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
node2 kernel: drbd0: Connection established.
node2 kernel: drbd0: I am(P): 1:00000003:00000030:0000008f:00000007:10
node2 kernel: drbd0: Peer(S): 1:00000003:00000030:0000008e:00000007:00
node2 kernel: drbd0: drbd0_receiver [29210]: cstate WFReportParams --> WFBitMapS
node2 kernel: drbd0: Primary/Unknown --> Primary/Secondary
node2 kernel: drbd0: drbd0_receiver [29210]: cstate WFBitMapS --> SyncSource
node2 kernel: drbd0: Resync started as SyncSource (need to sync 0 KB [0 bits set]).
node2 kernel: drbd0: Resync done (total 1 sec; paused 0 sec; 0 K/sec)
node2 kernel: drbd0: drbd0_receiver [29210]: cstate SyncSource --> Connected
node2 kernel: drbd0: sock was shut down by peer
node2 kernel: drbd0: drbd0_receiver [29210]: cstate Connected --> BrokenPipe
node2 kernel: drbd0: short read expecting header on sock: r=0
node2 kernel: drbd0: meta connection shut down by peer.
node2 kernel: drbd0: asender terminated
node2 kernel: drbd0: worker terminated
node2 kernel: drbd0: drbd0_receiver [29210]: cstate BrokenPipe --> Unconnected
node2 kernel: drbd0: Connection lost.
node2 kernel: drbd0: drbd0_receiver [29210]: cstate Unconnected --> WFConnection
Could someone help me?
Sincerely
--
==================================================================
| Régis Gras | http://www-ledss.ujf-grenoble.fr |
| D.C.M. | mailto:Regis.Gras at ujf-grenoble.fr |
| 301, rue de la chimie | -------------- |
| DU BP 53 | Tel 04 76 51 41 76 |
| 38041 Grenoble Cedex 9 | Fax 04 76 51 40 89 |
==================================================================