[DRBD-user] heartbeat doesn't work with drbd

Regis Gras Regis.Gras at ujf-grenoble.fr
Tue Mar 13 14:54:26 CET 2007


I am using drbd version: 0.7.21 (api:79/proto:74) and heartbeat-2.0.7-1
on CentOS4 with kernel-2.6.9-42

drbd alone works fine.
"haresources" contains:
node1.ujf-grenoble.fr IPaddr::152.77.14.84 \
drbddisk::r0 Filesystem::/dev/drbd0::/drbd::ext3
(Currently, no service is started)
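
To make the start order explicit, here is a minimal Python sketch that splits the haresources entry above into its node and resource parts. It assumes the usual heartbeat v1 form "script::arg::arg", with resources started left to right. The function name parse_haresources_line is made up for illustration and is not part of heartbeat.

```python
def parse_haresources_line(text):
    """Split one logical haresources entry into (node, [(script, [args])])."""
    # Join backslash-continued lines into one logical line.
    logical = text.replace("\\\n", " ")
    tokens = logical.split()
    node, resources = tokens[0], tokens[1:]
    parsed = []
    for res in resources:
        # Each resource is "script" or "script::arg1::arg2...".
        script, *args = res.split("::")
        parsed.append((script, args))
    return node, parsed


# The entry from this post, including the backslash continuation.
entry = (
    "node1.ujf-grenoble.fr IPaddr::152.77.14.84 \\\n"
    "drbddisk::r0 Filesystem::/dev/drbd0::/drbd::ext3"
)
node, resources = parse_haresources_line(entry)
print("preferred node:", node)
for order, (script, args) in enumerate(resources, start=1):
    print(order, script, args)
```

Read this way, the entry asks heartbeat to bring up the IP address first, then make r0 primary through drbddisk, then mount /dev/drbd0 on /drbd as ext3.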

node1 is normally the master.
When node1 stops, node2 becomes master and the /drbd partition is mounted.

The trouble appears when node1 restarts.
On node2, in /var/log/ha-log, I found these messages:
ERROR: Both machines own our resources!
ERROR: Both machines own foreign resources!
And on each node, drbd becomes StandAlone Primary.

Searching with Google, I saw that this problem may be related to the firewall.
I disabled iptables, and now, when node1 restarts, drbd is not loaded ...

In fact, on node1, drbd is started and dies immediately.

In /var/log/messages, I found:

node1 drbd: Starting DRBD resources:
node1 kernel: drbd: initialised. Version: 0.7.21 (api:79/proto:74)
node1 kernel: drbd0: resync bitmap: bits=2411112 words=75348
node1 kernel: drbd0: size = 9418 MB (9644448 KB)
...........................................................
node1 kernel: drbd0: 0 KB marked out-of-sync by on disk bit-map.
...........................................................
node1 kernel: drbd0: drbd0_receiver [3574]: cstate Unconnected --> WFConnection
...........................................................
node1 drbd: WARN: stdin/stdout is not a TTY; using /dev/console
node1 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
node1 kernel: drbd0: Connection established.
node1 kernel: drbd0: I am(S): 1:00000003:00000030:0000008e:00000007:00
node1 kernel: drbd0: Peer(P): 1:00000003:00000030:0000008f:00000007:10
node1 kernel: drbd0: drbd0_receiver [3574]: cstate WFReportParams --> WFBitMapT
node1 kernel: drbd0: Secondary/Unknown --> Secondary/Primary
node1 rc: Starting drbd:  succeeded
...........................................................
node1 kernel: drbd0: drbd0_receiver [3574]: cstate WFBitMapT --> SyncTarget
node1 kernel: drbd0: Resync started as SyncTarget (need to sync 0 KB [0 bits set]).
node1 kernel: drbd0: Resync done (total 1 sec; paused 0 sec; 0 K/sec)
node1 kernel: drbd0: drbd0_receiver [3574]: cstate SyncTarget --> Connected
...........................................................
node1 kernel: drbd0: drbdsetup [3669]: cstate Connected --> Unconnected
node1 kernel: drbd0: drbd0_receiver [3574]: cstate Unconnected --> BrokenPipe
node1 kernel: drbd0: short read expecting header on sock: r=-512
node1 kernel: drbd0: worker terminated
node1 kernel: drbd0: asender terminated
node1 kernel: drbd0: drbd0_receiver [3574]: cstate BrokenPipe --> StandAlone
node1 kernel: drbd0: drbdsetup [3669]: cstate StandAlone --> Unconfigured
node1 kernel: drbd0: worker terminated
node1 kernel: drbd: module cleanup done.
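
The "I am(S)" and "Peer(P)" lines in the log above carry colon-separated counters that the two nodes exchange after the handshake. A minimal Python sketch for seeing which positions differ is given here. It does not assume any meaning for the individual fields and only compares them by position. The helper names parse_counters and differing_fields are hypothetical.

```python
def parse_counters(line):
    """Return (label, fields) from an 'I am(S): ...' or 'Peer(P): ...' log line."""
    # Keep only the part after the 'drbd0: ' device prefix, if present.
    payload = line.split("drbd0: ", 1)[-1]
    label, _, values = payload.partition(": ")
    return label, values.strip().split(":")


def differing_fields(a, b):
    """List (position, value_a, value_b) for every position where a and b differ."""
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


# The two lines as logged on node1 in this post.
mine = "node1 kernel: drbd0: I am(S): 1:00000003:00000030:0000008e:00000007:00"
peer = "node1 kernel: drbd0: Peer(P): 1:00000003:00000030:0000008f:00000007:10"

_, my_fields = parse_counters(mine)
_, peer_fields = parse_counters(peer)
for pos, a, b in differing_fields(my_fields, peer_fields):
    print(f"field {pos}: mine={a} peer={b}")
```

For these two lines, the only differences are in the fourth and the last field; all other counters are identical on both nodes.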


On node2 ....
node2 heartbeat: [3225]: info: remote resource transition completed.
node2 heartbeat: [3225]: info: remote resource transition completed.
node2 kernel: drbd0: drbd0_receiver [29210]: cstate WFConnection --> WFReportParams
node2 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
node2 kernel: drbd0: Connection established.
node2 kernel: drbd0: I am(P): 1:00000003:00000030:0000008f:00000007:10
node2 kernel: drbd0: Peer(S): 1:00000003:00000030:0000008e:00000007:00
node2 kernel: drbd0: drbd0_receiver [29210]: cstate WFReportParams --> WFBitMapS
node2 kernel: drbd0: Primary/Unknown --> Primary/Secondary
node2 kernel: drbd0: drbd0_receiver [29210]: cstate WFBitMapS --> SyncSource
node2 kernel: drbd0: Resync started as SyncSource (need to sync 0 KB [0 bits set]).
node2 kernel: drbd0: Resync done (total 1 sec; paused 0 sec; 0 K/sec)
node2 kernel: drbd0: drbd0_receiver [29210]: cstate SyncSource --> Connected
node2 kernel: drbd0: sock was shut down by peer
node2 kernel: drbd0: drbd0_receiver [29210]: cstate Connected --> BrokenPipe
node2 kernel: drbd0: short read expecting header on sock: r=0
node2 kernel: drbd0: meta connection shut down by peer.
node2 kernel: drbd0: asender terminated
node2 kernel: drbd0: worker terminated
node2 kernel: drbd0: drbd0_receiver [29210]: cstate BrokenPipe --> Unconnected
node2 kernel: drbd0: Connection lost.
node2 kernel: drbd0: drbd0_receiver [29210]: cstate Unconnected --> WFConnection
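
Since the interesting part of both logs is the sequence of cstate changes and which process triggered each one, here is a small Python sketch that pulls those transitions out of syslog lines. It assumes the line layout shown in this post ("host kernel: device: process [pid]: cstate OLD --> NEW") with wrapped lines already joined. The name cstate_transitions is hypothetical.

```python
import re

# Matches e.g. "node1 kernel: drbd0: drbdsetup [3669]: cstate Connected --> Unconnected"
CSTATE_RE = re.compile(
    r"^(?P<host>\S+) kernel: (?P<dev>\S+): (?P<proc>\S+) \[(?P<pid>\d+)\]: "
    r"cstate (?P<old>\w+) --> (?P<new>\w+)"
)


def cstate_transitions(lines):
    """Return (host, process, old_state, new_state) for each cstate line."""
    found = []
    for line in lines:
        m = CSTATE_RE.match(line.strip())
        if m:
            found.append((m["host"], m["proc"], m["old"], m["new"]))
    return found


# A few lines from the node1 log in this post.
sample = [
    "node1 kernel: drbd0: drbd0_receiver [3574]: cstate SyncTarget --> Connected",
    "node1 kernel: drbd0: drbdsetup [3669]: cstate Connected --> Unconnected",
    "node1 kernel: drbd0: short read expecting header on sock: r=-512",
    "node1 kernel: drbd0: drbd0_receiver [3574]: cstate BrokenPipe --> StandAlone",
    "node1 kernel: drbd0: drbdsetup [3669]: cstate StandAlone --> Unconfigured",
]
for host, proc, old, new in cstate_transitions(sample):
    print(f"{host} {proc}: {old} -> {new}")
```

On the node1 lines, this shows that the Connected --> Unconnected and StandAlone --> Unconfigured steps are logged by drbdsetup [3669] rather than by the receiver thread, while node2 only reports that the socket was shut down by the peer.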

Could someone help me?

Sincerely

-- 
==================================================================
| Régis Gras                 | http://www-ledss.ujf-grenoble.fr  |
|   D.C.M.                   | mailto:Regis.Gras at ujf-grenoble.fr |
| 301, rue de la chimie      |         --------------            |
| DU BP 53                   | Tel 04 76 51 41 76                |
| 38041 Grenoble Cedex 9     | Fax 04 76 51 40 89                |
==================================================================



