Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Hi all:
I countered a problem when configuring DRBD+HTTP+Heartbeat.
I set up a cluster of two computers: node1 (primary) and node2 (secondary), sharing a drbd device on the net. Each machine just have one Nic for drbd and heartbeats.
Step 1: startup drbd and heartbeat on node1(the heartbeat will mount drbd device in local directory tree and start http service) and node2. It’s ok. I can access virtual IP through IE. And see the page on node1.
Step 2: reboot node1. Then the node2 becomes the new primary and heartbeat startup http service on node2. node2 now provide httpd service.
Step 3: When the node1 comes up again. Start drbd and heartbeat on it.
But run “cat /proc/drbd” on node1, it shows:
At the same time, run “cat /proc/drbd” on node2, it shows:
version: 0.7.18 (api:78/proto:74)
SVN Revision: 2176 build by root at secondary, 2007-04-23 17:18:34
0: cs:StandAlone st:Primary/Unknown ld:Consistent
ns:8 nr:8 dw:68 dr:489 al:0 bm:1 lo:0 pe:0 ua:0 ap:0
/var/log/message in node2:
Apr 27 13:37:36 mouse kernel: drbd0: drbd0_receiver [6530]: cstate WFConnection --> WFReportParams
Apr 27 13:37:36 mouse kernel: drbd0: Handshake successful: DRBD Network Protocol version 74
Apr 27 13:37:36 mouse kernel: drbd0: Connection established.
Apr 27 13:37:36 mouse kernel: drbd0: I am(P): 1:00000002:00000001:0000004b:00000013:10
Apr 27 13:37:36 mouse kernel: drbd0: Peer(S): 1:00000002:00000001:0000004d:00000012:00
Apr 27 13:37:36 mouse kernel: drbd0: Current Primary shall become sync TARGET! Aborting to prevent data corruption.
Apr 27 13:37:36 mouse kernel: drbd0: drbd0_receiver [6530]: cstate WFReportParams --> StandAlone
Apr 27 13:37:36 mouse kernel: drbd0: error receiving ReportParams, l: 72!
Apr 27 13:37:36 mouse kernel: drbd0: worker terminated
Apr 27 13:37:36 mouse kernel: drbd0: asender terminated
Apr 27 13:37:36 mouse kernel: drbd0: drbd0_receiver [6530]: cstate StandAlone --> StandAlone
Apr 27 13:37:36 mouse kernel: drbd0: Connection lost.
Apr 27 13:37:36 mouse kernel: drbd0: receiver terminated
Step4: run “drbdadm adjust all” on node2, everything is ok.
In my opinion, without step 4, everything should ok: the failed node comes up and run as secondary in cluster.
I wonder if there is something wrong with my configuration. How can I get these two machines reconnect without step 4?
Thanks...
Best regards
Selina Sun
SW2
ZyXEL Communications(Wuxi)Corp.
Tel: +86-510-88080888 ext. 15516
Email: selina.sun at zyxel.cn <mailto:selina.sun at zyxel.cn>
Did you check www.zyxel.cn <http://www.zyxel.cn/> today?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20070427/7b76b208/attachment.htm>