you mean like this?<br><br><span class="q">lab-test-01 <a onclick="return top.js.OpenExtLink(window,event,this)" href="http://192.168.10.218/" target="_blank">192.168.10.218</a> drbddisk::r0 Filesystem::/dev/drbd0::/mysql::ext3 drbddisk::r1 Filesystem::/dev/drbd1::/data::ext3
</span><br><br>I'll do this and run it again, and post the debug. the weird thing is the debug says it releases the IP resource, but it never actually does.<br>it says "success" "success" but doesn't actually do anything.
<br>here's a portion of the ha-log:<br><br>ResourceManager[32348]: 2007/06/13_12:45:08 info: Running /etc/ha.d/resource.d/Filesystem /dev/drbd1 /data ext3 stop<br>Filesystem[32683]: 2007/06/13_12:45:08 INFO: Running stop for /dev/drbd1 on /data
<br>Filesystem[32678]: 2007/06/13_12:45:08 INFO: Success<br>ResourceManager[32348]: 2007/06/13_12:45:08 info: Running /etc/ha.d/resource.d/Filesystem /dev/drbd0 /mysql ext3 stop<br>Filesystem[32731]: 2007/06/13_12:45:08 INFO: Running stop for /dev/drbd0 on /mysql
<br>Filesystem[32726]: 2007/06/13_12:45:08 INFO: Success<br>ResourceManager[32348]: 2007/06/13_12:45:08 info: Running /etc/ha.d/resource.d/drbddisk r1 stop<br>ResourceManager[32348]: 2007/06/13_12:45:08 info: Running /etc/ha.d/resource.d/drbddisk r0 stop
<br>ResourceManager[32348]: 2007/06/13_12:45:08 info: Running /etc/ha.d/resource.d/IPaddr <a href="http://192.168.100.218">192.168.100.218</a> stop<br>IPaddr[371]: 2007/06/13_12:45:08 INFO: /sbin/ifconfig eth0:0 <a href="http://192.168.100.218">
192.168.100.218</a> down<br>IPaddr[360]: 2007/06/13_12:45:08 INFO: Success<br>mach_down[32328]: 2007/06/13_12:45:08 info: /usr/lib/heartbeat/mach_down: nice_failback: foreign resources acquired<br>mach_down[32328]: 2007/06/13_12:45:08 info: mach_down takeover complete for node lab-test-nag01.
<br>heartbeat[32257]: 2007/06/13_12:45:08 info: mach_down takeover complete.<br>heartbeat[32257]: 2007/06/13_12:45:13 info: Local Resource acquisition completed. (none)<br>heartbeat[32257]: 2007/06/13_12:45:13 info: local resource transition completed.
<br>hb_standby[421]: 2007/06/13_12:45:38 Going standby [foreign].<br>heartbeat[32257]: 2007/06/13_12:45:38 info: lab-test-nag02 wants to go standby [foreign]<br>heartbeat[32257]: 2007/06/13_12:45:49 WARN: No reply to standby request. Standby request cancelled
<br><br>BTW I use auto-failback for a specific reason - you always know which one is the primary. That is, if your servers are in a remote location, managed by different group, and you want to do maintenance, you can be reasonably sure it's ok to remove the secondary from service.
<br>But it's just a thought, not totally critical.<br><br>the ha-debug is way too huge to post. I could send attached, off-list. recommend?<br><br>Dan.<br><div><span class="gmail_quote"><br>On 6/14/07, <b class="gmail_sendername">
Lars Ellenberg</b> <<a href="mailto:lars.ellenberg@linbit.com">lars.ellenberg@linbit.com</a>> wrote:</span><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
On Thu, Jun 14, 2007 at 10:37:38AM -0400, Dan Gahlinger wrote:<br>> I posted this in linux-ha but got no response, and didn't even see my post get<br>> to the list.<br>> so here it is here. seems more like a drbd issue anyhow.
<br>><br>> I have two systems, with heartbeat and DRBD installed.<br>> Initially I tested with just DRBD, and was able to fail back and forth very<br>> well and easily.<br>><br>> However, when using heartbeat, it won't fail over, no matter what I do. status
<br>> doesn't change.<br>><br>> I have it setup so that DRBD goes over a cross-over cable between the two<br>> systems on a private IP.<br>> and heartbeat is run over the public (internet facing) interfaces.
<br>><br>> My heartbeat config looks like this:<br>><br>> vi /etc/ha.d/ha.cf -<br>> logfacility local0<br>><br>> logfile /var/log/ha-log<br>><br>> debugfile /var/log/ha-debug<br>><br>> udpport 694
<br>><br>> keepalive 1<br>><br>> deadtime 60<br>><br>> bcast eth0<br>><br>> node LAB-TEST-01<br> ^^^^^^^^^^^^ [1]<br>><br>> node LAB-TEST-02<br>><br>> auto_failback on<br><br>I don't like automatic failback.
<br><br>it may even be dangerous<br>(in case you have some misbehaving resource agent on stop ...<br>if you don't know what I mean, consider yourself happy<br>to have missed out on one of the most fun parts setting up
<br>a heartbeat cluster)<br><br>in a "homogeneous" 2-node-failover-cluster<br>(i.e. both nodes are more or less identical)<br>it does not make much sense.<br><br>and to have a non-homogeneous cluster is<br>not a good idea either (most of the time).
<br><br>even then, operator will get paged for the first failover,<br>and if deemd useful, will initiate the switch-back by hand.<br><br>> and /etc/ha.d/haresources (note IP address is the virtual public IP):<br><br>( this is all one long single line, right?
<br> if not, you _have_ to use backslash! )<br>> lab-test-01 <a href="http://192.168.10.218">192.168.10.218</a> drbddisk Filesystem::/dev/drbd0::/mysql::ext3 Filesystem::/dev/drbd1::/data::ext3<br> ^^^^^^^^^^^ [1] ^^^^^^^^[2]
<br><br>[1] should be the same cAsE (preferably both small).<br> it must be the actual node name, as reported by "uname -n"<br>[2] please use one drbddisk statement per drbd resource explicitly.<br> drbddisk::r0 drbddisk::r1
<br> (or whatever your resource names are in drbd.conf)<br><br>> configs on both systems are the same, hosts files identical with all<br>> the entries. I've tried with auto_failback on and off seems to make<br>
> no difference.<br>><br>> I test by pulling the public cable on lab-test-01, or using ifconfig eth0 down<br>><br>> Also, when I bring the server back up drbd can't see the other system<br>> (either one), it becomes
<br>> secondary/unknown and primary/unknown.<br>><br>> It seems for some cases I need to use the drbdadm primary all on the<br>> primary at boot up to fix that.<br>> One other note about the heartbeat issue above. I found if I enter the
<br>> commands manually it seems to work.<br>> which makes it really weird.<br>><br>> Can anyone tell me what's going wrong?<br><br>the heartneat log file(s) (ha-debug)?<br><br><br>--<br>: Lars Ellenberg Tel +43-1-8178292-0 :
<br>: LINBIT Information Technologies GmbH Fax +43-1-8178292-82 :<br>: Vivenotgasse 48, A-1120 Vienna/Europe <a href="http://www.linbit.com">http://www.linbit.com</a> :<br>__<br>please use the "List-Reply" function of your email client.
<br>_______________________________________________<br>drbd-user mailing list<br><a href="mailto:drbd-user@lists.linbit.com">drbd-user@lists.linbit.com</a><br><a href="http://lists.linbit.com/mailman/listinfo/drbd-user">http://lists.linbit.com/mailman/listinfo/drbd-user
</a><br></blockquote></div><br>