Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
hi,
i have two machine with gentoo installed with heartbeat 2.0.7, drbd 8.0.4and mon
this setup is made for having a failover environment for a web application (
apache2.2 + php + mysql)
when mon stop the heartbeat or when i stop the heartbeat the machine is
restarted because the drbd device cannot be setted in secondary mode
below are file config for heartbeat and for drbd
regards,
cosmih
/etc/ha.d/ha.cf
---------------------
debugfile /var/log/ha-debug
logfile /var/log/ha-log
logfacility local0
keepalive 3
deadtime 12
warntime 9
initdead 20
udpport 694
baud 38400
serial /dev/ttyS0
auto_failback off
watchdog /dev/watchdog
node host01 host02
ping gateway_ip
respawn cluster /usr/lib/heartbeat/ipfail
crm off
---------------------------
/etc/ha.d/haresources
-------------------------------
host02 ip1/27/eth1/broadcast ip2/27/eth1/broadcast ip3/27/eth1/broadcast
ip4/27/eth1/broadcast arpUpdate::ip1/ip2/ip3/ip4 drbddisk::mirror sleep5
Filesystem::/dev/drbd0::/mirror::reiserfs mountFORftp mysql apache2 fcron
mon sshd_ip1
------------------------------------
where arpUpdate,sleep5,mountFORftp and sshd_ip are some script made by me
... and this script works verry well
/etc/drbd.conf
--------------------
global {
usage-count no;
}
common {
syncer { rate 70M; }
}
resource mirror {
protocol C;
handlers {
pri-on-incon-degr "echo o > /proc/sysrq-trigger ; halt -f";
pri-lost-after-sb "echo o > /proc/sysrq-trigger ; halt -f";
local-io-error "echo o > /proc/sysrq-trigger ; halt -f";
outdate-peer "/usr/sbin/drbd-peer-outdater";
}
startup {
wfc-timeout 12;
degr-wfc-timeout 120; # 2 minutes.
}
disk {
on-io-error detach;
}
net {
cram-hmac-alg "md5";
shared-secret "somepass";
after-sb-0pri discard-younger-primary;
after-sb-1pri consensus;
after-sb-2pri disconnect;
rr-conflict disconnect;
}
syncer {
rate 70M;
al-extents 257;
}
on host01 {
device /dev/drbd0;
disk /dev/sda7;
address 10.0.0.1:7788;
meta-disk /dev/sda8[0];
}
on host02 {
device /dev/drbd0;
disk /dev/sda7;
address 10.0.0.2:7788 ;
meta-disk /dev/sda8[0];
}
}
here is the ha-debug log (the real IP was replaced)
-------------------------------------
heartbeat[6522]: 2007/07/03_11:35:54 info: Heartbeat shutdown in progress.
(6522)
heartbeat[10706]: 2007/07/03_11:35:54 info: Giving up all HA resources.
ResourceManager[10716]: 2007/07/03_11:35:54 info: Releasing resource
group: host02 ip1/27/eth1/gateway_ip ip2/27/eth1/gateway_ip
ip3/27/eth1/gateway_ip ip4/27/eth1/gateway_ip arpUpdate::ip1/ip2/ip3/ip4
drbddisk::mirror sleep5 Filesystem::/dev/drbd0::/mirror::reiserfs
mountFORftp mysql apache2 fcron mon sshd_ip1
ResourceManager[10716]: 2007/07/03_11:35:54 info: Running
/etc/ha.d/resource.d/sshd_ip1 stop
ResourceManager[10716]: 2007/07/03_11:35:54 debug: Starting
/etc/ha.d/resource.d/sshd_ip1 stop
ResourceManager[10716]: 2007/07/03_11:35:54 debug:
/etc/ha.d/resource.d/sshd_ip1 stop done. RC=0
ResourceManager[10716]: 2007/07/03_11:35:54 info: Running
/etc/ha.d/resource.d/mon stop
ResourceManager[10716]: 2007/07/03_11:35:54 debug: Starting
/etc/ha.d/resource.d/mon stop
[32;01m* [0m Stopping mon ...
[A [72C [34;01m[ [32;01mok [34;01m ] [0m
ResourceManager[10716]: 2007/07/03_11:35:54 debug:
/etc/ha.d/resource.d/mon stop done. RC=0
ResourceManager[10716]: 2007/07/03_11:35:54 info: Running
/etc/ha.d/resource.d/fcron stop
ResourceManager[10716]: 2007/07/03_11:35:54 debug: Starting
/etc/ha.d/resource.d/fcron stop
Stopping fcron din heartbeat
11:35:54 reinstalling root's fcrontab
11:35:54 installing file /tmp/fcr-rCHC7Y for user root
Modifications will be taken into account right now.
ResourceManager[10716]: 2007/07/03_11:35:54 debug:
/etc/ha.d/resource.d/fcron stop done. RC=0
ResourceManager[10716]: 2007/07/03_11:35:54 info: Running
/etc/ha.d/resource.d/apache2 stop
ResourceManager[10716]: 2007/07/03_11:35:54 debug: Starting
/etc/ha.d/resource.d/apache2 stop
[33;01m* [0m WARNING: apache2 has not yet been started.
ResourceManager[10716]: 2007/07/03_11:35:54 debug:
/etc/ha.d/resource.d/apache2 stop done. RC=0
ResourceManager[10716]: 2007/07/03_11:35:54 info: Running
/etc/ha.d/resource.d/mysql stop
ResourceManager[10716]: 2007/07/03_11:35:54 debug: Starting
/etc/ha.d/resource.d/mysql stop
[32;01m* [0m Stopping mysql ...
[32;01m* [0m Stopping mysqld (0)
[A [72C [34;01m[ [32;01mok [34;01m ] [0m
ResourceManager[10716]: 2007/07/03_11:35:57 debug:
/etc/ha.d/resource.d/mysql stop done. RC=0
ResourceManager[10716]: 2007/07/03_11:35:57 info: Running
/etc/ha.d/resource.d/mountFORftp stop
ResourceManager[10716]: 2007/07/03_11:35:57 debug: Starting
/etc/ha.d/resource.d/mountFORftp stop
Unmounting /home/tony/website & /home/black/website
ResourceManager[10716]: 2007/07/03_11:35:57 debug:
/etc/ha.d/resource.d/mountFORftp stop done. RC=0
ResourceManager[10716]: 2007/07/03_11:35:57 info: Running
/etc/ha.d/resource.d/Filesystem /dev/drbd0 /mirror reiserfs stop
ResourceManager[10716]: 2007/07/03_11:35:57 debug: Starting
/etc/ha.d/resource.d/Filesystem /dev/drbd0 /mirror reiserfs stop
Filesystem[11147]: 2007/07/03_11:35:57 INFO: Running stop for /dev/drbd0
on /mirror
Filesystem[11147]: 2007/07/03_11:35:57 INFO: Trying to unmount /mirror
Filesystem[11147]: 2007/07/03_11:35:57 INFO: unmounted /mirror
successfully
Filesystem[11083]: 2007/07/03_11:35:57 INFO: Filesystem Success
INFO: Filesystem Success
ResourceManager[10716]: 2007/07/03_11:35:57 debug:
/etc/ha.d/resource.d/Filesystem /dev/drbd0 /mirror reiserfs stop done. RC=0
ResourceManager[10716]: 2007/07/03_11:35:57 info: Running
/etc/ha.d/resource.d/sleep5 stop
ResourceManager[10716]: 2007/07/03_11:35:57 debug: Starting
/etc/ha.d/resource.d/sleep5 stop
ResourceManager[10716]: 2007/07/03_11:36:02 debug:
/etc/ha.d/resource.d/sleep5 stop done. RC=0
ResourceManager[10716]: 2007/07/03_11:36:02 info: Running
/etc/ha.d/resource.d/drbddisk mirror stop
ResourceManager[10716]: 2007/07/03_11:36:02 debug: Starting
/etc/ha.d/resource.d/drbddisk mirror stop
State change failed: (-12) Device is held open by someone
Command '/sbin/drbdsetup /dev/drbd0 secondary' terminated with exit code 11
ResourceManager[10716]: 2007/07/03_11:36:02 debug:
/etc/ha.d/resource.d/drbddisk mirror stop done. RC=11
ResourceManager[10716]: 2007/07/03_11:36:02 ERROR: Return code 11 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[10716]: 2007/07/03_11:36:03 info: Retrying failed stop
operation [drbddisk::mirror]
ResourceManager[10716]: 2007/07/03_11:36:03 info: Running
/etc/ha.d/resource.d/drbddisk mirror stop
ResourceManager[10716]: 2007/07/03_11:36:03 debug: Starting
/etc/ha.d/resource.d/drbddisk mirror stop
State change failed: (-12) Device is held open by someone
Command '/sbin/drbdsetup /dev/drbd0 secondary' terminated with exit code 11
ResourceManager[10716]: 2007/07/03_11:36:03 debug:
/etc/ha.d/resource.d/drbddisk mirror stop done. RC=11
ResourceManager[10716]: 2007/07/03_11:36:03 ERROR: Return code 11 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[10716]: 2007/07/03_11:36:04 info: Retrying failed stop
operation [drbddisk::mirror]
ResourceManager[10716]: 2007/07/03_11:36:04 info: Running
/etc/ha.d/resource.d/drbddisk mirror stop
ResourceManager[10716]: 2007/07/03_11:36:04 debug: Starting
/etc/ha.d/resource.d/drbddisk mirror stop
State change failed: (-12) Device is held open by someone
Command '/sbin/drbdsetup /dev/drbd0 secondary' terminated with exit code 11
ResourceManager[10716]: 2007/07/03_11:36:04 debug:
/etc/ha.d/resource.d/drbddisk mirror stop done. RC=11
ResourceManager[10716]: 2007/07/03_11:36:04 ERROR: Return code 11 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[10716]: 2007/07/03_11:36:05 info: Retrying failed stop
operation [drbddisk::mirror]
ResourceManager[10716]: 2007/07/03_11:36:05 info: Running
/etc/ha.d/resource.d/drbddisk mirror stop
ResourceManager[10716]: 2007/07/03_11:36:05 debug: Starting
/etc/ha.d/resource.d/drbddisk mirror stop
State change failed: (-12) Device is held open by someone
Command '/sbin/drbdsetup /dev/drbd0 secondary' terminated with exit code 11
ResourceManager[10716]: 2007/07/03_11:36:05 debug:
/etc/ha.d/resource.d/drbddisk mirror stop done. RC=11
ResourceManager[10716]: 2007/07/03_11:36:05 ERROR: Return code 11 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[10716]: 2007/07/03_11:36:06 info: Retrying failed stop
operation [drbddisk::mirror]
ResourceManager[10716]: 2007/07/03_11:36:06 info: Running
/etc/ha.d/resource.d/drbddisk mirror stop
ResourceManager[10716]: 2007/07/03_11:36:06 debug: Starting
/etc/ha.d/resource.d/drbddisk mirror stop
State change failed: (-12) Device is held open by someone
Command '/sbin/drbdsetup /dev/drbd0 secondary' terminated with exit code 11
ResourceManager[10716]: 2007/07/03_11:36:06 debug:
/etc/ha.d/resource.d/drbddisk mirror stop done. RC=11
ResourceManager[10716]: 2007/07/03_11:36:06 ERROR: Return code 11 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[10716]: 2007/07/03_11:36:07 info: Retrying failed stop
operation [drbddisk::mirror]
ResourceManager[10716]: 2007/07/03_11:36:07 info: Running
/etc/ha.d/resource.d/drbddisk mirror stop
ResourceManager[10716]: 2007/07/03_11:36:07 debug: Starting
/etc/ha.d/resource.d/drbddisk mirror stop
State change failed: (-12) Device is held open by someone
Command '/sbin/drbdsetup /dev/drbd0 secondary' terminated with exit code 11
ResourceManager[10716]: 2007/07/03_11:36:07 debug:
/etc/ha.d/resource.d/drbddisk mirror stop done. RC=11
ResourceManager[10716]: 2007/07/03_11:36:07 ERROR: Return code 11 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[10716]: 2007/07/03_11:36:08 info: Retrying failed stop
operation [drbddisk::mirror]
ResourceManager[10716]: 2007/07/03_11:36:08 info: Running
/etc/ha.d/resource.d/drbddisk mirror stop
ResourceManager[10716]: 2007/07/03_11:36:08 debug: Starting
/etc/ha.d/resource.d/drbddisk mirror stop
State change failed: (-12) Device is held open by someone
Command '/sbin/drbdsetup /dev/drbd0 secondary' terminated with exit code 11
ResourceManager[10716]: 2007/07/03_11:36:08 debug:
/etc/ha.d/resource.d/drbddisk mirror stop done. RC=11
ResourceManager[10716]: 2007/07/03_11:36:08 ERROR: Return code 11 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[10716]: 2007/07/03_11:36:09 info: Retrying failed stop
operation [drbddisk::mirror]
ResourceManager[10716]: 2007/07/03_11:36:09 info: Running
/etc/ha.d/resource.d/drbddisk mirror stop
ResourceManager[10716]: 2007/07/03_11:36:09 debug: Starting
/etc/ha.d/resource.d/drbddisk mirror stop
State change failed: (-12) Device is held open by someone
Command '/sbin/drbdsetup /dev/drbd0 secondary' terminated with exit code 11
ResourceManager[10716]: 2007/07/03_11:36:09 debug:
/etc/ha.d/resource.d/drbddisk mirror stop done. RC=11
ResourceManager[10716]: 2007/07/03_11:36:09 ERROR: Return code 11 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[10716]: 2007/07/03_11:36:10 info: Retrying failed stop
operation [drbddisk::mirror]
ResourceManager[10716]: 2007/07/03_11:36:10 info: Running
/etc/ha.d/resource.d/drbddisk mirror stop
ResourceManager[10716]: 2007/07/03_11:36:10 debug: Starting
/etc/ha.d/resource.d/drbddisk mirror stop
State change failed: (-12) Device is held open by someone
Command '/sbin/drbdsetup /dev/drbd0 secondary' terminated with exit code 11
ResourceManager[10716]: 2007/07/03_11:36:10 debug:
/etc/ha.d/resource.d/drbddisk mirror stop done. RC=11
ResourceManager[10716]: 2007/07/03_11:36:10 ERROR: Return code 11 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[10716]: 2007/07/03_11:36:11 info: Retrying failed stop
operation [drbddisk::mirror]
ResourceManager[10716]: 2007/07/03_11:36:11 info: Running
/etc/ha.d/resource.d/drbddisk mirror stop
ResourceManager[10716]: 2007/07/03_11:36:11 debug: Starting
/etc/ha.d/resource.d/drbddisk mirror stop
State change failed: (-12) Device is held open by someone
Command '/sbin/drbdsetup /dev/drbd0 secondary' terminated with exit code 11
ResourceManager[10716]: 2007/07/03_11:36:11 debug:
/etc/ha.d/resource.d/drbddisk mirror stop done. RC=11
ResourceManager[10716]: 2007/07/03_11:36:11 ERROR: Return code 11 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[10716]: 2007/07/03_11:36:12 info: Retrying failed stop
operation [drbddisk::mirror]
ResourceManager[10716]: 2007/07/03_11:36:12 info: Running
/etc/ha.d/resource.d/drbddisk mirror stop
ResourceManager[10716]: 2007/07/03_11:36:12 debug: Starting
/etc/ha.d/resource.d/drbddisk mirror stop
State change failed: (-12) Device is held open by someone
Command '/sbin/drbdsetup /dev/drbd0 secondary' terminated with exit code 11
ResourceManager[10716]: 2007/07/03_11:36:13 debug:
/etc/ha.d/resource.d/drbddisk mirror stop done. RC=11
ResourceManager[10716]: 2007/07/03_11:36:13 ERROR: Return code 11 from
/etc/ha.d/resource.d/drbddisk
ResourceManager[10716]: 2007/07/03_11:36:13 CRIT: Resource STOP failure.
Reboot required!
ResourceManager[10716]: 2007/07/03_11:36:13 CRIT: Killing heartbeat
ungracefully!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20070703/e084f92f/attachment.htm>