[DRBD-user] Re: [Linux-HA] hb_report: trouble on "simple" 2-node active/passive cluster with heartbeat 2.1.3 and CRM

Wolfram Schlich lists at wolfram.schlich.org
Thu Feb 14 19:27:06 CET 2008

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


* Wolfram Schlich <lists at wolfram.schlich.org> [2008-02-14 16:22]:
> Looked fine. Then I ran "killall -9 heartbeat ccm cib lrmd stonithd attrd crmd
> tengine pengine cibmon dopd pingd" on sirius -- the node which did
> not currently run the resources but which was the DC.
> After a while, all resources on pollux were _restarted_ and
> strange DRBD kernel messages appeared -- see attached var-log-messages from
> pollux and sirius (I placed them in /tmp/hb_report/host/ myself).

I looked at the logs again and found out that something strange was
happening to the DRBD master/slave instances. After I killed the
processes on sirius, for an unknown reason the DRBD resource monitor
on pollux returned failure (everything was running fine) and the DRBS
resource which was previously running on sirius was migrated to
pollux, therefore everything was stopped and started again... very
strange!

Please see my commented logfile which contains heartbeat and drbd
log messages:
http://dev.gentoo.org/~wschlich/tmp/syslog.txt

Thanks!
-- 
Regards,
Wolfram Schlich <wschlich at gentoo.org>
Gentoo Linux * http://dev.gentoo.org/~wschlich/



More information about the drbd-user mailing list