Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Hi Philipp, Philipp Marek wrote, > > I am struggling with a problem since two days and found no solution, > > yet. I think it might be something trivially simple I am overlooking. > ... > > The following software is installed: > > drbd8-utils 2:8.4.4-1ubuntu1 > > pacemaker 1.1.10+git20130802-1ubuntu2.3 > > corosync 2.3.3-1ubuntu1 > ... > > All looks good to me looking into crm_mon: > > Last updated: Fri Nov 13 17:00:40 2015 > > Last change: Fri Nov 13 16:37:39 2015 via cibadmin on drbd01 > > Stack: corosync > > Current DC: drbd01 (169093703) - partition with quorum > ... > > Current DC: drbd02 (169093704) - partition with quorum > You've got a Pacemaker-split-brain, that's why the resources on top get > started on both sides. Do you have an idea why this can happen on a normal reboot? Attached the missing corosync.conf. I do not simlate any network failure, just a normal reboot. We are using unicast for the cluster communication. best regards Waldemar -------------- next part -------------- # Please read the openais.conf.5 manual page totem { version: 2 # How long before declaring a token lost (ms) token: 3000 # How many token retransmits before forming a new configuration token_retransmits_before_loss_const: 10 # How long to wait for join messages in the membership protocol (ms) join: 60 # How long to wait for consensus to be achieved before starting a new round of membership configuration (ms) consensus: 3600 # Turn off the virtual synchrony filter vsftype: none # Number of messages that may be sent by one processor on receipt of the token max_messages: 20 # Limit generated nodeids to 31-bits (positive signed integers) clear_node_high_bit: yes # Disable encryption secauth: off # How many threads to use for encryption/decryption threads: 0 # Optionally assign a fixed node id (integer) # nodeid: 1234 # This specifies the mode of redundant ring, which may be none, active, or passive. rrp_mode: none interface { # The following values need to be set based on your environment ringnumber: 0 bindnetaddr: 10.20.0.0 mcastport: 5405 } transport: udpu } amf { mode: disabled } quorum { # Quorum for the Pacemaker Cluster Resource Manager provider: corosync_votequorum expected_votes: 1 } aisexec { user: root group: root } logging { fileline: off to_stderr: yes to_logfile: no to_syslog: yes syslog_facility: daemon debug: off timestamp: on logger_subsys { subsys: AMF debug: off tags: enter|leave|trace1|trace2|trace3|trace4|trace6 } } nodelist { node { ring0_addr: 10.20.42.72 } node { ring0_addr: 10.20.42.71 } }