Hi Philipp,
Philipp Marek wrote,
> > I am struggling with a problem since two days and found no solution,
> > yet. I think it might be something trivially simple I am overlooking.
> ...
> > The following software is installed:
> > drbd8-utils 2:8.4.4-1ubuntu1
> > pacemaker 1.1.10+git20130802-1ubuntu2.3
> > corosync 2.3.3-1ubuntu1
> ...
> > All looks good to me looking into crm_mon:
> > Last updated: Fri Nov 13 17:00:40 2015
> > Last change: Fri Nov 13 16:37:39 2015 via cibadmin on drbd01
> > Stack: corosync
> > Current DC: drbd01 (169093703) - partition with quorum
> ...
> > Current DC: drbd02 (169093704) - partition with quorum
> You've got a Pacemaker-split-brain, that's why the resources on top get
> started on both sides.
Do you have an idea why this can happen on a normal reboot?
Attached the missing corosync.conf.
I do not simulate any network failure, just a normal reboot.
We are using unicast for the cluster communication.
best regards
Waldemar
-------------- next part --------------
# Please read the openais.conf.5 manual page
totem {
    version: 2
    # How long before declaring a token lost (ms)
    token: 3000
    # How many token retransmits before forming a new configuration
    token_retransmits_before_loss_const: 10
    # How long to wait for join messages in the membership protocol (ms)
    join: 60
    # How long to wait for consensus to be achieved before starting a new round of membership configuration (ms)
    consensus: 3600
    # Turn off the virtual synchrony filter
    vsftype: none
    # Number of messages that may be sent by one processor on receipt of the token
    max_messages: 20
    # Limit generated nodeids to 31-bits (positive signed integers)
    clear_node_high_bit: yes
    # Disable encryption
    secauth: off
    # How many threads to use for encryption/decryption
    threads: 0
    # Optionally assign a fixed node id (integer)
    # nodeid: 1234
    # This specifies the mode of redundant ring, which may be none, active, or passive.
    rrp_mode: none
    interface {
        # The following values need to be set based on your environment
        ringnumber: 0
        bindnetaddr: 10.20.0.0
        mcastport: 5405
    }
    transport: udpu
}
amf {
    mode: disabled
}
quorum {
    # Quorum for the Pacemaker Cluster Resource Manager
    provider: corosync_votequorum
    expected_votes: 1
}
aisexec {
    user: root
    group: root
}
logging {
    fileline: off
    to_stderr: yes
    to_logfile: no
    to_syslog: yes
    syslog_facility: daemon
    debug: off
    timestamp: on
    logger_subsys {
        subsys: AMF
        debug: off
        tags: enter|leave|trace1|trace2|trace3|trace4|trace6
    }
}
nodelist {
    node {
        ring0_addr: 10.20.42.72
    }
    node {
        ring0_addr: 10.20.42.71
    }
}
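
For comparison, here is a minimal sketch of how the quorum and nodelist sections are often written for a two-node corosync 2.x cluster using votequorum. The two_node setting and the explicit nodeid values are assumptions for illustration and are not part of the attached file. Whether they have any bearing on the behaviour described above is not established in this thread.

```
quorum {
    provider: corosync_votequorum
    # Assumption: not present in the attached file. Per votequorum(5),
    # two_node: 1 is the special mode for clusters with exactly two nodes.
    two_node: 1
}
nodelist {
    node {
        ring0_addr: 10.20.42.71
        # Assumption: illustrative fixed node ID, not from the original post.
        nodeid: 1
    }
    node {
        ring0_addr: 10.20.42.72
        # Assumption: illustrative fixed node ID, not from the original post.
        nodeid: 2
    }
}
```

According to votequorum(5), enabling two_node also turns on wait_for_all, so a freshly booted node waits until it has seen its peer once before it considers itself quorate. The remaining sections (totem, logging) would stay as attached.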