[DRBD-user] split-brain on Ubuntu 14.04 LTS after reboot of master node

Waldemar Brodkorb mail at waldemar-brodkorb.de
Sun Nov 15 11:05:11 CET 2015



Hi Philipp,
Philipp Marek wrote,

> > I have been struggling with a problem for two days and have found no
> > solution yet. I think it might be something trivially simple I am overlooking.
> ...
> > The following software is installed:
> > drbd8-utils 2:8.4.4-1ubuntu1
> > pacemaker 1.1.10+git20130802-1ubuntu2.3
> > corosync 2.3.3-1ubuntu1
> ...
> > All looks good to me looking into crm_mon:
> > Last updated: Fri Nov 13 17:00:40 2015
> > Last change: Fri Nov 13 16:37:39 2015 via cibadmin on drbd01
> > Stack: corosync
> > Current DC: drbd01 (169093703) - partition with quorum
> ...
> > Current DC: drbd02 (169093704) - partition with quorum
> You've got a Pacemaker-split-brain, that's why the resources on top get 
> started on both sides.
 
Do you have any idea why this can happen on a normal reboot?
I have attached the missing corosync.conf.
I am not simulating any network failure, just doing a normal reboot.
We are using unicast for the cluster communication.

best regards
 Waldemar
 
-------------- next part --------------
# Please read the openais.conf.5 manual page

totem {
	version: 2

	# How long before declaring a token lost (ms)
	token: 3000

	# How many token retransmits before forming a new configuration
	token_retransmits_before_loss_const: 10

	# How long to wait for join messages in the membership protocol (ms)
	join: 60

	# How long to wait for consensus to be achieved before starting a new round of membership configuration (ms)
	consensus: 3600

	# Turn off the virtual synchrony filter
	vsftype: none

	# Number of messages that may be sent by one processor on receipt of the token
	max_messages: 20

	# Limit generated nodeids to 31-bits (positive signed integers)
	clear_node_high_bit: yes

	# Disable encryption
	secauth: off

	# How many threads to use for encryption/decryption
	threads: 0

	# Optionally assign a fixed node id (integer)
	# nodeid: 1234

	# This specifies the mode of redundant ring, which may be none, active, or passive.
	rrp_mode: none

	interface {
		# The following values need to be set based on your environment
		ringnumber: 0
		bindnetaddr: 10.20.0.0
		mcastport: 5405
	}
	transport: udpu
}

amf {
	mode: disabled
}

quorum {
	# Quorum for the Pacemaker Cluster Resource Manager
	provider: corosync_votequorum
	expected_votes: 1
}

aisexec {
	user: root
	group: root
}

logging {
	fileline: off
	to_stderr: yes
	to_logfile: no
	to_syslog: yes
	syslog_facility: daemon
	debug: off
	timestamp: on
	logger_subsys {
		subsys: AMF
		debug: off
		tags: enter|leave|trace1|trace2|trace3|trace4|trace6
	}
}



nodelist {
	node {
		ring0_addr: 10.20.42.72
	}
	node {
		ring0_addr: 10.20.42.71
	}
}
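
One detail in the quorum section above may be worth checking, assuming
corosync 2.x votequorum behaves as described in votequorum(5): an explicit
expected_votes overrides the count derived from the nodelist, so with
expected_votes: 1 each of the two nodes can consider itself quorate on its
own. That would be consistent with both drbd01 and drbd02 reporting
"partition with quorum". A minimal sketch of a two-node quorum section
follows; it is an untested suggestion, not a confirmed fix for this cluster:

```
quorum {
	# Quorum for the Pacemaker Cluster Resource Manager
	provider: corosync_votequorum
	# Two nodes are listed in nodelist, so expect two votes
	expected_votes: 2
	# Special two-node mode; per votequorum(5) this also enables wait_for_all
	two_node: 1
}
```

With wait_for_all in effect, a freshly booted node should not become quorate
until it has seen its peer at least once. Fencing (STONITH) in Pacemaker is
still generally needed to handle a real loss of communication between the
nodes.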


