[DRBD-user] Split Brain again, during reboot

Ben Clewett ben at roadrunner.uk.com
Tue Aug 28 13:23:00 CEST 2007



Hi Guys,

(Sorry if this arrives more than once; I am having problems with email.)

Sometime during the last few days my DRBD (8.0.3) entered a split-brain.

I am not sure what caused it. Could anyone experienced in reading DRBD 
logs help me work it out?

Starting from a good state, I requested a /sbin/reboot:

Aug 24 17:21:35 init: Switching to runlevel: 6
Aug 24 17:21:36 kernel: drbd0: disk( UpToDate -> Diskless )
Aug 24 17:21:36 kernel: drbd0: drbd_bm_resize called with capacity == 0
Aug 24 17:21:36 kernel: drbd1: State change failed: Device is held open by someone
Aug 24 17:21:36 kernel: drbd1:   state = { cs:StandAlone st:Primary/Unknown ds:UpToDate/DUnknown r--- }
Aug 24 17:21:36 kernel: drbd1:  wanted = { cs:StandAlone st:Secondary/Unknown ds:UpToDate/DUnknown r--- }
Aug 24 17:21:36 kernel: drbd0: worker terminated
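
For anyone reading along, the "state" and "wanted" lines above differ in only one field. The following is a minimal sketch, not a DRBD tool: it parses the two lines (re-joined after email wrapping) and prints the fields that differ. The helper names parse_state and diff_states are made up for illustration.

```python
import re

# The two kernel log lines from the shutdown excerpt, re-joined after email wrapping.
STATE = "drbd1:   state = { cs:StandAlone st:Primary/Unknown ds:UpToDate/DUnknown r--- }"
WANTED = "drbd1:  wanted = { cs:StandAlone st:Secondary/Unknown ds:UpToDate/DUnknown r--- }"

def parse_state(line):
    """Return the key:value fields found inside the braces of a state line."""
    body = re.search(r"\{(.*)\}", line).group(1)
    # Tokens without a colon (such as the "r---" flags) are skipped.
    return dict(tok.split(":", 1) for tok in body.split() if ":" in tok)

def diff_states(current, wanted):
    """Return {field: (current, wanted)} for every field that differs."""
    cur, want = parse_state(current), parse_state(wanted)
    return {k: (cur[k], want.get(k)) for k in cur if cur[k] != want.get(k)}

print(diff_states(STATE, WANTED))
# prints {'st': ('Primary/Unknown', 'Secondary/Unknown')}
```

So, going by these lines alone, drbd1 was already StandAlone at shutdown, and the only change refused was the demotion from Primary to Secondary, with the reason given as "Device is held open by someone".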


The following is the log of the reboot, after which I am 95% certain 
DRBD was left in split-brain:


Aug 24 17:24:00 kernel: drbd: module not supported by Novell, setting U taint flag.
Aug 24 17:24:00 kernel: drbd: initialised. Version: 8.0.3 (api:86/proto:86)
Aug 24 17:24:00 kernel: drbd: SVN Revision: 2881 build by root@, 2007-07-03 11:03:51
Aug 24 17:24:00 kernel: drbd: registered as block device major 147
Aug 24 17:24:00 kernel: drbd: minor_table @ 0xffff8102261223c0
Aug 24 17:24:00 kernel: drbd0: disk( Diskless -> Attaching )
Aug 24 17:24:00 kernel: drbd0: Found 6 transactions (276 active extents) in activity log.
Aug 24 17:24:00 kernel: drbd0: max_segment_size ( = BIO size ) = 32768
Aug 24 17:24:00 kernel: drbd0: drbd_bm_resize called with capacity == 4878499462
Aug 24 17:24:00 kernel: drbd0: resync bitmap: bits=609812433 words=9528320
Aug 24 17:24:00 kernel: drbd0: size = 2326 GB (2439249731 KB)
Aug 24 17:24:01 kernel: drbd0: reading of bitmap took 117 jiffies
Aug 24 17:24:01 kernel: drbd0: recounting of set bits took additional 34 jiffies
Aug 24 17:24:01 kernel: drbd0: 51 MB marked out-of-sync by on disk bit-map.
Aug 24 17:24:01 kernel: drbd0: disk( Attaching -> UpToDate )
Aug 24 17:24:01 kernel: drbd0: Writing meta data super block now.
Aug 24 17:24:01 kernel: drbd1: disk( Diskless -> Attaching )
Aug 24 17:24:01 kernel: drbd1: Found 6 transactions (324 active extents) in activity log.
Aug 24 17:24:01 kernel: drbd1: max_segment_size ( = BIO size ) = 32768
Aug 24 17:24:01 kernel: drbd1: drbd_bm_resize called with capacity == 4878499462
Aug 24 17:24:01 kernel: drbd1: resync bitmap: bits=609812433 words=9528320
Aug 24 17:24:01 kernel: drbd1: size = 2326 GB (2439249731 KB)
Aug 24 17:24:01 kernel: drbd1: reading of bitmap took 68 jiffies
Aug 24 17:24:01 kernel: drbd1: recounting of set bits took additional 46 jiffies
Aug 24 17:24:01 kernel: drbd1: 632 KB marked out-of-sync by on disk bit-map.
Aug 24 17:24:01 kernel: drbd1: disk( Attaching -> UpToDate )
Aug 24 17:24:01 kernel: drbd1: Writing meta data super block now.
Aug 24 17:24:01 kernel: drbd0: conn( StandAlone -> Unconnected )
Aug 24 17:24:01 kernel: drbd0: receiver (re)started
Aug 24 17:24:01 kernel: drbd0: conn( Unconnected -> WFConnection )
Aug 24 17:24:01 kernel: drbd1: conn( StandAlone -> Unconnected )
Aug 24 17:24:01 kernel: drbd1: receiver (re)started
Aug 24 17:24:01 kernel: drbd1: conn( Unconnected -> WFConnection )
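
To make the boot log easier to follow, here is a rough sketch that pulls out only the "field( old -> new )" transition lines and reports where each device ended up. It is an illustration rather than anything DRBD ships; the regular expression and the final_states helper are assumptions based purely on the line format visible above.

```python
import re

# Transition lines copied from the boot log above.
LOG = """\
Aug 24 17:24:00 kernel: drbd0: disk( Diskless -> Attaching )
Aug 24 17:24:01 kernel: drbd0: disk( Attaching -> UpToDate )
Aug 24 17:24:01 kernel: drbd1: disk( Diskless -> Attaching )
Aug 24 17:24:01 kernel: drbd1: disk( Attaching -> UpToDate )
Aug 24 17:24:01 kernel: drbd0: conn( StandAlone -> Unconnected )
Aug 24 17:24:01 kernel: drbd0: conn( Unconnected -> WFConnection )
Aug 24 17:24:01 kernel: drbd1: conn( StandAlone -> Unconnected )
Aug 24 17:24:01 kernel: drbd1: conn( Unconnected -> WFConnection )
"""

# Matches e.g. "drbd0: conn( StandAlone -> Unconnected )".
TRANSITION = re.compile(r"(drbd\d+): (\w+)\( (\w+) -> (\w+) \)")

def final_states(log_text):
    """Map (device, field) to the last state reported for that field."""
    final = {}
    for dev, field, _old, new in TRANSITION.findall(log_text):
        final[(dev, field)] = new  # later lines overwrite earlier ones
    return final

for (dev, field), state in sorted(final_states(LOG).items()):
    print(dev, field, state)
# prints:
# drbd0 conn WFConnection
# drbd0 disk UpToDate
# drbd1 conn WFConnection
# drbd1 disk UpToDate
```

In this excerpt both devices finish with an UpToDate disk and a connection state of WFConnection, and no line in the excerpt itself mentions split-brain, so whatever flagged it would have to be in a later part of the log or in the peer's log.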


Does anyone have any idea what might be causing this, and how to fix it?

Regards,

Ben







