[DRBD-user] DRBD 0.8rc2 deadlock "WFBitMap" ?

Ralf Schenk rs at databay.de
Mon Jan 22 19:09:37 CET 2007


Philipp Reisner wrote:

> drbdadm down all ; drbdadm up all

That doesn't work. The error is:
Child process does not terminate!
Exiting.
No response from the DRBD driver! Is the module loaded ?

(Of course the module is loaded)

> But, please post your excerpts from the kernel logs (of both machines) 
> of the incident.
> And tell us if the clocks of the machines are in sync (with NTP), or
> what time difference you have there.

The clocks on both nodes are in sync via NTP.

OK, here is what I did now: I created the meta-data completely from
scratch on both nodes (drbdadm create-md vm0[1,2]) and started DRBD
(/etc/init.d/drbd start). Then I made each of the two nodes primary for
one DRBD device (drbdadm primary -o vm0[1,2]). I expected DRBD to start
a full resync...

But nothing happened:
root at hosta:~# cat /proc/drbd
version: 8.0rc2 (api:86/proto:86)
SVN Revision: 2704 build by root at server, 2007-01-18 16:05:48
 0: cs:WFBitMapS st:Secondary/Secondary ds:UpToDate/Inconsistent C r---
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0
        resync: used:0/31 hits:0 misses:0 starving:0 dirty:0 changed:0
        act_log: used:0/127 hits:0 misses:0 starving:0 dirty:0 changed:0
 1: cs:WFBitMapT st:Secondary/Primary ds:Inconsistent/UpToDate C r---
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0
        resync: used:0/31 hits:0 misses:0 starving:0 dirty:0 changed:0
        act_log: used:0/127 hits:0 misses:0 starving:0 dirty:0 changed:0
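To spot this condition without reading /proc/drbd by eye, the status lines can be parsed with a short script. This is only a minimal sketch: it assumes the 8.0-style line layout shown above (minor number, then cs:, st:, ds: fields), and the function names are made up for illustration, not part of DRBD.

```python
import re

# Matches a device status line such as:
#  0: cs:WFBitMapS st:Secondary/Secondary ds:UpToDate/Inconsistent C r---
# The field layout is an assumption based on the output quoted above.
STATUS_RE = re.compile(r"^\s*(\d+):\s+cs:(\S+)\s+st:(\S+)\s+ds:(\S+)")


def parse_proc_drbd(text):
    """Return one dict per device found in /proc/drbd-style text."""
    devices = []
    for line in text.splitlines():
        match = STATUS_RE.match(line)
        if match:
            minor, cs, st, ds = match.groups()
            devices.append({
                "minor": int(minor),
                "cs": cs,              # connection state
                "st": st.split("/"),   # [local role, peer role]
                "ds": ds.split("/"),   # [local disk state, peer disk state]
            })
    return devices


def waiting_for_bitmap(devices):
    """Return the minor numbers whose connection state is WFBitMapS/WFBitMapT."""
    return [d["minor"] for d in devices if d["cs"].startswith("WFBitMap")]


SAMPLE = """\
version: 8.0rc2 (api:86/proto:86)
 0: cs:WFBitMapS st:Secondary/Secondary ds:UpToDate/Inconsistent C r---
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0
 1: cs:WFBitMapT st:Secondary/Primary ds:Inconsistent/UpToDate C r---
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0
"""

if __name__ == "__main__":
    for minor in waiting_for_bitmap(parse_proc_drbd(SAMPLE)):
        print("drbd%d is waiting for the bitmap exchange" % minor)
```

On a live node the same functions could be fed the contents of /proc/drbd instead of SAMPLE; a device that stays in one of these states across repeated checks matches the hang described here.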

Here is the kernel log:

Jan 22 18:42:34 hosta kernel: drbd: initialised. Version: 8.0rc2
(api:86/proto:86)
Jan 22 18:42:34 hosta kernel: drbd: SVN Revision: 2704 build by
root at server, 2007-01-18 16:05:48
Jan 22 18:42:34 hosta kernel: drbd: registered as block device major 147
Jan 22 18:42:34 hosta kernel: drbd: minor_table @ 0xffff8801eff62880
Jan 22 18:44:11 hosta kernel: drbd0: disk( Diskless -> Attaching )
Jan 22 18:44:11 hosta kernel: drbd0: No usable activity log found.
Jan 22 18:44:11 hosta kernel: drbd0: max_segment_size ( = BIO size ) = 32768
Jan 22 18:44:11 hosta kernel: drbd0: drbd_bm_resize called with capacity
== 838860800
Jan 22 18:44:11 hosta kernel: drbd0: resync bitmap: bits=104857600
words=1638400
Jan 22 18:44:11 hosta kernel: drbd0: size = 400 GB (419430400 KB)
Jan 22 18:44:11 hosta kernel: drbd0: writing of bitmap took 14 jiffies
Jan 22 18:44:11 hosta kernel: drbd0: 400 GB marked out-of-sync by on
disk bit-map.
Jan 22 18:44:11 hosta kernel: drbd0: 419430400 KB now marked out-of-sync
by on disk bit-map.
Jan 22 18:44:11 hosta kernel: drbd0: reading of bitmap took 15 jiffies
Jan 22 18:44:11 hosta kernel: drbd0: recounting of set bits took
additional 2 jiffies
Jan 22 18:44:11 hosta kernel: drbd0: 400 GB marked out-of-sync by on
disk bit-map.
Jan 22 18:44:11 hosta kernel: drbd0: disk( Attaching -> Inconsistent )
Jan 22 18:44:11 hosta kernel: drbd0: Writing meta data super block now.
Jan 22 18:44:11 hosta kernel: drbd1: disk( Diskless -> Attaching )
Jan 22 18:44:11 hosta kernel: drbd1: No usable activity log found.
Jan 22 18:44:11 hosta kernel: drbd1: max_segment_size ( = BIO size ) = 32768
Jan 22 18:44:11 hosta kernel: drbd1: drbd_bm_resize called with capacity
== 838860800
Jan 22 18:44:11 hosta kernel: drbd1: resync bitmap: bits=104857600
words=1638400
Jan 22 18:44:11 hosta kernel: drbd1: size = 400 GB (419430400 KB)
Jan 22 18:44:11 hosta kernel: drbd1: writing of bitmap took 6 jiffies
Jan 22 18:44:11 hosta kernel: drbd1: 400 GB marked out-of-sync by on
disk bit-map.
Jan 22 18:44:11 hosta kernel: drbd1: 419430400 KB now marked out-of-sync
by on disk bit-map.
Jan 22 18:44:12 hosta kernel: drbd1: reading of bitmap took 23 jiffies
Jan 22 18:44:12 hosta kernel: drbd1: recounting of set bits took
additional 2 jiffies
Jan 22 18:44:12 hosta kernel: drbd1: 400 GB marked out-of-sync by on
disk bit-map.
Jan 22 18:44:12 hosta kernel: drbd1: disk( Attaching -> Inconsistent )
Jan 22 18:44:12 hosta kernel: drbd1: Writing meta data super block now.
Jan 22 18:44:12 hosta kernel: drbd0: Writing meta data super block now.
Jan 22 18:44:12 hosta kernel: drbd1: Writing meta data super block now.
Jan 22 18:44:12 hosta kernel: drbd0: conn( StandAlone -> Unconnected )
Jan 22 18:44:12 hosta kernel: drbd0: receiver (re)started
Jan 22 18:44:12 hosta kernel: drbd0: conn( Unconnected -> WFConnection )
Jan 22 18:44:12 hosta kernel: drbd1: conn( StandAlone -> Unconnected )
Jan 22 18:44:12 hosta kernel: drbd1: receiver (re)started
Jan 22 18:44:12 hosta kernel: drbd1: conn( Unconnected -> WFConnection )
Jan 22 18:44:19 hosta kernel: drbd0: conn( WFConnection -> WFReportParams )
Jan 22 18:44:19 hosta kernel: drbd0: Handshake successful: DRBD Network
Protocol version 86
Jan 22 18:44:19 hosta kernel: drbd0: Peer authenticated using 20 bytes
of 'sha1' HMAC
Jan 22 18:44:19 hosta kernel: drbd0: No resync, but bits in bitmap!
Jan 22 18:44:19 hosta kernel: drbd0: peer( Unknown -> Secondary ) conn(
WFReportParams -> Connected ) pdsk( DUnknown -> Inconsistent )
Jan 22 18:44:19 hosta kernel: drbd0: Writing meta data super block now.
Jan 22 18:44:19 hosta kernel: drbd1: conn( WFConnection -> WFReportParams )
Jan 22 18:44:19 hosta kernel: drbd1: Handshake successful: DRBD Network
Protocol version 86
Jan 22 18:44:19 hosta kernel: drbd1: Peer authenticated using 20 bytes
of 'sha1' HMAC
Jan 22 18:44:19 hosta kernel: drbd1: No resync, but bits in bitmap!
Jan 22 18:44:19 hosta kernel: drbd1: peer( Unknown -> Secondary ) conn(
WFReportParams -> Connected ) pdsk( DUnknown -> Inconsistent )
Jan 22 18:44:19 hosta kernel: drbd1: Writing meta data super block now.
Jan 22 18:45:41 hosta kernel: drbd0: role( Secondary -> Primary ) disk(
Inconsistent -> UpToDate )
Jan 22 18:45:41 hosta kernel: drbd0: Forced to conisder local data as
UpToDate!
Jan 22 18:45:41 hosta kernel: drbd0: Creating new current UUID
Jan 22 18:45:41 hosta kernel: drbd0: Writing meta data super block now.
Jan 22 18:45:41 hosta kernel: drbd0: conn( Connected -> WFBitMapS )
Jan 22 18:45:55 hosta kernel: drbd1: peer( Secondary -> Primary ) pdsk(
Inconsistent -> UpToDate )
Jan 22 18:45:55 hosta kernel: drbd1: Writing meta data super block now.
Jan 22 18:45:55 hosta kernel: drbd1: conn( Connected -> WFBitMapT )
Jan 22 18:45:55 hosta kernel: drbd1: Writing meta data super block now.
Jan 22 18:49:50 hosta kernel: drbd0: role( Primary -> Secondary )
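
The bitmap figures in the log above are at least consistent with each other, which suggests the bitmap itself was sized correctly. A quick arithmetic check, assuming the capacity is given in 512-byte sectors, one bitmap bit covers 4 KiB, and bitmap words are 64 bits wide (these units are inferred from the numbers, not stated in the log):

```python
# Values copied from the kernel log above.
capacity_sectors = 838860800   # "drbd_bm_resize called with capacity == 838860800"
logged_size_kb = 419430400     # "size = 400 GB (419430400 KB)"
logged_bits = 104857600        # "resync bitmap: bits=104857600"
logged_words = 1638400         # "words=1638400"

# Assumed units (not stated in the log).
SECTOR_BYTES = 512
BYTES_PER_BIT = 4096
BITS_PER_WORD = 64

size_bytes = capacity_sectors * SECTOR_BYTES
size_kb = size_bytes // 1024
size_gb = size_kb // (1024 * 1024)
bits = size_bytes // BYTES_PER_BIT
words = bits // BITS_PER_WORD

print(size_kb == logged_size_kb, size_gb, bits == logged_bits, words == logged_words)
```

Under these assumptions every derived value matches the logged one, so all 104857600 bits being set corresponds to the full 400 GB being marked out-of-sync.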


-- 
__________________________________________________

Ralf Schenk
fon (02 41) 9 91 21-0
fax (02 41) 9 91 21-59
rs at databay.de

Databay AG
Hüttenstraße 7
D-52068 Aachen
www.databay.de

Sitz/Amtsgericht Aachen
HRB:8437
USt-IdNr.: DE 210844202

Databay - einfach machen.
