[DRBD-user] Strange issues rebooting secondary node

Stefano Cislaghi s.cislaghi at gmail.com
Thu Jan 31 12:40:59 CET 2008

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


Hello all,

I've two nodes (active/passive) using DRBD and LinuxHA. The slave node
rebooted. Restarting the drbd startup wait on booting.

Doing a cat /proc/drbd shows:
1: cs:WFConnection st:Secondary/Unknown

Btw, after some minutes system go on. Now cat /proc/drbd.conf shows:
[root at rman1c ~]# cat /proc/drbd
version: 8.2.4 (api:88/proto:86-88)
GIT-hash: fc00c6e00a1b6039bfcebe37afa3e7e28dbd92fa build by
root at rman1a, 2008-01-29 11:48:39

Then

[root at rman1c ~]#  drbdadm connect all
[root at rman1c ~]# cat /proc/drbd
version: 8.2.4 (api:88/proto:86-88)
GIT-hash: fc00c6e00a1b6039bfcebe37afa3e7e28dbd92fa build by
root at rman1a, 2008-01-29 11:48:39

 1: cs:Unconnected st:Secondary/Unknown ds:Diskless/DUnknown C r---
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0


my conf file is:

[root at rman1c ~]# cat /etc/drbd.conf
#
# please have a a look at the example configuration file in
# /usr/share/doc/drbd/drbd.conf
#

global { usage-count yes; }
common { syncer { rate 10M; } }
resource r0 {
        protocol C;
        startup { wfc-timeout 0; degr-wfc-timeout     120; }
        disk { on-io-error detach; } # or panic, ...
        net {
                cram-hmac-alg sha1;
                shared-secret "FooFunFactory";
        }
        on rman1a {
                device    /dev/drbd1;
                disk      /dev/sdb2;
                address   IP1:7789;
                meta-disk  internal;
        }
        on rman1c {
                device    /dev/drbd1;
                disk      /dev/sdb2;
                address   IP2:7789;
                meta-disk  internal;
        }
}

On kernel dmesg I've found:
divert: not allocating divert_blk for non-ethernet device sit0
drbd: no version for "struct_module" found: kernel tainted.
drbd: initialised. Version: 8.2.4 (api:88/proto:86-88)
drbd: GIT-hash: fc00c6e00a1b6039bfcebe37afa3e7e28dbd92fa build by
root at rman1a, 2008-01-29 11:48:39
drbd: registered as block device major 147
drbd: minor_table @ 0xf7de4b00
drbd1: disk( Diskless -> Attaching )
drbd1: Found 3 transactions (5 active extents) in activity log.
drbd1: max_segment_size ( = BIO size ) = 32768
drbd1: drbd_bm_resize called with capacity == 94520848
drbd1: resync bitmap: bits=11815106 words=369224
drbd1: size = 45 GB (47260424 KB)
drbd1: reading of bitmap took 56 jiffies
drbd1: recounting of set bits took additional 3 jiffies
drbd1: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
drbd1: disk( Attaching -> UpToDate )
drbd1: Writing meta data super block now.
drbd1: conn( StandAlone -> Unconnected )
drbd1: receiver (re)started
drbd1: conn( Unconnected -> WFConnection )
eth0: no IPv6 routers present
eth1: no IPv6 routers present
drbd1: conn( WFConnection -> Disconnecting )
drbd1: Discarding network configuration.
drbd1: tl_clear()
drbd1: Connection closed
drbd1: conn( Disconnecting -> StandAlone )
drbd1: receiver terminated
drbd1: disk( UpToDate -> Diskless )
drbd1: conn( StandAlone -> Unconnected )



Now primary node shows on dmesg:
drbd1: peer( Secondary -> Unknown ) conn( Connected -> TearDown )
pdsk( UpToDate -> DUnknown )
drbd1: Creating new current UUID
drbd1: Writing meta data super block now.
drbd1: asender terminated

and on cat /proc/drbd

[root at rman1a init.d]# cat /proc/drbd
version: 8.2.4 (api:88/proto:86-88)
GIT-hash: fc00c6e00a1b6039bfcebe37afa3e7e28dbd92fa build by
root at rman1a, 2008-01-29 11:48:39

 1: cs:TearDown st:Primary/Unknown ds:UpToDate/DUnknown C r---
    ns:11077188 nr:0 dw:11100252 dr:212037 al:3214 bm:25 lo:0 pe:0 ua:0 ap:0
        resync: used:0/31 hits:1 misses:1 starving:0 dirty:0 changed:1
        act_log: used:0/127 hits:2771849 misses:3933 starving:0
dirty:719 changed:3214


What's wrong? There's a way to avoid this problem during boot?

Thanks
Stefano



More information about the drbd-user mailing list