[DRBD-user] Re: DRBD on top of LVM2 kills primary server

Andreas Gabriel gabriel at hrz.uni-marburg.de
Wed Apr 27 00:37:09 CEST 2005


Hello,

As already reported by Laurent Simonneau, no problems occur when DRBD runs
with protocol A. Protocols B and C both kill the primary. However, the two
servers are connected directly via Gigabit Ethernet, so latency should be
low. Doesn't this contradict the recommendation to use protocol A only for
high-latency networks?
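
For reference, a minimal sketch of the workaround, assuming the only change
is the protocol line in the resource section of the drbd.conf quoted below.
The comments summarise how the three protocols are usually described; they
are not something verified on these machines.

```
resource "zeo" {
  # A: a write counts as complete once it is on the local disk
  #    and in the local TCP send buffer
  # B: a write counts as complete once the peer has received it in memory
  # C: a write counts as complete once the peer has written it to its disk
  protocol A;   # with B or C here, the primary dies during mke2fs
  # remaining sections (startup, disk, net, syncer, on ...) unchanged
}
```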

Thanks in advance
Andreas

Gabriel Andreas wrote:
> Hello,
> 
> we are trying to set up DRBD (0.7.10) on top of LVM2 (2.01.0)
> on two Debian systems running kernel 2.4.29-vs1.2.10 (SMP).
> 
> After configuring the drbd resource and initialising
> the drbd device on both systems, everything seems to be running fine:
> 
> server01> cat /proc/drbd
> version: 0.7.10 (api:77/proto:74)
> SVN Revision: 1743 build by root at pcrz175, 2005-03-02 09:18:52
>  0: cs:Connected st:Primary/Secondary ld:Consistent
>     ns:0 nr:8192 dw:8192 dr:0 al:0 bm:4 lo:0 pe:0 ua:0 ap:0
> 
> server02> cat /proc/drbd
> version: 0.7.10 (api:77/proto:74)
> SVN Revision: 1743 build by root at pcrz175, 2005-03-02 09:18:52
>  0: cs:Connected st:Secondary/Primary ld:Consistent
>     ns:8192 nr:0 dw:0 dr:8192 al:0 bm:2 lo:0 pe:0 ua:0 ap:0
> 
> 
> But if I try to create an ext3 filesystem on the /dev/drbd0 device,
> the following command kills server01:
> 
> server01> mke2fs -j -m 0 /dev/drbd0
>     mke2fs 1.35 (28-Feb-2004)
>     Filesystem label=
>     OS type: Linux
>     Block size=4096 (log=2)
>     Fragment size=4096 (log=2)
>     245760 inodes, 491520 blocks
>     0 blocks (0.00%) reserved for the super user
>     First data block=0
>     15 block groups
>     32768 blocks per group, 32768 fragments per group
>     16384 inodes per group
>     Superblock backups stored on blocks:
>         32768, 98304, 163840, 229376, 294912
> 
>     Writing inode tables: done
>     Creating journal (8192 blocks): done
>     Writing superblocks and filesystem accounting information:
> 
> 
> And server02 shows the following messages:
> 
> 
> server02> cat /proc/drbd
> version: 0.7.10 (api:77/proto:74)
> SVN Revision: 1743 build by root at pcrz175, 2005-03-02 09:18:52
>  0: cs:WFConnection st:Secondary/Unknown ld:Consistent
>     ns:8192 nr:16400 dw:16400 dr:8192 al:0 bm:2 lo:0 pe:0 ua:0 ap:0
> 
> server02> dmesg | grep drbd
> drbd: initialised. Version: 0.7.10 (api:77/proto:74)
> drbd: SVN Revision: 1743 build by root at xxxxx, 2005-03-02 09:18:52
> drbd: registered as block device major 147
> drbd0: resync bitmap: bits=491520 words=15360
> drbd0: size = 1920 MB (1966080 KB)
> drbd0: 0 KB marked out-of-sync by on disk bit-map.
> drbd0: No usable activity log found.
> drbd0: drbdsetup [1072]: cstate Unconfigured --> StandAlone
> drbd0: drbdsetup [1075]: cstate StandAlone --> Unconnected
> drbd0: drbd0_receiver [1076]: cstate Unconnected --> WFConnection
> drbd0: drbd0_receiver [1076]: cstate WFConnection --> WFReportParams
> drbd0: Handshake successful: DRBD Network Protocol version 74
> drbd0: Connection established.
> drbd0: I am(S): 1:00000002:00000001:00000002:00000001:01
> drbd0: Peer(S): 1:00000002:00000001:00000001:00000001:11
> drbd0: drbd0_receiver [1076]: cstate WFReportParams --> WFBitMapS
> drbd0: Secondary/Unknown --> Secondary/Secondary
> drbd0: drbd0_receiver [1076]: cstate WFBitMapS --> SyncSource
> drbd0: Resync started as SyncSource (need to sync 8192 KB [2048 bits set]).
> drbd0: Resync done (total 1 sec; paused 0 sec; 8192 K/sec)
> drbd0: drbd0_worker [1073]: cstate SyncSource --> Connected
> drbd0: Secondary/Secondary --> Secondary/Primary
> drbd0: drbd0_asender [1077]: cstate Connected --> Timeout
> drbd0: short sent WriteAck size=32 sent=8
> drbd0: short sent BarrierAck size=16 sent=-1001
> drbd0: asender terminated
> drbd0: error receiving Barrier, l: 8!
> drbd0: worker terminated
> drbd0: unacked_cnt = 75
> drbd0: drbd0_receiver [1076]: cstate Timeout --> Unconnected
> drbd0: Connection lost.
> drbd0: drbd0_receiver [1076]: cstate Unconnected --> WFConnection
> 
> --------------------
> 
> drbd.conf on both systems
> 
> #############
> resource "zeo" {
>   protocol C;
>   incon-degr-cmd "echo '!DRBD! pri on incon-degr' | wall ; sleep 60 halt -f";
>   startup {
>   }
>   disk {
>     on-io-error   detach;
>   }
>   net {
>   }
>   syncer {
>     rate 50M;
>     group 0;
>     al-extents 257;
>   }
>   on server01 {
>     device     /dev/drbd0;
>     disk       /dev/zeo_vg/zeo_lv;
>     address    10.0.0.1:7800;
>     meta-disk  internal;
>   }
>   on server02 {
>     device     /dev/drbd0;
>     disk       /dev/zeo_vg/zeo_lv;
>     address    10.0.0.2:7800;
>     meta-disk  internal;
>   }
> }
> ##############
> 
> 
> The logical volume configuration on both systems
> 
> server01/02> lvdisplay
>   --- Logical volume ---
>   LV Name                /dev/zeo_vg/zeo_lv
>   VG Name                zeo_vg
>   LV UUID                H8Ffeh-znZi-jE7x-cYtF-4Hch-ddRL-NyMDHp
>   LV Write Access        read/write
>   LV Status              available
>   # open                 4
>   LV Size                2.00 GB
>   Current LE             512
>   Segments               1
>   Allocation             inherit
>   Read ahead sectors     0
>   Block device           254:7
> 
> ##############
> 
> 
> Any hints?
> 
> Thank You
> Andreas



