[DRBD-user] io error when mounting drbd device

Lawrence Strydom qholloi at gmail.com
Fri Feb 17 10:19:56 CET 2012


Hi List,

I use DRBD in dual-primary mode with ocfs2 for my load-balancing web
server cluster. I didn't encounter any errors during setup, and when I put
the web site on the DRBD device on the primary node, it replicated without
any errors. It ran fine during the week of testing, but this morning, when
we updated code located on the DRBD device, we noticed it was not
replicating to the secondary node.
The DRBD device was mounted on both nodes, but /proc/drbd showed this:

version: 8.3.7 (api:88/proto:86-91)
GIT-hash: ea9e28dbff98e331a62bcbcc63a6135808fe2917 build by root@web01.junkmail.co.za, 2012-01-10 09:54:40
 0: cs:StandAlone ro:Primary/Unknown ds:UpToDate/DUnknown   r----
    ns:0 nr:0 dw:5960937 dr:5047235 al:1490 bm:1363 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:8840028
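
To check for this state from a script, the device lines of /proc/drbd can
be split into their key:value fields. The following is only a minimal
sketch in Python: the function names are hypothetical, the sample text is
the device entry above with the wrapped line joined, and it treats any
connection state other than Connected as a problem, which is a
simplification (resync states would be flagged as well). The oos counter
is taken to be in KiB, as described in the DRBD 8.3 documentation.

```python
import re

# Device entry copied from the /proc/drbd output above (wrapped line joined).
SAMPLE = """\
 0: cs:StandAlone ro:Primary/Unknown ds:UpToDate/DUnknown   r----
    ns:0 nr:0 dw:5960937 dr:5047235 al:1490 bm:1363 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:8840028
"""


def parse_drbd_status(text):
    """Return {minor_number: {field: value}} for each device entry."""
    devices = {}
    current = None
    for line in text.splitlines():
        # A new device entry starts with its minor number, e.g. " 0: cs:..."
        start = re.match(r"\s*(\d+):\s", line)
        if start:
            current = int(start.group(1))
            devices[current] = {}
        if current is not None:
            # Collect two- or three-letter keys such as cs, ro, ds, oos.
            for key, value in re.findall(r"\b([a-z]{2,3}):(\S+)", line):
                devices[current][key] = value
    return devices


def describe_problems(fields):
    """List human-readable problems for one device (simplified check)."""
    problems = []
    if fields.get("cs") != "Connected":
        problems.append("connection state is " + fields.get("cs", "unknown"))
    oos_kib = int(fields.get("oos", "0"))
    if oos_kib > 0:
        problems.append("%d KiB out of sync" % oos_kib)
    return problems


if __name__ == "__main__":
    for minor, fields in parse_drbd_status(SAMPLE).items():
        for problem in describe_problems(fields):
            print("drbd%d: %s" % (minor, problem))
```

On a live node the same functions could be fed the contents of /proc/drbd
instead of the sample string.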


I restarted drbd and ocfs2 but still the result was the same. Next I
rebooted the misbehaving node and noticed when it came back up that the
DRBD device was no longer mounted.

Trying to mount the device manually returns this error:
mount /dev/drbd0
mount.ocfs2: I/O error on channel while opening device /dev/drbd0


A tail of the log file shows nothing, but an earlier entry shows this:

Feb 17 10:47:54 web02 kernel: [   13.531600] block drbd0: disk( Attaching -> UpToDate )
Feb 17 10:47:54 web02 kernel: [   13.535865] block drbd0: conn( StandAlone -> Unconnected )
Feb 17 10:47:54 web02 kernel: [   13.535889] block drbd0: Starting receiver thread (from drbd0_worker [1484])
Feb 17 10:47:54 web02 kernel: [   13.535998] block drbd0: receiver (re)started
Feb 17 10:47:54 web02 kernel: [   13.536006] block drbd0: conn( Unconnected -> WFConnection )
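
The state changes in these entries can be followed more easily by
reducing the log to the last state reported for each device. This is a
rough sketch in Python, assuming the "name( Old -> New )" layout seen in
the lines above; the function name and regular expressions are
illustrative only and not part of any DRBD tool.

```python
import re

# Log lines copied from above (wrapped lines joined).
LOG = """\
Feb 17 10:47:54 web02 kernel: [   13.531600] block drbd0: disk( Attaching -> UpToDate )
Feb 17 10:47:54 web02 kernel: [   13.535865] block drbd0: conn( StandAlone -> Unconnected )
Feb 17 10:47:54 web02 kernel: [   13.535889] block drbd0: Starting receiver thread (from drbd0_worker [1484])
Feb 17 10:47:54 web02 kernel: [   13.535998] block drbd0: receiver (re)started
Feb 17 10:47:54 web02 kernel: [   13.536006] block drbd0: conn( Unconnected -> WFConnection )
"""

DEVICE = re.compile(r"block (drbd\d+):")
CHANGE = re.compile(r"(\w+)\( (\w+) -> (\w+) \)")


def last_states(log_text):
    """Return {(device, kind): newest_state} from state-change log lines."""
    states = {}
    for line in log_text.splitlines():
        device = DEVICE.search(line)
        if device is None:
            continue
        # A line may carry several changes, e.g. conn( ... ) and disk( ... ).
        for kind, _old, new in CHANGE.findall(line):
            states[(device.group(1), kind)] = new
    return states


if __name__ == "__main__":
    for (device, kind), state in sorted(last_states(LOG).items()):
        print("%s %s: %s" % (device, kind, state))
```

For the excerpt above this leaves drbd0 with its disk UpToDate and its
connection in WFConnection, i.e. waiting for the peer.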


This is my r1.res file:

===============================================================
resource r1 {
        meta-disk internal;
        device /dev/drbd0;
        disk /dev/vol01/docroot;

        syncer { rate 1000M; }
        net {
                allow-two-primaries;
                after-sb-0pri discard-zero-changes;
                after-sb-1pri discard-secondary;
                after-sb-2pri disconnect;
        }
        startup { become-primary-on both; }

        on web01.junkmail.co.za { address 10.0.0.111:7789; }
        on web02.junkmail.co.za { address 10.0.0.112:7789; }
}
===============================================================



Here is /etc/ocfs2/cluster.conf:

===============================================================
cluster:
    node_count = 2
    name = jbm_web

node:
    ip_port = 7777
    ip_address = 10.0.0.111
    number = 1
    name = web01
    cluster = jbm_web

node:
    ip_port = 7777
    ip_address = 10.0.0.112
    number = 2
    name = web02
    cluster = jbm_web
===============================================================



Any help or ideas would be much appreciated - the pressure is on here.

Thanks

Lawrence