Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Hi List, I used DRBD in dual primary mode with ocfs2 for my load balancing web server cluster. I didn't encounter any errors during setup and when I put the web site on the DRBD device on the primary node, it replicated without any errors. It has been running fine during the week of testing but this morning when we updated code located on the DRBD device we noticed it was not replicating to the secondary node. the DRBD device was mounted on both nodes but /proc/drbd output this: *version: 8.3.7 (api:88/proto:86-91) GIT-hash: ea9e28dbff98e331a62bcbcc63a6135808fe2917 build by root at web01.junkmail.co.za, 2012-01-10 09:54:40 0: cs:StandAlone ro:Primary/Unknown ds:UpToDate/DUnknown r---- ns:0 nr:0 dw:5960937 dr:5047235 al:1490 bm:1363 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:8840028* I restarted drbd and ocfs2 but still the result was the same. Next I rebooted the misbehaving node and noticed when it came back up that the DRBD device was no longer mounted. Trying to mount the device manually returns this error: *mount /dev/drbd0 mount.ocfs2: I/O error on channel while opening device /dev/drbd0* A tail of the log file shows nothing but an earlier entry shows this: *Feb 17 10:47:54 web02 kernel: [ 13.531600] block drbd0: disk( Attaching -> UpToDate ) Feb 17 10:47:54 web02 kernel: [ 13.535865] block drbd0: conn( StandAlone -> Unconnected ) Feb 17 10:47:54 web02 kernel: [ 13.535889] block drbd0: Starting receiver thread (from drbd0_worker [1484]) Feb 17 10:47:54 web02 kernel: [ 13.535998] block drbd0: receiver (re)started Feb 17 10:47:54 web02 kernel: [ 13.536006] block drbd0: conn( Unconnected -> WFConnection ) *This is my r1.res file: *=============================================================== resource r1 { meta-disk internal; device /dev/drbd0; disk /dev/vol01/docroot; syncer { rate 1000M; } net { allow-two-primaries; after-sb-0pri discard-zero-changes; after-sb-1pri discard-secondary; after-sb-2pri disconnect; } startup { become-primary-on both; } on web01.junkmail.co.za { address 10.0.0.111:7789; } on web02.junkmail.co.za { address 10.0.0.112:7789; } }* *===============================================================* Here is /etc/ocfs2/cluster.conf: =============================================================== *cluster: node_count = 2 name = jbm_web node: ip_port = 7777 ip_address = 10.0.0.111 number = 1 name = web01 cluster = jbm_web node: ip_port = 7777 ip_address = 10.0.0.112 number = 2 name = web02 cluster = jbm_web ================================================================ *Any help/ideas much appreciated - the pressure is on here. Thanks Lawrence * * -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20120217/f678cd2a/attachment.htm>