<html>
<head>
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<br>
<br>
On 2/17/12 4:19 AM, Lawrence Strydom wrote:
<blockquote
cite="mid:CA+EmvHe9Eid0o2GmZ4HeN-nMEf2yQxX4QYVYsnmMHQkD74t2BQ@mail.gmail.com"
type="cite">Hi List,<br>
<br>
I used DRBD in dual primary mode with ocfs2 for my load balancing
web server cluster. I didn't encounter any errors during setup and
when I put the web site on the DRBD device on the primary node, it
replicated without any errors. It has been running fine during the
week of testing but this morning when we updated code located on
the DRBD device we noticed it was not replicating to the secondary
node. <br>
the DRBD device was mounted on both nodes but /proc/drbd output
this:<br>
<br>
<b>version: 8.3.7 (api:88/proto:86-91)<br>
GIT-hash: ea9e28dbff98e331a62bcbcc63a6135808fe2917 build by <a
moz-do-not-send="true" class="moz-txt-link-abbreviated"
href="mailto:root@web01.junkmail.co.za">root@web01.junkmail.co.za</a>,
2012-01-10 09:54:40<br>
0: cs:StandAlone ro:Primary/Unknown ds:UpToDate/DUnknown
r----<br>
ns:0 nr:0 dw:5960937 dr:5047235 al:1490 bm:1363 lo:0 pe:0
ua:0 ap:0 ep:1 wo:b oos:8840028</b></blockquote>
So you have a split brain, i think - you didn't post the drbd output
from the other node, so that's just an educated guess.<br>
<br>
shutdown ocfs2/o2cb on one node, and follow this:<br>
<br>
<a class="moz-txt-link-freetext" href="http://www.drbd.org/users-guide/s-resolve-split-brain.html">http://www.drbd.org/users-guide/s-resolve-split-brain.html</a><br>
<br>
then validate both are primary/uptodate and restart your filesystem
clustering.<br>
<br>
You will need to post all the drbd logs from both boxes to
understand what root cause is. You are running a oldish version of
drbd, plus you didn't indicate what your os/kernel was.<br>
<br>
David<br>
</body>
</html>