<div>Hi all,</div><div> </div><div>We have this scenario:</div><div> </div><div>- A datacenter</div><div> - 1 Server running kvm VMs:</div><div> * 1 Openfiler to distribute hard disk to other VMs (stores VMs and VMs's data) and other uses</div>
<div> * Other VMs...</div><div> - DRBD configured to replicate Openfiler's volumes as primary</div><div> </div><div>- B datacenter</div><div> - 1 Server running kvm as backup for center A failure events</div><div>
- DRBD configured to serve as backup of Openfiler so VMs can be restarted in case of failure with updated data. It also works as primary. </div><div> </div><div>Both centers are connected through wireless connections (500 Mb/s) which is good for our requirements. Nevertheless, there is a ping of 300 ms because of multiple routers in whithin...</div>
<div> </div><div>We have been experiencing multiple split-brain situations and we don't know why... perhaps the link is down for a while but I don't understand the source of the problem as although they are primary-primary, one of the servers should never write to disk as all machines are kept on datacenter A, is this assumption right?, is this split-brain just "conceptual" telling that network was lost but no real uncoherences have appeared?. Under such asumption.. would it be ok to apply autorecover?.</div>
<div> </div><div>Any light on all this?, any alternatives?, any experience under such scenario?.</div><div> </div><div>Thanks in advance, </div><div> </div><div>Jose</div>