[DRBD-user] Initial sync stalls forever with many drbd disks

Brian R. Hellman brian at linbit.com
Wed Apr 18 01:57:55 CEST 2012

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


Hi Andrew,


On 04/13/2012 09:19 AM, Andrew Maldonado wrote:
> Hey all,
>
> I am currently running into an issue using drbd in a xen cluster
> (managed by ganeti).
I've worked on Ganeti clusters with 30+vms active w/ DRBD, so I know it
works. :)
.....
>
> When adding drbd instances, I can add up to 17 without issue, but the
> 18th instance stalls on initial sync:
> block drbd17: peer( Primary -> Unknown ) conn( SyncTarget ->
> Disconnecting ) pdsk( UpToDate -> DUnknown )
> block drbd17: short read expecting header on sock: r=-512
> block drbd17: meta connection shut down by peer.
What does the other side say, do the logs mention why it shutdown?
> block drbd17: asender terminated
> block drbd17: Terminating asender thread
> block drbd17: Connection closed
> block drbd17: conn( Disconnecting -> StandAlone )
> block drbd17: receiver terminated
> block drbd17: Terminating receiver thread
> block drbd17: disk( Inconsistent -> Diskless )
> block drbd17: drbd_bm_resize called with capacity == 0
> block drbd17: worker terminated
> block drbd17: Terminating worker thread
>
> I am running Centos 5 xen, drbd 8.3.8. I have tried multiple
> kernel/drbd(8.3.2/8)/bios combinations to no avail. This behavior is
> consistent between all nodes (currently 5). I have even changed out the
> switch the drbd data is transferred on.
The quick and dirty fix might be upgrading to 8.3.12
>
> Currently the xen is running with 4GB ram allocated to dom0, with over
> 2GB free on each node.
>
> Do I just have not enough ram allocated to dom0? or am I missing
> something else.
>From our blog site:
"DRBD needs about 32MB RAM per TB storage for its bitmap."  So unless
you have a _really_ big volume you should be OK ;)

Full site here: http://blogs.linbit.com/p/169/maximum-volume-size/


Hope that helps,
Brian

-- 

: Brian Hellman
: LINBIT | "Your Way to High Availability"
: 1-503-573-1262 | 1-877-4-LINBIT
: Web: http://www.linbit.com
:
: Twitter: http://www.linbit.com/en/twitter
: Facebook: http://www.linbit.com/en/facebook




More information about the drbd-user mailing list