[Drbd-dev] [PATCH v3] drbd: fix throttling on newly created DM backing devices
Lars
lars.ellenberg at linbit.com
Mon Sep 8 15:38:16 CEST 2014
On Mon, Sep 08, 2014 at 03:05:28PM +0200, Imre Palik wrote:
> On 09/07/14 11:58, Lars wrote:
> >On Fri, Sep 05, 2014 at 08:41:18PM +0200, Imre Palik wrote:
> >>From: "Palik, Imre" <imrep at amazon.de>
> >>
> >>If the drbd backing device is a new device mapper device (e.g., a
> >>dm-linear mapping of an existing block device that contains data), the
> >>counters are initially 0 even though the device contains useful
> >>data. This causes throttling until something accesses the drbd device
> >>or the backing device.
> >
> >What was wrong with my previous proposal?
>
> Sorry, I haven't realised you added a proposal to your reply. It
> seems, I really needed that extra sleep during the weekend ...
>
> Your proposal is good. Of course, I like my last one a slightly
> better. But as they say, beauty is in the eye of the beholder :-)
>
> >How does changing the signedness help with
> >rs_last_events not being properly initialized?
>
> It only helps with reasoning. I reason with modular arithmetic way
> easier than with signed integer overflows. Accidentally, 0 is a
> good initialisation value in case of unsigned arithmetic.
>
> >Are you sure you have also considered all wrap-around cases?
> >
> >Maybe you are too focused on your particular corner case
> >(disk_stats starting with 0).
> >Maybe I'm just thick right now, so please explain.
>
> The idea is that 0 is the smallest possible value for an unsigned,
> and curr_events is monotonically increasing (mod 2^32) .
The problem is: it is not :-(
It's a difference between stats that are increased by the
block core at (usually) completion time, and an atomic_t
that is increased by DRBD at just before (or just after) submittion.
Depending very much on stress in the IO subsystem,
and overall timing of events, a later call may see a smaller
"curr_events" (because rs_last_sect_ev has already increased,
but the disk stats have not yet noticed).
With unsigned, that may wrap around to UINT_MAX, which we don't want.
> This
> means, initially either curr_events > 64, that is, we enter the
> loop, and do the initialisation, or it will be bigger than 64 at
> most when we want to start throttle in an ideal world (after no more
> than 64 sectors of activity).
>
> Basically, while you initialise rs_last_events to an ideal value
> with some calculation, I choose a safe static value. I am content
> with both approaches. I think, as a subsystem maintainer, you
> should choose the one you like better. If you choose yours, then
> you can add
> Reviewed-by: Imre Palik <imrep at amazon.de>
Thanks,
Lars
More information about the drbd-dev
mailing list