[Drbd-dev] [PATCH] drbd: fix discard_zeroes_if_aligned regression

Lars Ellenberg lars.ellenberg at linbit.com
Mon Jun 3 14:48:04 CEST 2019


On Sun, Jun 02, 2019 at 12:28:31AM +0000, Eric Wheeler wrote:
> On Tue, 28 May 2019, Lars Ellenberg wrote:
> 
> > On Fri, May 10, 2019 at 05:36:32PM +0000, Eric Wheeler wrote:
> > > Hi Lars,
> > > 
> > > We just tried 4.19.x and this bugs still exists. We applied the patch 
> > > which was originally submitted to this thread and it still applies cleanly 
> > > and seems to work for our use case. You mentioned that you had some older 
> > > code which zeroed out unaligned discard requests (or perhaps it was for a 
> > > different purpose) that you may be able to use to patch this. Could you 
> > > dig those up and see if we can get this solved?
> > > 
> > > It would be nice to be able to use drbd with thin backing volumes from the 
> > > vanilla kernel. If this has already been fixed in something newer than 
> > > 4.19, then please point me to the commit.
> > 
> > I think it was merged upstream in 5.0
> > f31e583aa2c2 drbd: introduce P_ZEROES (REQ_OP_WRITE_ZEROES on the "wire")
> 
> Thanks Lars, I appreciate your patch.  
> 
> Your unaligned zerout code in drbd_issue_discard_or_zero_out() looks 
> great.  I particulary like how you adjusted max_discard_sectors to the 
> granularity, as well as alignment handling.  Well thought out.
> 
> Your commit notes that "for backward compatibility, P_TRIM means zero-out, 
> unless the DRBD_FF_WZEROES feature flag is agreed upon during handshake."
> 
> We test our environment by deploying the newer kernel on one of the DRBD 
> servers and checking for regressions---but this will cause a zero-out on 
> the new server because the old server doesn't yet support DRBD_FF_WZEROES.
> 
> For our purpose, can you think of any reason that it would be unsafe to 
> hack the following into drbd_do_features() so the newer version will not 
> zero-out while we test and get both nodes up to the newer version?

If you "fake" agreed_features to include DRBD_FF_WZEROES,
that may start to send the P_ZEROES command,
which the older peer does not understand yet,
triggering a protocol error and disconnect...

You can always use a newer DRBD module with the older kernel,
until you are prepared to upgrade the kernel...

But you knew that.

> diff --git a/drivers/block/drbd/drbd_receiver.c b/drivers/block/drbd/drbd_receiver.c
> index c7ad88d..76191e6 100644
> --- a/drivers/block/drbd/drbd_receiver.c
> +++ b/drivers/block/drbd/drbd_receiver.c
> @@ -5382,6 +5382,8 @@ static int drbd_do_features(struct drbd_connection *connection)
>  	connection->agreed_pro_version = min_t(int, PRO_VERSION_MAX, p->protocol_max);
>  	connection->agreed_features = PRO_FEATURES & be32_to_cpu(p->feature_flags);
>  
> +	connection->agreed_features |= DRBD_FF_WZEROES;
> +
>  	drbd_info(connection, "Handshake successful: "
>  	     "Agreed network protocol version %d\n", connection->agreed_pro_version);

-- 
: Lars Ellenberg
: LINBIT | Keeping the Digital World Running
: DRBD -- Heartbeat -- Corosync -- Pacemaker
: R&D, Integration, Ops, Consulting, Support

DRBD® and LINBIT® are registered trademarks of LINBIT


More information about the drbd-dev mailing list