[Drbd-dev] [PATCH] drbd: fix discard_zeroes_if_aligned regression

Eric Wheeler drbd-dev at lists.ewheeler.net
Thu Jun 6 21:50:52 CEST 2019


On Mon, 3 Jun 2019, Lars Ellenberg wrote:
> On Sun, Jun 02, 2019 at 12:28:31AM +0000, Eric Wheeler wrote:
> > On Tue, 28 May 2019, Lars Ellenberg wrote:
> > 
> > > On Fri, May 10, 2019 at 05:36:32PM +0000, Eric Wheeler wrote:
> > > > Hi Lars,
> > > > 
> > > > We just tried 4.19.x and this bugs still exists. We applied the patch 
> > > > which was originally submitted to this thread and it still applies cleanly 
> > > > and seems to work for our use case. You mentioned that you had some older 
> > > > code which zeroed out unaligned discard requests (or perhaps it was for a 
> > > > different purpose) that you may be able to use to patch this. Could you 
> > > > dig those up and see if we can get this solved?
> > > > 
> > > > It would be nice to be able to use drbd with thin backing volumes from the 
> > > > vanilla kernel. If this has already been fixed in something newer than 
> > > > 4.19, then please point me to the commit.
> > > 
> > > I think it was merged upstream in 5.0
> > > f31e583aa2c2 drbd: introduce P_ZEROES (REQ_OP_WRITE_ZEROES on the "wire")
> > 
> > Thanks Lars, I appreciate your patch.  
> > 
> > Your unaligned zerout code in drbd_issue_discard_or_zero_out() looks 
> > great.  I particulary like how you adjusted max_discard_sectors to the 
> > granularity, as well as alignment handling.  Well thought out.
> > 
> > Your commit notes that "for backward compatibility, P_TRIM means zero-out, 
> > unless the DRBD_FF_WZEROES feature flag is agreed upon during handshake."
> > 
> > We test our environment by deploying the newer kernel on one of the DRBD 
> > servers and checking for regressions---but this will cause a zero-out on 
> > the new server because the old server doesn't yet support DRBD_FF_WZEROES.
> > 
> > For our purpose, can you think of any reason that it would be unsafe to 
> > hack the following into drbd_do_features() so the newer version will not 
> > zero-out while we test and get both nodes up to the newer version?
> 
> If you "fake" agreed_features to include DRBD_FF_WZEROES,
> that may start to send the P_ZEROES command,
> which the older peer does not understand yet,
> triggering a protocol error and disconnect...

Good point.

> You can always use a newer DRBD module with the older kernel,
> until you are prepared to upgrade the kernel...
> 
> But you knew that.

True.  I was hoping for a way to test without making any changes to the 
first node, but I'll give it a shot.  At least we can stay on the older 
kernel version and only upgrade the drbd module.

Thanks for the feedback!

--
Eric Wheeler



> 
> > diff --git a/drivers/block/drbd/drbd_receiver.c b/drivers/block/drbd/drbd_receiver.c
> > index c7ad88d..76191e6 100644
> > --- a/drivers/block/drbd/drbd_receiver.c
> > +++ b/drivers/block/drbd/drbd_receiver.c
> > @@ -5382,6 +5382,8 @@ static int drbd_do_features(struct drbd_connection *connection)
> >  	connection->agreed_pro_version = min_t(int, PRO_VERSION_MAX, p->protocol_max);
> >  	connection->agreed_features = PRO_FEATURES & be32_to_cpu(p->feature_flags);
> >  
> > +	connection->agreed_features |= DRBD_FF_WZEROES;
> > +
> >  	drbd_info(connection, "Handshake successful: "
> >  	     "Agreed network protocol version %d\n", connection->agreed_pro_version);
> 
> -- 
> : Lars Ellenberg
> : LINBIT | Keeping the Digital World Running
> : DRBD -- Heartbeat -- Corosync -- Pacemaker
> : R&D, Integration, Ops, Consulting, Support
> 
> DRBD® and LINBIT® are registered trademarks of LINBIT
> _______________________________________________
> drbd-dev mailing list
> drbd-dev at lists.linbit.com
> http://lists.linbit.com/mailman/listinfo/drbd-dev
> 


More information about the drbd-dev mailing list