[DRBD-user] Resync done on secondary, continue on primary

Lars Ellenberg Lars.Ellenberg at linbit.com
Tue May 18 10:45:58 CEST 2004



/ 2004-05-18 11:28:17 +0400
\ Eugene Crosser:
> On Tue, 2004-05-18 at 01:04, Lars Ellenberg wrote:
> 
> > > I see this problem again: secondary reports "resync complete" but
> > > primary continues to display "sync'ed:100.0%" for a long time after
> > > that.
> 
> > can you please verify that they agree about the amount that needs to be synced?
> > that should be logged as "need to sync <some number> KB" somewhere...
> 
> In yesterday's log, there is a case when they do not match.  Scenario
> was like this: at 20:33 I put two nodes in operation after upgrading to
> current-at-that-time CVS.  They quickly got in sync even though the "need to
> sync" sizes did not match.  But as I suspected that not all data was
> synchronized previously (as you told there was a bug causing incomplete
> synchronization), I ran "drbdadm invalidate all" on the secondary.  It
> was at 20:35.  About an hour later, sync was complete on secondary but
> primary thought that it was not completed until ca 22:30 when I wrote my
> message.


> 
> May 17 20:33:37 nfsa2.mail.back kernel: drbd0: size = 214165504 KB
> May 17 20:33:38 nfsa2.mail.back kernel: drbd0: 28 KB marked out-of-sync by on disk bit-map.
> May 17 20:33:38 nfsa2.mail.back kernel: drbd0: Found 6 transactions (324 active extents) in activity log.
> May 17 20:33:38 nfsa2.mail.back kernel: drbd0: Marked additional 1052644 KB as out-of-sync based on AL.
> May 17 20:33:38 nfsa2.mail.back kernel: drbd0: Connection established.
> May 17 20:33:38 nfsa1.mail.back kernel: drbd0: Connection established.
> May 17 20:33:38 nfsa2.mail.back kernel: drbd0: Resync started as target (need to sync 3112172 KB).
> May 17 20:33:38 nfsa1.mail.back kernel: drbd0: Resync started as source (need to sync 3112180 KB).

*here* they do not agree: the source wants to sync 8 KB more than the
target (3112180 KB vs. 3112172 KB).  strange. I need to understand this.
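
to spot this kind of disagreement in a merged syslog, something like the
following rough sketch might help. it assumes the message format is exactly
as in the lines quoted here, and that both nodes log the resync start with
the same timestamp (which needs reasonably synchronized clocks). the names
PATTERN and resync_mismatches are made up for this example, they are not
part of drbd.

```python
import re

# Log lines copied from the excerpt quoted in this thread.
LOG = """\
May 17 20:33:38 nfsa2.mail.back kernel: drbd0: Resync started as target (need to sync 3112172 KB).
May 17 20:33:38 nfsa1.mail.back kernel: drbd0: Resync started as source (need to sync 3112180 KB).
May 17 20:35:46 nfsa2.mail.back kernel: drbd0: Resync started as target (need to sync 214165504 KB).
May 17 20:35:46 nfsa1.mail.back kernel: drbd0: Resync started as source (need to sync 214165504 KB).
"""

# Assumption: syslog lines look exactly like the ones above.
PATTERN = re.compile(
    r"^(?P<stamp>\w+ +\d+ [\d:]+) (?P<host>\S+) kernel: (?P<dev>\w+): "
    r"Resync started as (?P<role>source|target) \(need to sync (?P<kb>\d+) KB\)"
)


def resync_mismatches(log_text):
    """Return (stamp, dev, source_kb, target_kb) for resyncs where the nodes disagree."""
    starts = {}
    for line in log_text.splitlines():
        m = PATTERN.match(line)
        if m:
            # Assumption: source and target log the start in the same second.
            key = (m["stamp"], m["dev"])
            starts.setdefault(key, {})[m["role"]] = int(m["kb"])
    return [
        (stamp, dev, roles["source"], roles["target"])
        for (stamp, dev), roles in starts.items()
        if len(roles) == 2 and roles["source"] != roles["target"]
    ]


for stamp, dev, src, tgt in resync_mismatches(LOG):
    print(f"{stamp} {dev}: source {src} KB, target {tgt} KB, diff {src - tgt} KB")
```

on the lines above this reports only the 20:33:38 resync, with a
difference of 8 KB.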

> May 17 20:35:02 nfsa2.mail.back kernel: drbd0: Resync done (total 85 sec; 36613 K/sec)
> May 17 20:35:46 nfsa2.mail.back kernel: drbd0: Resync started as target (need to sync 214165504 KB).
> May 17 20:35:46 nfsa1.mail.back kernel: drbd0: Resync started as source (need to sync 214165504 KB).

ok, *this* is the invalidate, where they *do* agree.

> May 17 22:05:54 nfsa2.mail.back kernel: drbd0: Resync done (total 5410 sec; 39586 K/sec)
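
as a sanity check on the two "Resync done" lines: the logged rates are
consistent with the amount to sync divided by the elapsed seconds, rounded
down. that rounding is an assumption read off the numbers, not something
checked against the drbd source, and resync_rate is just a name picked
for this sketch.

```python
def resync_rate(total_kb, seconds):
    """Average resync rate in K/sec, rounded down (assumed rounding)."""
    return total_kb // seconds


# First resync: target needed 3112172 KB, log says 85 sec.
print(resync_rate(3112172, 85))
# Full resync after the invalidate: 214165504 KB, log says 5410 sec.
print(resync_rate(214165504, 5410))
```

this gives 36613 and 39586, matching the K/sec values in the log.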
> 


if you get this again, try a "drbdadm connect drbd0", which actually is
a _re_connect, and should reset some internal counters, then the
"forgotten" 8K should be synced, too...

	Lars Ellenberg


