Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
/ 2004-05-18 11:28:17 +0400 \ Eugene Crosser: > On Tue, 2004-05-18 at 01:04, Lars Ellenberg wrote: > > > > I see this problem again: secondary reports "resync complete" but > > > primary continues to display "sync'ed:100.0%" for a long time after > > > that. > > > can you please verify that they agree about the amount that needs to be synced? > > that should be logged as "need to sync <some number> KB" somewhere... > > In yesterday's log, there is a case when they do not match. Scenario > was like this: at 20:33 I put two nodes in operation after upgrading to > current-at-that-time CVS. They quckly got in sync despite "need to > sync" size did not match. But as I suspected that not all data was > synchronized previously (as you told there was a bug causing incomplete > synchronization), I ran "drbdadm invalidate all" on the secondary. It > was at 20:35. About an hour later, sync was complete on secondary but > primary thought that it was not completed until ca 22:30 when I wrote my > message. > > May 17 20:33:37 nfsa2.mail.back kernel: drbd0: size = 214165504 KB > May 17 20:33:38 nfsa2.mail.back kernel: drbd0: 28 KB marked out-of-sync by on disk bit-map. > May 17 20:33:38 nfsa2.mail.back kernel: drbd0: Found 6 transactions (324 active extents) in activity log. > May 17 20:33:38 nfsa2.mail.back kernel: drbd0: Marked additional 1052644 KB as out-of-sync based on AL. > May 17 20:33:38 nfsa2.mail.back kernel: drbd0: Connection established. > May 17 20:33:38 nfsa1.mail.back kernel: drbd0: Connection established. > May 17 20:33:38 nfsa2.mail.back kernel: drbd0: Resync started as target (need to sync 3112172 KB). > May 17 20:33:38 nfsa1.mail.back kernel: drbd0: Resync started as source (need to sync 3112180 KB). *here* they do not agree. strange. I need to understand this. > May 17 20:35:02 nfsa2.mail.back kernel: drbd0: Resync done (total 85 sec; 36613 K/sec) > May 17 20:35:46 nfsa2.mail.back kernel: drbd0: Resync started as target (need to sync 214165504 KB). > May 17 20:35:46 nfsa1.mail.back kernel: drbd0: Resync started as source (need to sync 214165504 KB). ok, *this* is the invalidate, where they *do* agree. > May 17 22:05:54 nfsa2.mail.back kernel: drbd0: Resync done (total 5410 sec; 39586 K/sec) > if you get this again, try a "drbdadm connect drbd0", which actually is a _re_connect, and should reset some internal counters, then the "forgotten" 8K should be synced, too... Lars Ellenberg