Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Philipp and guys, I seem to hit the same problem today that I already reported long ago and that was apparently fixed long ago. I am running kernel.org kernel 2.6.10-rc2 and drbd branch/drbd-0.7 checked out this morning, which reports itself as 0.7.6. I was testing my system reaction to pulling out a disk; it did all right, drbd noticed underlying device failure and dutyfully panicked. After reconnecting the disk (to hardware RAID0), I got the system up and ran "drdbdadm invalidate all" on the system with would-be-replaced disk. In half an hour, SyncTarget reported sync complete, but the SyncSource did not: Nov 23 14:00:58 nfsb2.mail.back kernel: drbd0: 214540288 KB now marked out-of-sync by on disk bit-map. Nov 23 14:00:58 nfsb2.mail.back kernel: drbd0: drbd0_receiver [155]: cstate Connected --> SyncSource Nov 23 14:00:58 nfsb2.mail.back kernel: drbd0: Resync started as SyncSource (need to sync 214540288 KB [53635072 bits set]). Nov 23 14:02:43 nfsb1.mail.back ntpd[133]: time set -0.032269 s Nov 23 14:16:10 nfsb2.mail.back -- MARK -- Nov 23 14:18:05 nfsb1.mail.back ntpd[133]: time reset -0.145189 s Nov 23 14:36:10 nfsb2.mail.back -- MARK -- Nov 23 14:38:45 nfsb1.mail.back -- MARK -- Nov 23 14:41:42 nfsb1.mail.back kernel: drbd0: Resync done (total 2445 sec; paused 0 sec; 87744 K/sec) Nov 23 14:41:42 nfsb1.mail.back kernel: drbd0: drbd0_worker [152]: cstate SyncTarget --> Connected Now, this is /proc/drbd on both notes: root at hanode1:~# cat /proc/drbd version: 0.7.6 (api:77/proto:74) SVN Revision: 1649 build by crosser at ariel.sovam.com, 2004-11-23 11:15:51 0: cs:Connected st:Secondary/Primary ld:Consistent ns:0 nr:234193156 dw:234193156 dr:0 al:0 bm:27472 lo:0 pe:0 ua:0 ap:0 root at hanode2:~# cat /proc/drbd version: 0.7.6 (api:77/proto:74) SVN Revision: 1649 build by crosser at ariel.sovam.com, 2004-11-23 11:15:51 0: cs:SyncSource st:Primary/Secondary ld:Consistent ns:234169036 nr:17584552 dw:38525788 dr:219993817 al:8449 bm:30241 lo:0 pe:0 ua:0 ap:0 [===================>] sync'ed:100.0% (1/209512)M finish: 0:00:00 speed: 120 (44,464) K/sec Also interesting thing, on the SyncSource note (hanode2) `uptime' reports unreasonable loadaverage: root at hanode2:~# uptime 15:34:27 up 3:18, 1 user, load average: 407.18, 593.69, 871.42 which is simply impossible given that there are only 119 processes... Running "drbdadm disconnect all" and "drbdadm connect all" apparently put things in order. Maybe. Eugene -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 256 bytes Desc: OpenPGP digital signature URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20041123/743f5a69/attachment.pgp>