Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
This is just an information for the brave people testing drbd-0.7.
Of course it is also a hint for very clever people where there are
places to earn some reputation for "finding bugs by looking at the code" :)
###############################################################################
###############################################################################
Apr 20 16:36:33 localhost kernel: drbd0: drbd_try_clear_on_disk_bm: (ext->lce.refcnt) in /root/src/drbd07/drbd/drbd_actlog.c:636
[...]
refcnt is positive while left == 0 ? Why ?
###############################################################################
###############################################################################
Apr 20 16:56:13 localhost kernel: drbd0: lc_get() failed! Probabely something stays dir
ty in the on disk BM. (resync LRU too small)
Apr 20 16:56:13 localhost kernel: drbd0: resync_locked=2 nr_elements=17
[...]
Could be the LC_STARVING flag, but why is it set at all ?
-> I am trying to reproduce, with debug code that prints the flags.
###############################################################################
###############################################################################
452 ? SW 9:09 [drbd0_receiver]
455 ? SW 1:33 [drbd0_asender]
456 ? SW 6:55 [drbd0_worker]
458 ? DW 0:07 [drbd1_receiver] <---
461 ? SW 0:27 [drbd1_asender]
462 ? SW 0:06 [drbd1_worker]
464 ? SW 0:00 [drbd2_receiver]
467 ? SW 0:00 [drbd2_asender]
468 ? SW 0:00 [drbd2_worker]
470 ? SW 0:56 [drbd3_receiver]
452 schedule_timeout drbd0_receiver
455 schedule_timeout drbd0_asender
456 down_interruptibl drbd0_worker
458 drbd_get_ee drbd1_receiver <----
461 schedule_timeout drbd1_asender
462 down_interruptibl drbd1_worker
464 schedule_timeout drbd2_receiver
467 schedule_timeout drbd2_asender
468 down_interruptibl drbd2_worker
470 schedule_timeout drbd3_receiver
Active Internet connections (w/o servers)
Proto Recv-Q Send-Q Local Address Foreign Address State
tcp 0 0 10.22.22.1:7788 10.22.22.2:32789 ESTABLISHED
tcp 0 0 10.22.22.1:7790 10.22.22.2:32791 ESTABLISHED
tcp 0 0 10.22.22.1:7789 10.22.22.2:32790 ESTABLISHED
tcp 0 0 mary.linbit:ssh mescal.linbit:32814 ESTABLISHED
tcp 0 0 10.22.22.1:32771 10.22.22.2:7790 ESTABLISHED
tcp 35784 0 10.22.22.1:32770 10.22.22.2:7789 ESTABLISHED
tcp 0 0 10.22.22.1:32769 10.22.22.2:7788 ESTABLISHED
on mary (smp)
root at mary:~# cat /proc/drbd
version: 0.7-pre6 (api:72/proto:72)
0: cs:Connected st:Secondary/Primary ld:Consistent
ns:21139796 nr:4811460 dw:4811460 dr:21139796 al:0 bm:1252 lo:0 pe:0 ua:0
1: cs:SyncSource st:Primary/Secondary ld:Consistent
ns:5778428 nr:0 dw:0 dr:228668 al:9021 bm:13 lo:0 pe:0 ua:0
[=======>............] sync'ed: 39.0% (352/576)M
finish: 220:49:40 speed: 0 (3) K/sec
2: cs:Connected st:Secondary/Secondary ld:Consistent
ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0
3: cs:WFConnection st:Secondary/Unknown ld:Inconsistent
ns:0 nr:1408716 dw:1408716 dr:0 al:0 bm:86 lo:0 pe:0 ua:0
Missing wakeup ?
###############################################################################
###############################################################################
resource r0 { syncer { group=1 } }
resource r1 { syncer { group=2 } }
resource r2 { syncer { group=3 } }
resource r3 { syncer { group=1 } }
r1 runs resync
drbdadm invalidate r2
r1 goes into syncPause
drbdadm down r2
r1 stays in syncPause <----
###############################################################################
###############################################################################
-Philipp
--
: Dipl-Ing Philipp Reisner Tel +43-1-8178292-50 :
: LINBIT Information Technologies GmbH Fax +43-1-8178292-82 :
: Schönbrunnerstr 244, 1120 Vienna, Austria http://www.linbit.com :