This is just some information for the brave people testing drbd-0.7.
Of course it is also a hint for very clever people about where there are
places to earn some reputation for "finding bugs by looking at the code" :)

###############################################################################
###############################################################################

Apr 20 16:36:33 localhost kernel: drbd0: drbd_try_clear_on_disk_bm: (ext->lce.refcnt) in /root/src/drbd07/drbd/drbd_actlog.c:636
[...]

refcnt is positive while left == 0? Why?

###############################################################################
###############################################################################

Apr 20 16:56:13 localhost kernel: drbd0: lc_get() failed! Probabely something stays dirty in the on disk BM. (resync LRU too small)
Apr 20 16:56:13 localhost kernel: drbd0: resync_locked=2 nr_elements=17
[...]

Could be the LC_STARVING flag, but why is it set at all?
-> I am trying to reproduce, with debug code that prints the flags.

###############################################################################
###############################################################################

  452 ?        SW     9:09 [drbd0_receiver]
  455 ?        SW     1:33 [drbd0_asender]
  456 ?        SW     6:55 [drbd0_worker]
  458 ?        DW     0:07 [drbd1_receiver]   <---
  461 ?        SW     0:27 [drbd1_asender]
  462 ?        SW     0:06 [drbd1_worker]
  464 ?        SW     0:00 [drbd2_receiver]
  467 ?        SW     0:00 [drbd2_asender]
  468 ?        SW     0:00 [drbd2_worker]
  470 ?        SW     0:56 [drbd3_receiver]

  452 schedule_timeout  drbd0_receiver
  455 schedule_timeout  drbd0_asender
  456 down_interruptibl drbd0_worker
  458 drbd_get_ee       drbd1_receiver   <----
  461 schedule_timeout  drbd1_asender
  462 down_interruptibl drbd1_worker
  464 schedule_timeout  drbd2_receiver
  467 schedule_timeout  drbd2_asender
  468 down_interruptibl drbd2_worker
  470 schedule_timeout  drbd3_receiver

Active Internet connections (w/o servers)
Proto Recv-Q Send-Q Local Address           Foreign Address         State
tcp        0      0 10.22.22.1:7788         10.22.22.2:32789        ESTABLISHED
tcp        0      0 10.22.22.1:7790         10.22.22.2:32791        ESTABLISHED
tcp        0      0 10.22.22.1:7789         10.22.22.2:32790        ESTABLISHED
tcp        0      0 mary.linbit:ssh         mescal.linbit:32814     ESTABLISHED
tcp        0      0 10.22.22.1:32771        10.22.22.2:7790         ESTABLISHED
tcp    35784      0 10.22.22.1:32770        10.22.22.2:7789         ESTABLISHED
tcp        0      0 10.22.22.1:32769        10.22.22.2:7788         ESTABLISHED

on mary (smp)

root@mary:~# cat /proc/drbd
version: 0.7-pre6 (api:72/proto:72)

 0: cs:Connected st:Secondary/Primary ld:Consistent
    ns:21139796 nr:4811460 dw:4811460 dr:21139796 al:0 bm:1252 lo:0 pe:0 ua:0
 1: cs:SyncSource st:Primary/Secondary ld:Consistent
    ns:5778428 nr:0 dw:0 dr:228668 al:9021 bm:13 lo:0 pe:0 ua:0
        [=======>............] sync'ed: 39.0% (352/576)M
        finish: 220:49:40 speed: 0 (3) K/sec
 2: cs:Connected st:Secondary/Secondary ld:Consistent
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0
 3: cs:WFConnection st:Secondary/Unknown ld:Inconsistent
    ns:0 nr:1408716 dw:1408716 dr:0 al:0 bm:86 lo:0 pe:0 ua:0

Missing wakeup?
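
For testers who want to catch the same symptom automatically, here is a minimal sketch (not part of drbd) that scans /proc/drbd text for devices in a Sync* connection state whose reported resync speed is 0. The function name stalled_resyncs and the regular expressions are my own assumptions, based only on the output format pasted above; I am also assuming the first number after "speed:" is the most recent rate and the one in parentheses is the average.

```python
import re

# Matches a device header line such as " 1: cs:SyncSource st:Primary/Secondary ..."
DEVICE_RE = re.compile(r"^\s*(\d+):\s+cs:(\S+)")
# Matches "speed: 0 (3) K/sec"; commas are allowed in case large numbers are grouped
SPEED_RE = re.compile(r"speed:\s*([\d,]+)\s*\(([\d,]+)\)\s*K/sec")


def stalled_resyncs(proc_drbd_text):
    """Return the minor numbers of syncing devices that report a speed of 0."""
    stalled = []
    current = None  # (minor, connection state) of the device being parsed
    for line in proc_drbd_text.splitlines():
        dev = DEVICE_RE.match(line)
        if dev:
            current = (int(dev.group(1)), dev.group(2))
            continue
        speed = SPEED_RE.search(line)
        if speed and current is not None and current[1].startswith("Sync"):
            if int(speed.group(1).replace(",", "")) == 0:
                stalled.append(current[0])
    return stalled


# Sample taken from the /proc/drbd output quoted in this mail.
SAMPLE = """\
version: 0.7-pre6 (api:72/proto:72)

 0: cs:Connected st:Secondary/Primary ld:Consistent
    ns:21139796 nr:4811460 dw:4811460 dr:21139796 al:0 bm:1252 lo:0 pe:0 ua:0
 1: cs:SyncSource st:Primary/Secondary ld:Consistent
    ns:5778428 nr:0 dw:0 dr:228668 al:9021 bm:13 lo:0 pe:0 ua:0
        [=======>............] sync'ed: 39.0% (352/576)M
        finish: 220:49:40 speed: 0 (3) K/sec
 2: cs:Connected st:Secondary/Secondary ld:Consistent
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0
"""

if __name__ == "__main__":
    print(stalled_resyncs(SAMPLE))
```

On a test machine one could feed it the contents of /proc/drbd instead of SAMPLE. A single reading of 0 may just be a momentary pause, so it is probably only meaningful if the same device shows up on several consecutive checks.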
###############################################################################
###############################################################################

resource r0 { syncer { group=1 } }
resource r1 { syncer { group=2 } }
resource r2 { syncer { group=3 } }
resource r3 { syncer { group=1 } }

r1 runs resync
drbdadm invalidate r2
r1 goes into syncPause
drbdadm down r2
r1 stays in syncPause   <----

###############################################################################
###############################################################################

-Philipp
-- 
: Dipl-Ing Philipp Reisner                      Tel +43-1-8178292-50 :
: LINBIT Information Technologies GmbH          Fax +43-1-8178292-82 :
: Schönbrunnerstr 244, 1120 Vienna, Austria    http://www.linbit.com :