[DRBD-user] drbd 0.6.12 and 0.6.7: Epoch set size wrong; tl messed up; transferlog too small

Nick Urbanik nick.urbanik at optusnet.com.au
Sun Nov 25 01:37:07 CET 2007

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


Dear Folks,

(sending again without the exclamation marks in the subject, as a spam
system categorised it as spam).

I am using DRBD 0.6.12 and have created a second drbd device on two
200MB raw disks.  I am syncing data, and am copying data, and all was
going nicely until these messages began to appear in
/var/log/messages:
Nov 23 00:23:51 machine1 kernel: drbd1: tl messed up!
Nov 23 00:23:51 machine1 kernel: drbd1: Epoch set size wrong!!found=192 
reported=191 Nov 23 00:23:51 machine1 kernel: drbd1: transferlog too 
small!! 
What does this mean?  Will I lose data?  Is something configured
wrongly?

Note: heartbeat is monitoring the first device, but not the second, on
which these errors are being reported.

I am migrating from the first (full) device to the second (much
bigger) device.

The OS is Red Hat 7.3.

On machine1:
version: 0.6.7 (api:63/proto:62)

0: cs:Connected st:Primary/Secondary ns:33570316 nr:0 dw:3380216 dr:92033725 pe:0 ua:0
1: cs:SyncingAll st:Primary/Secondary ns:89834460 nr:0 dw:19728732 dr:87334457 pe:236 ua:0
         [=======>............] sync'ed: 35.8% (153198/238475)M
         finish: 1:19h speed: 33,305 (32,632) K/sec


On machine2:
$ cat /proc/drbd
version: 0.6.12 (api:64/proto:62)

0: cs:Connected st:Secondary/Primary ns:0 nr:33575328 dw:33575328 dr:0 pe:0 ua:0
1: cs:SyncingAll st:Secondary/Primary ns:0 nr:90748404 dw:90748404 dr:0 pe:0 ua:0
         [=======>............] sync'ed: 36.2% (152321/238475)M
         finish: 1:14:24h speed: 35,409 (32,626) K/sec

On both machine1 and machine2:
==============================

$ cat /etc/drbd.conf
global {
      minor_count=2
}

resource drbd0 {
     protocol = B
     fsckcmd  = /bin/true

     disk {
         disk-size = 28555537
         do-panic
     }

     net {
         sync-min =  4M
         sync-max = 32M
     }

     on machine1 {
         device  = /dev/nb0
         disk    = /dev/hda2
         address = 10.0.0.1
         port    = 7789
     }

     on machine2 {
         device  = /dev/nb0
         disk    = /dev/hda2
         address = 10.0.0.2
         port    = 7789
     }
}

resource drbd1 {
     protocol = B
     fsckcmd  = /bin/true

     disk {
         disk-size = 244198584
         do-panic
     }

     net {
         sync-min =  4M
         sync-max = 32M
     }

     on machine1 {
         device  = /dev/nb1
         disk    = /dev/hdg
         address = 10.0.0.1
         port    = 7790
     }

     on machine2 {
         device  = /dev/nb1
         disk    = /dev/hdg
         address = 10.0.0.2
         port    = 7790
     }
}

On machine1:
============
$ cat /proc/partitions
major minor  #blocks  name     rio rmerge rsect ruse wio wmerge wsect wuse running use aveq

   34     0  244198584 hdg 1506700 20927911 179476319 1165770 611753 15056690 71238766 29754250 -22912 3051439 38944540
    3     0   39082680 hda 1117758094 1871924582 -1852315458 23141603 242132370 367857982 618507728 15700892 -22881 2944459 15334072
    3     1      32098 hda1 34 110 288 500 16 8 48 370 0 810 870
    3     2   28555537 hda2 1112463027 1812472900 1924683194 17586248 228913384 316482511 100669744 35975144 45 30369384 32800729
    3     3    4192965 hda3 4101685 58233923 498680650 12929453 8676554 44992238 429354760 32853471 1 6859787 2835851
    3     4          1 hda4 0 0 0 0 0 0 0 0 0 0 0
    3     5    4192933 hda5 567596 1217482 14280642 13755677 2013803 1123378 26134888 26976473 0 38549017 41179120
    3     6    2096451 hda6 625745 126 5006968 21819177 2528613 5259847 62348288 5794779 0 9546773 27690526

On machine2:
============
$ cat /proc/partitions
major minor  #blocks  name     rio rmerge rsect ruse wio wmerge wsect wuse running use aveq

   34     0  244198584 hdg 5 39 88 60 1351245 19640962 167937800 27357890 -21247 22893982 16933797
    3     0   39082680 hda 8870 9418 144574 50130 220670609 296640908 4168607888 13512501 -21255 22892712 41343752
    3     1      32098 hda1 25 52 154 270 13 11 48 1540 0 630 1810
    3     2   28555537 hda2 10 30 80 90 211799029 288685393 4031964176 22807754 0 37013146 23287664
    3     3    4192965 hda3 1320 2576 30642 12070 4752785 2782924 60288056 17626560 0 13502690 17638500
    3     4          1 hda4 0 0 0 0 0 0 0 0 0 0 0
    3     5    4192933 hda5 7501 6687 113506 37510 4118782 5172580 76355608 16026320 0 5825480 16063830
    3     6    2096451 hda6 3 0 24 30 0 0 0 0 0 30 30


(Please don't reply saying "|_|pgr4|)3 d00dz!" :-)
-- 
Nick Urbanik http://nicku.org           nick.urbanik at optusnet.com.au
GPG: 7FFA CDC7 5A77 0558 DC7A 790A 16DF EC5B BB9D 2C24  ID: BB9D2C24



----- End forwarded message -----

-- 
Nick Urbanik http://nicku.org 808-71011 nick.urbanik at optusnet.com.au
GPG: 7FFA CDC7 5A77 0558 DC7A 790A 16DF EC5B BB9D 2C24  ID: BB9D2C24
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20071125/c1ef8707/attachment.pgp>


More information about the drbd-user mailing list