[DRBD-user] Replication of 3TB goes to stand Alone -workerterminated

Francis I. Malolot francis_m at proware.com.tw
Tue Sep 13 13:26:39 CEST 2005

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


***********************
This mail has been scanned by InterScan MSS.
***********-***********


Hey Guys,


sending you the unfiltered kernel
bufffer log, Okey we assume that is a memory
issue but we increase our memory to 2GB and set vmalloc to 512m
its still the same problem.we also check vmstat it also did not
got low memory.



arp_tables: (C) 2002 David S. Miller
input: AT Translated Set 2 keyboard on isa0060/serio0
TCP bic registered
NET: Registered protocol family 1
NET: Registered protocol family 17
Using IPI Shortcut mode
md: Autodetecting RAID arrays.
md: autorun ...
md: ... autorun DONE.
RAMDISK: Compressed image found at block 0
VFS: Mounted root (ext2 filesystem).
ACPI: PCI Interrupt 0000:01:0f.0[A] -> GSI 17 (level, low) -> IRQ 20
sym0: <1010-66> rev 0x1 at pci 0000:01:0f.0 irq 20
sym0: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking
sym0: open drain IRQ line driver, using on-chip SRAM
sym0: using LOAD/STORE-based firmware.
sym0: handling phase mismatch from SCRIPTS.
sym0: SCSI BUS has been reset.
scsi1 : sym-2.2.1
  Vendor: SN-3143P  Model:                   Rev: 0001
  Type:   Direct-Access                      ANSI SCSI revision: 03
 target1:0:0: tagged command queuing enabled, command queue depth 16.
 target1:0:0: Beginning Domain Validation
 target1:0:0: asynchronous.
 target1:0:0: wide asynchronous.
 target1:0:0: FAST-80 WIDE SCSI 160.0 MB/s DT (12.5 ns, offset 62)
 target1:0:0: Ending Domain Validation
SCSI device sda: 4294950912 512-byte hdwr sectors (2199015 MB)
SCSI device sda: drive cache: write back
SCSI device sda: 4294950912 512-byte hdwr sectors (2199015 MB)
SCSI device sda: drive cache: write back
 sda: sda1 sda2
Attached scsi disk sda at scsi1, channel 0, id 0, lun 0
Attached scsi generic sg0 at scsi1, channel 0, id 0, lun 0,  type 0
  Vendor: SN-3143P  Model:                   Rev: 0001
  Type:   Direct-Access                      ANSI SCSI revision: 03
 target1:0:0: tagged command queuing enabled, command queue depth 16.
 target1:0:0: FAST-80 WIDE SCSI 160.0 MB/s DT (12.5 ns, offset 62)
SCSI device sdb: 100532224 512-byte hdwr sectors (51472 MB)
SCSI device sdb: drive cache: write back
SCSI device sdb: 100532224 512-byte hdwr sectors (51472 MB)
SCSI device sdb: drive cache: write back
 sdb: sdb1 sdb2
Attached scsi disk sdb at scsi1, channel 0, id 0, lun 1
Attached scsi generic sg1 at scsi1, channel 0, id 0, lun 1,  type 0
  Vendor: SN-3143P  Model:                   Rev: 0001
  Type:   Direct-Access                      ANSI SCSI revision: 03
 target1:0:0: tagged command queuing enabled, command queue depth 16.
 target1:0:0: FAST-80 WIDE SCSI 160.0 MB/s DT (12.5 ns, offset 62)
SCSI device sdc: 2344239104 512-byte hdwr sectors (1200250 MB)
SCSI device sdc: drive cache: write back
SCSI device sdc: 2344239104 512-byte hdwr sectors (1200250 MB)
SCSI device sdc: drive cache: write back
 sdc: sdc1 sdc2
Attached scsi disk sdc at scsi1, channel 0, id 0, lun 2
Attached scsi generic sg2 at scsi1, channel 0, id 0, lun 2,  type 0
ACPI: PCI Interrupt 0000:01:0f.1[A] -> GSI 17 (level, low) -> IRQ 20
sym1: <1010-66> rev 0x1 at pci 0000:01:0f.1 irq 20
sym1: Symbios NVRAM, ID 6, Fast-80, SE, parity checking
sym1: open drain IRQ line driver, using on-chip SRAM
sym1: using LOAD/STORE-based firmware.
sym1: handling phase mismatch from SCRIPTS.
sym1: SCSI BUS has been reset.
scsi2 : sym-2.2.1
e100: Intel(R) PRO/100 Network Driver, 3.4.8-k2-NAPI
e100: Copyright(c) 1999-2005 Intel Corporation
ACPI: PCI Interrupt 0000:01:05.0[A] -> GSI 21 (level, low) -> IRQ 21
e100: eth0: e100_probe: addr 0xed064000, irq 21, MAC addr 00:10:F3:05:00:78
Intel(R) PRO/1000 Network Driver - version 6.0.60-k2
Copyright (c) 1999-2005 Intel Corporation.
ACPI: PCI Interrupt 0000:01:07.0[A] -> GSI 22 (level, low) -> IRQ 22
e1000: eth1: e1000_probe: Intel(R) PRO/1000 Network Connection
Freeing unused kernel memory: 248k freed
Fusion MPT base driver 3.03.02
Copyright (c) 1999-2005 LSI Logic Corporation
Fusion MPT SPI Host driver 3.03.02
EXT2-fs warning: mounting unchecked fs, running e2fsck is recommended
SCSI device sda: 4294950912 512-byte hdwr sectors (2199015 MB)
SCSI device sda: drive cache: write back
 sda: sda1 sda2
SCSI device sda: 4294950912 512-byte hdwr sectors (2199015 MB)
SCSI device sda: drive cache: write back
 sda:
SCSI device sdb: 100532224 512-byte hdwr sectors (51472 MB)
SCSI device sdb: drive cache: write back
 sdb: sdb1 sdb2
SCSI device sdb: 100532224 512-byte hdwr sectors (51472 MB)
SCSI device sdb: drive cache: write back
 sdb:
SCSI device sdc: 2344239104 512-byte hdwr sectors (1200250 MB)
SCSI device sdc: drive cache: write back
 sdc: sdc1 sdc2
SCSI device sdc: 2344239104 512-byte hdwr sectors (1200250 MB)
SCSI device sdc: drive cache: write back
 sdc:
i2c /dev entries driver
Ethernet Channel Bonding Driver: v2.6.3 (June 8, 2005)
bonding: MII link monitoring set to 100 ms
drbd: initialised. Version: 0.7.13 (api:77/proto:74)
drbd: SVN Revision: 1942 build by root at localhost, 2005-09-12 17:42:20
drbd: registered as block device major 147
e100: eth0: e100_watchdog: link up, 100Mbps, full-duplex
SCSI device sda: 4294950912 512-byte hdwr sectors (2199015 MB)
SCSI device sda: drive cache: write back
 sda:
SCSI device sda: 4294950912 512-byte hdwr sectors (2199015 MB)
SCSI device sda: drive cache: write back
 sda:
SCSI device sda: 4294950912 512-byte hdwr sectors (2199015 MB)
SCSI device sda: drive cache: write back
 sda:
SCSI device sda: 4294950912 512-byte hdwr sectors (2199015 MB)
SCSI device sda: drive cache: write back
 sda: sda1 sda2
XFS mounting filesystem dm-3
Ending clean XFS mount for filesystem: dm-3
XFS quotacheck dm-3: Please wait.
XFS quotacheck dm-3: Done.
XFS mounting filesystem dm-4
Ending clean XFS mount for filesystem: dm-4
XFS quotacheck dm-4: Please wait.
XFS quotacheck dm-4: Done.
Adding 524280k swap on /dev/VG01200509131906proNASVG/SWAP.  Priority:-1 
extents:1
XFS mounting filesystem dm-0
Ending clean XFS mount for filesystem: dm-0
SCSI device sdc: 2344239104 512-byte hdwr sectors (1200250 MB)
SCSI device sdc: drive cache: write back
 sdc:
NET: Registered protocol family 5
SCSI device sdc: 2344239104 512-byte hdwr sectors (1200250 MB)
SCSI device sdc: drive cache: write back
 sdc:
SCSI device sdc: 2344239104 512-byte hdwr sectors (1200250 MB)
SCSI device sdc: drive cache: write back
 sdc:
SCSI device sdc: 2344239104 512-byte hdwr sectors (1200250 MB)
SCSI device sdc: drive cache: write back
 sdc: sdc1 sdc2
SCSI device sdb: 100532224 512-byte hdwr sectors (51472 MB)
SCSI device sdb: drive cache: write back
 sdb:
SCSI device sdb: 100532224 512-byte hdwr sectors (51472 MB)
SCSI device sdb: drive cache: write back
 sdb:
SCSI device sdb: 100532224 512-byte hdwr sectors (51472 MB)
SCSI device sdb: drive cache: write back
 sdb:
SCSI device sdb: 100532224 512-byte hdwr sectors (51472 MB)
SCSI device sdb: drive cache: write back
 sdb: sdb1 sdb2
e1000: eth1: e1000_watchdog_task: NIC Link is Up 1000 Mbps Full Duplex
e1000: eth1: e1000_watchdog_task: NIC Link is Up 1000 Mbps Full Duplex
nfsd: last server has exited
nfsd: unexporting all filesystems
NET: Unregistered protocol family 5
NET: Registered protocol family 5
XFS mounting filesystem dm-5
Ending clean XFS mount for filesystem: dm-5
XFS quotacheck dm-5: Please wait.
XFS quotacheck dm-5: Done.
drbd0: Creating state block
drbd0: resync bitmap: bits=768000000 words=24000000
drbd0: size = 2929 GB (3072000000 KB)
drbd0: 3072000000 KB now marked out-of-sync by on disk bit-map.
drbd0: Assuming that all blocks are out of sync (aka FullSync)
drbd0: 3072000000 KB now marked out-of-sync by on disk bit-map.
drbd0: drbdsetup [5288]: cstate Unconfigured --> StandAlone
drbd0: drbdsetup [6309]: cstate StandAlone --> Unconnected
drbd0: drbdsetup [6309]: cstate Unconnected --> StandAlone
drbd0: drbdsetup [6309]: cstate StandAlone --> Unconfigured
drbd0: worker terminated
XFS mounting filesystem dm-5
Ending clean XFS mount for filesystem: dm-5











/ 2005-09-13 15:44:08 +0800
\ Francis I. Malolot:
>
>
> People,
>
> we had setup an volume in replication,
> unfortunately we are up to  2TB, if more
> than that replication won't sync_ed.But it
> says that drbd support up to 4TB.
>
> Here's our setup:
> 3TBvolumes for rep with external meta data
> Drbd on top of LVM(vg 4.5TB) and XFS
> vmalloc=512m
> OS =32bits
> ram=1GB
>
> kernel buffer log:
> XFS mounting filesystem dm-6
> Ending clean XFS mount for filesystem: dm-6
> XFS quotacheck dm-6: Please wait.
> XFS quotacheck dm-6: Done.
> drbd0: Creating state block
> drbd0: resync bitmap: bits=768000000 words=24000000
> drbd0: size = 2929 GB (3072000000 KB)
> drbd0: 3072000000 KB now marked out-of-sync by on disk bit-map.
> drbd0: Assuming that all blocks are out of sync (aka FullSync)
> drbd0: 3072000000 KB now marked out-of-sync by on disk bit-map.
> drbd0: drbdsetup [8072]: cstate Unconfigured --> StandAlone
> drbd0: drbdsetup [9066]: cstate StandAlone --> Unconnected
> drbd0: drbdsetup [9066]: cstate Unconnected --> StandAlone
> drbd0: drbdsetup [9066]: cstate StandAlone --> Unconfigured
> drbd0: worker terminated
> XFS mounting filesystem dm-6
> Ending clean XFS mount for filesystem: dm-6

is it possible that there are messages missing
due to some priority filter?

even though it may be "suprising" to you, this looks like an out of
memory condition to me.

-- 
: Lars Ellenberg                                  Tel +43-1-8178292-0  :
: LINBIT Information Technologies GmbH            Fax +43-1-8178292-82 :
: Schoenbrunner Str. 244, A-1120 Vienna/Europe   http://www.linbit.com :
__
please use the "List-Reply" function of your email client.
_______________________________________________
drbd-user mailing list
drbd-user at lists.linbit.com
http://lists.linbit.com/mailman/listinfo/drbd-user






More information about the drbd-user mailing list