Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Hi. I'm using the 'Version: 0.7.6 (api:77)' version of drbd on a linux
2.6.7 (debian/ubuntu). Unfortunately, drbd more than occasionally
crashes on startup when inserting the 'drbd' module
into the kernel. The only solution are reboots and retries - sometimes
it works, sometimes not. I've
attached a kernel oops analysis and an output on fdisk. The oopses are
occurring on two
distinct but similar machines independently (both primary and slave),
also already with a
different partitioning (hda3 and hda4 are both drbd devices),
different sync protocol, etc.
This oops is really my only hint, nothing more in dmesg or in the
logs, when modprobing
drbd. Another thing is that the initial syncing is VERY VERY slow
(about 200-500 bytes/s),
although all HDD performance results seem ok (bonnie++ result also
attached at the very bottom).
Please - what might be going on?? Is this a known bug, and/or what
should I do further?
Thanks,
Akman
FDISK OUTPUT:
Disk /dev/hda: 300.0 GB, 300001443840 bytes
255 heads, 63 sectors/track, 36473 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes
Device Boot Start End Blocks Id System
/dev/hda1 1 122 979933+ 83 Linux
/dev/hda2 123 1338 9767520 83 Linux
/dev/hda3 1339 19041 142199347+ 83 Linux
/dev/hda4 19042 36473 140022540 83 Linux
KERNEL OOPS:
Unable to handle kernel paging request at virtual address df9193a0
printing eip:
df9193a0
*pde = 1ec15063
*pte = 00000000
Oops: 0000 [#1]
PREEMPT SMP
Modules linked in: drbd ehci_hcd uhci_hcd
CPU: 0
EIP: 0060:[<df9193a0>] Not tainted
EFLAGS: 00010246 (2.6.7-maturis-ha1)
eax: 00000000 ebx: 00000000 ecx: 00020d84 edx: c0107b94
esi: fffffff4 edi: 00000000 ebp: 00000000 esp: dd76af24
ds: 007b es: 007b ss: 0068
Process modprobe (pid: 2834, threadinfo=dd76a000 task=c17b6c10)
Stack: df8e8415 00020d84 000000d0 df923a10 df923b28 00000000 fffffff4 c0106240
df952aa0 c0106224 c0106224 c024696d c018caac 00000001 df952aa0 46f4b000
0805e5b8 00000010 dd76a000 c020a097 46f4b000 0002ecc8 0805e5b8 0805e5b8
Call Trace:
[<df8e8415>] [<c024696d>] [<c020a097>]
Code: Bad EIP value.
Reading Oops report from the terminal
Unable to handle kernel paging request at virtual address df9193a0
df9193a0
*pde = 1ec15063
Oops: 0000 [#1]
CPU: 0
EIP: 0060:[<df9193a0>] Not tainted
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010246 (2.6.7-maturis-ha1)
eax: 00000000 ebx: 00000000 ecx: 00020d84 edx: c0107b94
esi: fffffff4 edi: 00000000 ebp: 00000000 esp: dd76af24
ds: 007b es: 007b ss: 0068
Stack: df8e8415 00020d84 000000d0 df923a10 df923b28 00000000 fffffff4 c0106240
df952aa0 c0106224 c0106224 c024696d c018caac 00000001 df952aa0 46f4b000
0805e5b8 00000010 dd76a000 c020a097 46f4b000 0002ecc8 0805e5b8 0805e5b8
Call Trace:
[<df8e8415>] [<c024696d>] [<c020a097>]
Code: Bad EIP value.
>>EIP; df9193a0 <__crc_ip_mc_dec_group+4a627/5a5f7> <=====
>>edx; c0107b94 <malloc_sizes+b4/c0>
>>esi; fffffff4 <__kernel_rt_sigreturn+1bb4/????>
>>esp; dd76af24 <__crc_unregister_console+9b687/16440a>
Trace; df8e8415 <__crc_ip_mc_dec_group+1969c/5a5f7>
Trace; c024696d <sys_init_module+11d/1e0>
Trace; c020a097 <syscall_call+7/b>
1 warning and 1 error issued. Results may not be reliable.
$ modprobe -V
module-init-tools version 3.1-pre2
HDD PERFORMANCE:
$ bonnie++ -d /home -u root
Using uid:0, gid:0.
Writing with putc()...done
Writing intelligently...done
Rewriting...done
Reading with getc()...done
Reading intelligently...done
start 'em...done...done...done...
Create files in sequential order...done.
Stat files in sequential order...done.
Delete files in sequential order...done.
Create files in random order...done.
Stat files in random order...done.
Delete files in random order...done.
Version 1.03 ------Sequential Output------ --Sequential Input- --Random-
-Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Machine Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP
hydrogen 1G 8174 96 39555 39 17620 18 8422 95 42617 27 206.8 1
------Sequential Create------ --------Random Create--------
-Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
files /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP
16 391 99 +++++ +++ +++++ +++ 394 99 +++++ +++ 1585 98
hydrogen,1G,8174,96,39555,39,17620,18,8422,95,42617,27,206.8,1,16,391,99,+++++,+++,+++++,+++,394,99,+++++,+++,1585,98