Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Am 21.09.2009 um 13:35 schrieb Lars Ellenberg: > > > something that Oopses at offset 0 of some function is most likely a > compile problem, > i.e. I suggest that your kernel and drbd module do not fit together. Hi, I have experienced a similar looking problem on a cluster system also using Ubuntu: Systems (identically configured): Ubuntu 9.04 Server x86_64 server with Kernel 2.6.28-15-server (standard distro kernel) DRBD: as included in Ubuntu Server (8.3.0) Usage: KVM host (Software Raid 1 -> DRBD -> LVM -> KVM VMs) The problem appeared on the secondary directly after issuing "drbdadm verify all": > Sep 23 17:04:58 cluster2 kernel: [ 1579.818149] drbd0: > conn( Connected -> VerifyT ) > Sep 23 17:04:59 cluster2 kernel: [ 1579.820914] drbd1: > conn( Connected -> VerifyT ) > Sep 23 17:04:59 cluster2 kernel: [ 1579.830819] BUG: unable to > handle kernel NULL pointer dereference at 0000000000000030 > Sep 23 17:04:59 cluster2 kernel: [ 1579.830877] IP: > [<ffffffffa0105373>] w_e_end_ov_req+0x43/0x1a0 [drbd] > Sep 23 17:04:59 cluster2 kernel: [ 1579.830924] PGD 0 > Sep 23 17:04:59 cluster2 kernel: [ 1579.830949] Oops: 0000 [#1] SMP > Sep 23 17:04:59 cluster2 kernel: [ 1579.830979] last sysfs file: / > sys/module/drbd/parameters/cn_idx > Sep 23 17:04:59 cluster2 kernel: [ 1579.831011] Dumping ftrace buffer: > Sep 23 17:04:59 cluster2 kernel: [ 1579.831037] (ftrace buffer > empty) > Sep 23 17:04:59 cluster2 kernel: [ 1579.831063] CPU 0 > Sep 23 17:04:59 cluster2 kernel: [ 1579.831087] Modules linked in: > tun r8169 mii ipt_REJECT xt_tcpudp iptable_filter ip_tables x_tables > bridge stp kvm_intel kvm video output input_polldev drbd > sha1_generic lp parport iTCO_wdt iTCO_vendor_support pcspkr raid10 > multipath linear fbcon tileblit font bitblit softcursor r8168 > raid456 async_xor async_memcpy async_tx xor raid1 raid0 3w_9xxx > 3w_xxxx > Sep 23 17:04:59 cluster2 kernel: [ 1579.831340] Pid: 3286, comm: > drbd0_worker Not tainted 2.6.28-15-server #49-Ubuntu > Sep 23 17:04:59 cluster2 kernel: [ 1579.831389] RIP: 0010: > [<ffffffffa0105373>] [<ffffffffa0105373>] w_e_end_ov_req+0x43/0x1a0 > [drbd] > Sep 23 17:04:59 cluster2 kernel: [ 1579.831453] RSP: > 0018:ffff88023acc9e60 EFLAGS: 00010202 > Sep 23 17:04:59 cluster2 kernel: [ 1579.831482] RAX: > 0000000000000000 RBX: ffff880210115720 RCX: ffff88023acc9ecc > Sep 23 17:04:59 cluster2 kernel: [ 1579.831515] RDX: > 0000000000000000 RSI: 00000000000000d0 RDI: ffff88023c8cb000 > Sep 23 17:04:59 cluster2 kernel: [ 1579.831548] RBP: > ffff88023acc9e80 R08: 0000000000000004 R09: 0000000000000001 > Sep 23 17:04:59 cluster2 kernel: [ 1579.831581] R10: > 0000000000000000 R11: 00000000ffffffff R12: ffff88023c8cb000 > Sep 23 17:04:59 cluster2 kernel: [ 1579.831614] R13: > ffff880210115720 R14: ffff88023c8cb118 R15: 0000000000000000 > Sep 23 17:04:59 cluster2 kernel: [ 1579.831648] FS: > 0000000000000000(0000) GS:ffffffff80a9a000(0000) knlGS: > 0000000000000000 > Sep 23 17:04:59 cluster2 kernel: [ 1579.831698] CS: 0010 DS: 0018 > ES: 0018 CR0: 000000008005003b > Sep 23 17:04:59 cluster2 kernel: [ 1579.831728] CR2: > 0000000000000030 CR3: 0000000000201000 CR4: 00000000000026a0 > Sep 23 17:04:59 cluster2 kernel: [ 1579.831761] DR0: > 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 > Sep 23 17:04:59 cluster2 kernel: [ 1579.831795] DR3: > 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 > Sep 23 17:04:59 cluster2 kernel: [ 1579.831828] Process drbd0_worker > (pid: 3286, threadinfo ffff88023acc8000, task ffff88023bcdd980) > Sep 23 17:04:59 cluster2 kernel: [ 1579.831879] Stack: > Sep 23 17:04:59 cluster2 kernel: [ 1579.831901] ffff880210115720 > ffff88023c8cb000 ffff88023acc9eb0 ffff88023c8cb118 > Sep 23 17:04:59 cluster2 kernel: [ 1579.831943] ffff88023acc9f00 > ffffffffa01046e7 ffff88023c8cb620 ffff88023c8cb100 > Sep 23 17:04:59 cluster2 kernel: [ 1579.832004] ffff88023c8cb120 > ffff88023c8cb0f0 ffff88023acc9eb0 ffff88023acc9eb0 > Sep 23 17:04:59 cluster2 kernel: [ 1579.832082] Call Trace: > Sep 23 17:04:59 cluster2 kernel: [ 1579.832106] > [<ffffffffa01046e7>] drbd_worker+0x1e7/0x440 [drbd] > Sep 23 17:04:59 cluster2 kernel: [ 1579.832147] > [<ffffffff806997ad>] ? schedule_timeout+0x5d/0xd0 > Sep 23 17:04:59 cluster2 kernel: [ 1579.832185] > [<ffffffffa011de03>] drbd_thread_setup+0xe3/0x200 [drbd] > Sep 23 17:04:59 cluster2 kernel: [ 1579.832230] > [<ffffffff80213979>] child_rip+0xa/0x11 > Sep 23 17:04:59 cluster2 kernel: [ 1579.832264] > [<ffffffffa011dd20>] ? drbd_thread_setup+0x0/0x200 [drbd] > Sep 23 17:04:59 cluster2 kernel: [ 1579.832308] > [<ffffffff8021396f>] ? child_rip+0x0/0x11 > Sep 23 17:04:59 cluster2 kernel: [ 1579.832340] Code: 89 1c 24 4c 89 > 74 24 18 49 89 f5 0f 85 ef 00 00 00 48 8b 46 20 f6 40 18 01 0f 84 d9 > 00 00 00 48 8b 87 c8 05 00 00 be d0 00 00 00 <44> 8b 70 30 49 63 fe > e8 c1 de 1d e0 48 85 c0 48 89 c3 0f 84 b5 > Sep 23 17:04:59 cluster2 kernel: [ 1579.832595] RIP > [<ffffffffa0105373>] w_e_end_ov_req+0x43/0x1a0 [drbd] > Sep 23 17:04:59 cluster2 kernel: [ 1579.832639] RSP > <ffff88023acc9e60> > Sep 23 17:04:59 cluster2 kernel: [ 1579.832664] CR2: 0000000000000030 > Sep 23 17:04:59 cluster2 kernel: [ 1579.833056] ---[ end trace > bfafe5c2861ff3a8 ]--- I had done some pretty intensive testing (Bonnie++, dd, etc.) before and everything was ok...could not reproduce the problem afterwards, either. What bothers me (apart from the bug) is the following: - Your changelog says that compatibility with Linux 2.6.27, 2.6.28 and 2.6.29 was added in 8.3.1, while Ubuntu Server with kernel 2.6.28 ships with DRBD 8.3.0. Btw., the DRBD module appear to has been compiled in January, while the kernel was compiled in August. Did the Ubuntu guys really screw it up that bloody? And if so, am I the only one using the DRBD version included in Ubuntu server?? Thanks for your time, Thomas