[DRBD-user] 2.6.24-rc1 oops

Matteo matteo at rmnet.it
Wed Nov 7 10:10:52 CET 2007

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


HI all folks,

anyone tried 8.2.1 with kernel 2.6.24-rc1?
i got a kernel oops while starting drbd:

drbd: initialised. Version: 8.2.1 (api:86/proto:86-87)
drbd: GIT-hash: 318925802fc2638479ad090b73d7af45503dd184 build by root at nas1,
2007-11-07 02:57:27
drbd: registered as block device major 147
drbd: minor_table @ 0xc587b580
BUG: unable to handle kernel NULL pointer dereference at virtual address
00000004
printing eip: c038cd39 *pdpt = 00000000055ee001 <1>*pde = 0000000000000000
Oops: 0002 [#1] PREEMPT SMP
Modules linked in: drbd cn tg3 e1000 bond1 bond0 cciss

Pid: 4329, comm: drbdsetup Not tainted (2.6.24-rc1 #1)
EIP: 0060:[<c038cd39>] EFLAGS: 00010047 CPU: 0
EIP is at skb_dequeue+0x22/0x3f
EAX: 00000000 EBX: c558207c ECX: 00000000 EDX: 00000282
ESI: c29ce300 EDI: c5582088 EBP: c55f9f28 ESP: c55f9d2c
 DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
Process drbdsetup (pid: 4329, ti=c55f8000 task=c5812550 task.ti=c55f8000)
Stack: c041c920 c5582000 c558207c c038e13b 00000000 7fffffff c041c920
c5582000       00001000 c55f9f28 c03a02dc c55f9d70 00000001 c2c0b300 00000000
       c55f9db0       c2141a80 00100100 00200200 c2141a98 c55f9ea8 000010e9 00000000
       00000000Call Trace:
 [<c038e13b>] skb_recv_datagram+0x6d/0xa7
 [<c03a02dc>] netlink_recvmsg+0x4c/0x1f1
 [<c038749a>] sock_recvmsg+0xcf/0xe8
 [<c03872d2>] sock_sendmsg+0xbc/0xd4
 [<c0136750>] autoremove_wake_function+0x0/0x33
 [<c03f0633>] _read_unlock_irq+0xe/0x22
 [<c014aac1>] find_lock_page+0x6d/0x73
 [<c03884ef>] sys_recvfrom+0xc0/0x122
 [<c0158ea7>] __do_fault+0x31a/0x352
 [<c0159295>] handle_mm_fault+0x19b/0x35a
 [<c0388588>] sys_recv+0x37/0x3b
 [<c0388bb7>] sys_socketcall+0x17f/0x242
 [<c0176072>] sys_poll+0x39/0x6c
 [<c0103eb6>] sysenter_past_esp+0x5f/0x85
 =======================
Code: 10 83 c4 0c 5b 5e 5f 5d c3 57 8d 78 0c 56 31 f6 53 89 c3 89 f8 e8 e9
35 06 00 89 c2 8b 03 39 d8 74 19 89 c6 8b 00 ff 4b 08 89 03 <89> 58 04 c7 06
00 00 00 00 c7 46 04 00 00 00 00 89 f8 e8 58 38
EIP: [<c038cd39>] skb_dequeue+0x22/0x3f SS:ESP 0068:c55f9d2c
note: drbdsetup[4329] exited with preempt_count 1
BUG: scheduling while atomic: drbdsetup/0x10000002/4329
 [<c03ef1a6>] __sched_text_start+0x86/0x36f
 [<c01213b2>] __cond_resched+0xf/0x28
 [<c03ef795>] cond_resched+0x23/0x2c
 [<c0157180>] unmap_vmas+0x19c/0x205
 [<c015bb2f>] exit_mmap+0x76/0x109
 [<c01231ff>] mmput+0x20/0x78
 [<c0127a76>] do_exit+0x1de/0x2a0
 [<c010514b>] die+0x1d8/0x1e0
 [<c03f22df>] do_page_fault+0x6a6/0x796
 [<c03f1c39>] do_page_fault+0x0/0x796
 [<c03f082a>] error_code+0x72/0x78
 [<c038cd39>] skb_dequeue+0x22/0x3f
 [<c038e13b>] skb_recv_datagram+0x6d/0xa7
 [<c03a02dc>] netlink_recvmsg+0x4c/0x1f1
 [<c038749a>] sock_recvmsg+0xcf/0xe8
 [<c03872d2>] sock_sendmsg+0xbc/0xd4
 [<c0136750>] autoremove_wake_function+0x0/0x33
 [<c03f0633>] _read_unlock_irq+0xe/0x22
 [<c014aac1>] find_lock_page+0x6d/0x73
 [<c03884ef>] sys_recvfrom+0xc0/0x122
 [<c0158ea7>] __do_fault+0x31a/0x352
 [<c0159295>] handle_mm_fault+0x19b/0x35a
 [<c0388588>] sys_recv+0x37/0x3b
 [<c0388bb7>] sys_socketcall+0x17f/0x242
 [<c0176072>] sys_poll+0x39/0x6c
 [<c0103eb6>] sysenter_past_esp+0x5f/0x85
 =======================
BUG: soft lockup - CPU#0 stuck for 11s! [drbdsetup:4329]

Pid: 4329, comm: drbdsetup Tainted: G      D (2.6.24-rc1 #1)
EIP: 0060:[<c03f0364>] EFLAGS: 00000202 CPU: 0
EIP is at _spin_lock_irqsave+0x53/0x60
EAX: 00000000 EBX: c5582088 ECX: c1809468 EDX: 00000001
ESI: 00000000 EDI: c5582088 EBP: c5cc6330 ESP: c55f9b64
 DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
CR0: 8005003b CR2: 00000004 CR3: 005b0000 CR4: 000006f0
DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
DR6: ffff0ff0 DR7: 00000400
 [<c038cd28>] skb_dequeue+0x11/0x3f
 [<c038cdb0>] skb_queue_purge+0x11/0x17
 [<c039ea56>] netlink_sock_destruct+0x2c/0xd1
 [<c0389ca2>] sk_free+0x21/0xb7
 [<c039f21f>] netlink_release+0x157/0x17c
 [<c039f23b>] netlink_release+0x173/0x17c
 [<c038aa4f>] release_sock+0x10/0x52
 [<c0129740>] local_bh_enable_ip+0x40/0x56
 [<c0387a9c>] sock_fasync+0x106/0x110
 [<c03871a4>] sock_release+0x12/0x84
 [<c0387991>] sock_close+0x28/0x2d
 [<c0387969>] sock_close+0x0/0x2d
 [<c016bb37>] __fput+0x96/0x141
 [<c016a604>] filp_close+0x4d/0x54
 [<c01270c0>] close_files+0x46/0x5a
 [<c0127113>] put_files_struct+0x15/0x3d
 [<c0127a84>] do_exit+0x1ec/0x2a0
 [<c010514b>] die+0x1d8/0x1e0
 [<c03f22df>] do_page_fault+0x6a6/0x796
 [<c03f1c39>] do_page_fault+0x0/0x796
 [<c03f082a>] error_code+0x72/0x78
 [<c038cd39>] skb_dequeue+0x22/0x3f
 [<c038e13b>] skb_recv_datagram+0x6d/0xa7
 [<c03a02dc>] netlink_recvmsg+0x4c/0x1f1
 [<c038749a>] sock_recvmsg+0xcf/0xe8
 [<c03872d2>] sock_sendmsg+0xbc/0xd4
 [<c0136750>] autoremove_wake_function+0x0/0x33
 [<c03f0633>] _read_unlock_irq+0xe/0x22
 [<c014aac1>] find_lock_page+0x6d/0x73
 [<c03884ef>] sys_recvfrom+0xc0/0x122
 [<c0158ea7>] __do_fault+0x31a/0x352
 [<c0159295>] handle_mm_fault+0x19b/0x35a
 [<c0388588>] sys_recv+0x37/0x3b
 [<c0388bb7>] sys_socketcall+0x17f/0x242
 [<c0176072>] sys_poll+0x39/0x6c
 [<c0103eb6>] sysenter_past_esp+0x5f/0x85
 =======================
BUG: soft lockup - CPU#0 stuck for 11s! [drbdsetup:4329]

Pid: 4329, comm: drbdsetup Tainted: G      D (2.6.24-rc1 #1)
EIP: 0060:[<c03f0364>] EFLAGS: 00000202 CPU: 0
EIP is at _spin_lock_irqsave+0x53/0x60
EAX: 00000000 EBX: c5582088 ECX: c1809468 EDX: 00000001
ESI: 00000000 EDI: c5582088 EBP: c5cc6330 ESP: c55f9b64
 DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
CR0: 8005003b CR2: 00000004 CR3: 005b0000 CR4: 000006f0
DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
DR6: ffff0ff0 DR7: 00000400
 [<c038cd28>] skb_dequeue+0x11/0x3f
 [<c038cdb0>] skb_queue_purge+0x11/0x17
 [<c039ea56>] netlink_sock_destruct+0x2c/0xd1
 [<c0389ca2>] sk_free+0x21/0xb7
 [<c039f21f>] netlink_release+0x157/0x17c
 [<c039f23b>] netlink_release+0x173/0x17c
 [<c038aa4f>] release_sock+0x10/0x52
 [<c0129740>] local_bh_enable_ip+0x40/0x56
 [<c0387a9c>] sock_fasync+0x106/0x110
 [<c03871a4>] sock_release+0x12/0x84
 [<c0387991>] sock_close+0x28/0x2d
 [<c0387969>] sock_close+0x0/0x2d
 [<c016bb37>] __fput+0x96/0x141
 [<c016a604>] filp_close+0x4d/0x54
 [<c01270c0>] close_files+0x46/0x5a
 [<c0127113>] put_files_struct+0x15/0x3d
 [<c0127a84>] do_exit+0x1ec/0x2a0
 [<c010514b>] die+0x1d8/0x1e0
 [<c03f22df>] do_page_fault+0x6a6/0x796
 [<c03f1c39>] do_page_fault+0x0/0x796
 [<c03f082a>] error_code+0x72/0x78
 [<c038cd39>] skb_dequeue+0x22/0x3f
 [<c038e13b>] skb_recv_datagram+0x6d/0xa7
 [<c03a02dc>] netlink_recvmsg+0x4c/0x1f1
 [<c038749a>] sock_recvmsg+0xcf/0xe8
 [<c03872d2>] sock_sendmsg+0xbc/0xd4
 [<c0136750>] autoremove_wake_function+0x0/0x33
 [<c03f0633>] _read_unlock_irq+0xe/0x22
 [<c014aac1>] find_lock_page+0x6d/0x73
 [<c03884ef>] sys_recvfrom+0xc0/0x122
 [<c0158ea7>] __do_fault+0x31a/0x352
 [<c0159295>] handle_mm_fault+0x19b/0x35a
 [<c0388588>] sys_recv+0x37/0x3b
 [<c0388bb7>] sys_socketcall+0x17f/0x242
 [<c0176072>] sys_poll+0x39/0x6c
 [<c0103eb6>] sysenter_past_esp+0x5f/0x85


And this happens in at least 4 servers with different hardware...
I see kernel 2.6.24-rc2 but drbd won't compile:
 CC [M]  /usr/src/drbd-8.2.1/drbd/drbd_worker.o
/usr/src/drbd-8.2.1/drbd/drbd_worker.c: In function `drbd_csum':
/usr/src/drbd-8.2.1/drbd/drbd_worker.c:297: error: too few arguments to
function `sg_set_page'
make[3]: *** [/usr/src/drbd-8.2.1/drbd/drbd_worker.o] Error 1

Any help may be very appreciated...
Thanks,
--
Matteo Tescione





More information about the drbd-user mailing list