[DRBD-user] Kernel 2.6.20+ Drbd 8.0.0 (2738M): BUG: unable to handle kernel NULL pointer

Martin Bene martin.bene at icomedias.com
Wed Feb 14 08:43:40 CET 2007

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


Hi, I'm seeing the above error in one of the scenarios I'm testing:

* Kernel 2.6.20
* drbd 8.0.0, SVN Revision 2738
* 2 drbd devices

I've seen the above bug several times, most recently after hard
resetting a node that was primary for one of the devices.

On reboot I got the following bug in kernel logs:

Feb 14 10:37:17 webc-neu2 kernel: drbd: initialised. Version: 8.0.0
(api:86/proto:86)
Feb 14 10:37:17 webc-neu2 kernel: drbd: SVN Revision: 2738M build by
root at webc-neu2, 2007-02-06 14:56:34
Feb 14 10:37:17 webc-neu2 kernel: drbd: registered as block device major
147
Feb 14 10:37:17 webc-neu2 kernel: drbd: minor_table @ 0xf7a888c0
Feb 14 10:37:17 webc-neu2 kernel: drbd0: disk( Diskless -> Attaching )
Feb 14 10:37:17 webc-neu2 kernel: drbd0: Found 6 transactions (276
active extents) in activity log.
Feb 14 10:37:17 webc-neu2 kernel: drbd0: max_segment_size ( = BIO size )
= 32768
Feb 14 10:37:17 webc-neu2 kernel: drbd0: drbd_bm_resize called with
capacity == 58586936
Feb 14 10:37:17 webc-neu2 kernel: drbd0: resync bitmap: bits=7323367
words=228856
Feb 14 10:37:17 webc-neu2 kernel: drbd0: size = 27 GB (29293468 KB)
Feb 14 10:37:17 webc-neu2 kernel: drbd0: reading of bitmap took 8
jiffies
Feb 14 10:37:17 webc-neu2 kernel: drbd0: recounting of set bits took
additional 0 jiffies
Feb 14 10:37:17 webc-neu2 kernel: drbd0: 0 KB marked out-of-sync by on
disk bit-map.
Feb 14 10:37:17 webc-neu2 kernel: drbd0: Marked additional 1026 MB as
out-of-sync based on AL.
Feb 14 10:37:17 webc-neu2 kernel: drbd0: disk( Attaching -> UpToDate )
Feb 14 10:37:17 webc-neu2 kernel: drbd0: Writing meta data super block
now.
Feb 14 10:37:17 webc-neu2 kernel: drbd1: disk( Diskless -> Attaching )
Feb 14 10:37:17 webc-neu2 kernel: drbd1: Found 6 transactions (324
active extents) in activity log.
Feb 14 10:37:17 webc-neu2 kernel: drbd1: max_segment_size ( = BIO size )
= 32768
Feb 14 10:37:17 webc-neu2 kernel: drbd1: drbd_bm_resize called with
capacity == 58586936
Feb 14 10:37:17 webc-neu2 kernel: drbd1: resync bitmap: bits=7323367
words=228856
Feb 14 10:37:17 webc-neu2 kernel: drbd1: size = 27 GB (29293468 KB)
Feb 14 10:37:17 webc-neu2 kernel: drbd1: reading of bitmap took 11
jiffies
Feb 14 10:37:17 webc-neu2 kernel: drbd1: recounting of set bits took
additional 0 jiffies
Feb 14 10:37:17 webc-neu2 kernel: drbd1: 0 KB marked out-of-sync by on
disk bit-map.
Feb 14 10:37:17 webc-neu2 kernel: drbd1: disk( Attaching -> UpToDate )
Feb 14 10:37:17 webc-neu2 kernel: drbd1: Writing meta data super block
now.
Feb 14 10:37:17 webc-neu2 kernel: drbd0: conn( StandAlone -> Unconnected
)
Feb 14 10:37:17 webc-neu2 kernel: drbd0: receiver (re)started
Feb 14 10:37:17 webc-neu2 kernel: drbd0: conn( Unconnected ->
WFConnection )
Feb 14 10:37:17 webc-neu2 kernel: drbd1: conn( StandAlone -> Unconnected
)
Feb 14 10:37:17 webc-neu2 kernel: drbd1: receiver (re)started
Feb 14 10:37:17 webc-neu2 kernel: drbd1: conn( Unconnected ->
WFConnection )
Feb 14 10:37:17 webc-neu2 kernel: BUG: unable to handle kernel NULL
pointer dereference at virtual address 00000000
Feb 14 10:37:17 webc-neu2 kernel:  printing eip:
Feb 14 10:37:17 webc-neu2 kernel: 00000000
Feb 14 10:37:17 webc-neu2 kernel: *pde = 00000000
Feb 14 10:37:17 webc-neu2 kernel: Oops: 0000 [#1]
Feb 14 10:37:17 webc-neu2 kernel: SMP
Feb 14 10:37:17 webc-neu2 kernel: Modules linked in: drbd usbcore sha1
ipmi_devintf ipmi_si ipmi_msghandler nfs lockd sunrpc tg3 iTCO_wdt
Feb 14 10:37:17 webc-neu2 kernel: CPU:    1
Feb 14 10:37:17 webc-neu2 kernel: EIP:    0060:[_proxy_pda+0/1048576]
Not tainted VLI
Feb 14 10:37:17 webc-neu2 kernel: EIP:    0060:[<00000000>]    Not
tainted VLI
Feb 14 10:37:17 webc-neu2 kernel: EFLAGS: 00010247   (2.6.20-gentoo #2)
Feb 14 10:37:17 webc-neu2 kernel: EIP is at _stext+0x3feffc6c/0x20
Feb 14 10:37:17 webc-neu2 kernel: eax: f79d7c10   ebx: f732c608   ecx:
c2169358   edx: 00000286
Feb 14 10:37:17 webc-neu2 kernel: esi: f732c5cc   edi: f732c5cc   ebp:
00000286   esp: c229df40
Feb 14 10:37:17 webc-neu2 kernel: ds: 007b   es: 007b   ss: 0068
Feb 14 10:37:17 webc-neu2 kernel: Process cqueue/1 (pid: 127,
ti=c229c000 task=c2267550 task.ti=c229c000)
Feb 14 10:37:17 webc-neu2 kernel: Stack: c0261f69 f732c5d0 c2169340
c0125957 0000021b 13a32329 00000007 f7f4d030
Feb 14 10:37:17 webc-neu2 kernel:        c0261f5c c2169340 c2169340
c229df80 fffffffc c0125b01 ffffffff ffffffff
Feb 14 10:37:17 webc-neu2 kernel:        00000001 00000000 c0113891
00010000 00000000 c213da70 00000000 c200c900
Feb 14 10:37:17 webc-neu2 kernel: Call Trace:
Feb 14 10:37:17 webc-neu2 kernel:  [cn_queue_wrapper+13/36]
cn_queue_wrapper+0xd/0x24
Feb 14 10:37:17 webc-neu2 kernel:  [<c0261f69>]
cn_queue_wrapper+0xd/0x24
Feb 14 10:37:17 webc-neu2 kernel:  [run_workqueue+138/292]
run_workqueue+0x8a/0x124
Feb 14 10:37:17 webc-neu2 kernel:  [<c0125957>] run_workqueue+0x8a/0x124
Feb 14 10:37:17 webc-neu2 kernel:  [cn_queue_wrapper+0/36]
cn_queue_wrapper+0x0/0x24
Feb 14 10:37:17 webc-neu2 kernel:  [<c0261f5c>]
cn_queue_wrapper+0x0/0x24
Feb 14 10:37:17 webc-neu2 kernel:  [worker_thread+272/315]
worker_thread+0x110/0x13b
Feb 14 10:37:17 webc-neu2 kernel:  [<c0125b01>]
worker_thread+0x110/0x13b
Feb 14 10:37:17 webc-neu2 kernel:  [default_wake_function+0/12]
default_wake_function+0x0/0xc
Feb 14 10:37:17 webc-neu2 kernel:  [<c0113891>]
default_wake_function+0x0/0xc
Feb 14 10:37:17 webc-neu2 kernel:  [default_wake_function+0/12]
default_wake_function+0x0/0xc
Feb 14 10:37:17 webc-neu2 kernel:  [<c0113891>]
default_wake_function+0x0/0xc
Feb 14 10:37:17 webc-neu2 kernel:  [worker_thread+0/315]
worker_thread+0x0/0x13b
Feb 14 10:37:17 webc-neu2 kernel:  [<c01259f1>] worker_thread+0x0/0x13b
Feb 14 10:37:17 webc-neu2 kernel:  [kthread+116/152] kthread+0x74/0x98
Feb 14 10:37:17 webc-neu2 kernel:  [<c0128614>] kthread+0x74/0x98
Feb 14 10:37:17 webc-neu2 kernel:  [kthread+0/152] kthread+0x0/0x98
Feb 14 10:37:17 webc-neu2 kernel:  [<c01285a0>] kthread+0x0/0x98
Feb 14 10:37:17 webc-neu2 kernel:  [kernel_thread_helper+7/16]
kernel_thread_helper+0x7/0x10
Feb 14 10:37:17 webc-neu2 kernel:  [<c0103473>]
kernel_thread_helper+0x7/0x10
Feb 14 10:37:17 webc-neu2 kernel:  =======================
Feb 14 10:37:17 webc-neu2 kernel: Code:  Bad EIP value.
Feb 14 10:37:17 webc-neu2 kernel: EIP: [_proxy_pda+0/1048576]
_stext+0x3feffc6c/0x20 SS:ESP 0068:c229df40
Feb 14 10:37:17 webc-neu2 kernel: EIP: [<00000000>]
_stext+0x3feffc6c/0x20 SS:ESP 0068:c229df40
Feb 14 10:37:20 webc-neu2 kernel:  <6>drbd0: conn( WFConnection ->
WFReportParams )
Feb 14 10:37:20 webc-neu2 kernel: drbd0: Handshake successful: DRBD
Network Protocol version 86
Feb 14 10:37:20 webc-neu2 kernel: drbd0: Peer authenticated using 20
bytes of 'sha1' HMAC
Feb 14 10:37:20 webc-neu2 kernel: drbd0: peer( Unknown -> Secondary )
conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> UpToDate )
Feb 14 10:37:20 webc-neu2 kernel: drbd1: conn( WFConnection ->
WFReportParams )
Feb 14 10:37:20 webc-neu2 kernel: drbd1: Handshake successful: DRBD
Network Protocol version 86
Feb 14 10:37:20 webc-neu2 kernel: drbd1: Peer authenticated using 20
bytes of 'sha1' HMAC
Feb 14 10:37:20 webc-neu2 kernel: drbd1: peer( Unknown -> Secondary )
conn( WFReportParams -> Connected ) pdsk( DUnknown -> UpToDate )
Feb 14 10:37:20 webc-neu2 kernel: drbd1: Writing meta data super block
now.
Feb 14 10:37:20 webc-neu2 kernel: drbd0: Writing meta data super block
now.
Feb 14 10:37:20 webc-neu2 kernel: drbd0: conn( WFBitMapS -> SyncSource )
pdsk( UpToDate -> Inconsistent )
Feb 14 10:37:20 webc-neu2 kernel: drbd1: aftr_isp( 0 -> 1 )

Any Idea what's going on?

Thanks, Martin



More information about the drbd-user mailing list