[DRBD-user] strange drbd bug

Darren Ginter dsginter at gmail.com
Wed Nov 5 15:04:56 CET 2014

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


Here's an easy way to reproduce:

1) Put ZFS zvol (zfs pseudo block device) on two systems (mine are 8TB).
2) Use DRBD 8.4.4 to replicate those block devices (I'm using protocol A).
3) Watch your syslog for the crash.

Regards,

Darren


Nov  4 12:24:11 rozrep2 kernel: [ 3722.622015] BUG: soft lockup - CPU#0
stuck for 22s! [zvol/4:640]
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622426] Modules linked in: drbd
hid_generic usbhid hid lru_cache libcrc32c snd_hda_codec_hdmi zfs(POF)
zunicode(POF) zavl(POF) zcommon(POF) crc32_pclmul znvpair(POF) spl(OF)
aesni_intel aes_x86_64 lrw gf128mul glue_helper ablk_helper
ghash_clmulni_intel cryptd kvm_amd kvm crct10dif_pclmul k10temp serio_raw
fam15h_power edac_mce_amd edac_core nouveau i2c_piix4 sp5100_tco video
mxm_wmi snd_hda_intel wmi snd_hda_codec drm_kms_helper snd_hwdep ttm
snd_pcm drm snd_page_alloc snd_timer snd mac_hid i2c_algo_bit soundcore lp
parport pata_acpi psmouse ixgbe r8169 firewire_ohci mii ahci dca
firewire_core crc_itu_t pata_atiixp megaraid_sas ptp libahci pps_core mdio
[last unloaded: drbd]
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622457] CPU: 0 PID: 640 Comm: zvol/4
Tainted: PF          O 3.13.0-32-generic #57-Ubuntu
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622459] Hardware name: Gigabyte
Technology Co., Ltd. To be filled by O.E.M./990FXA-UD3, BIOS F2 07/15/2013
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622461] task: ffff88080460afe0 ti:
ffff880804606000 task.ti: ffff880804606000
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622462] RIP:
0010:[<ffffffff81723c30>]  [<ffffffff81723c30>] _raw_spin_lock+0x30/0x50
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622466] RSP: 0018:ffff880804607ad0
 EFLAGS: 00000297
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622467] RAX: 0000000000000878 RBX:
0000000000000000 RCX: 0000000000000004
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622468] RDX: 0000000000000000 RSI:
0000000000000000 RDI: ffff8806dfb5805c
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622470] RBP: ffff880804607ad0 R08:
000060f7c1001510 R09: ffff8803c814c000
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622471] R10: ffff880810ee1868 R11:
0000000000000001 R12: ffffffffa062cee3
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622472] R13: ffff880804607a90 R14:
00000230632847b8 R15: ffffffffa04baeb9
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622474] FS:  00007f037e3c6780(0000)
GS:ffff88083ec00000(0000) knlGS:0000000000000000
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622475] CS:  0010 DS: 0000 ES: 0000
CR0: 000000008005003b
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622476] CR2: 00007f037e3c5000 CR3:
0000000001c0e000 CR4: 00000000000407f0
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622477] Stack:
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622478]  ffff880804607af0
ffffffff8172228f 0000000000000004 0000000000000001
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622482]  ffff880804607b00
ffffffff817222db ffff880804607ba0 ffffffffa0591971
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622484]  ffffffff00000001
0000000001cce970 ffff8803e3bfe480 0000002e0e037860
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622487] Call Trace:
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622491]  [<ffffffff8172228f>]
__mutex_unlock_slowpath+0x1f/0x50
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622493]  [<ffffffff817222db>]
mutex_unlock+0x1b/0x20
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622512]  [<ffffffffa0591971>]
dbuf_read+0x341/0x930 [zfs]
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622527]  [<ffffffffa0592188>] ?
__dbuf_hold_impl+0x228/0x4d0 [zfs]
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622540]  [<ffffffffa05923a2>]
__dbuf_hold_impl+0x442/0x4d0 [zfs]
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622554]  [<ffffffffa05924ab>]
dbuf_hold_impl+0x7b/0xa0 [zfs]
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622571]  [<ffffffffa05a6bd7>]
dmu_tx_count_write+0x3c7/0x6f0 [zfs]
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622588]  [<ffffffffa05a6f36>]
dmu_tx_hold_write+0x36/0x50 [zfs]
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622614]  [<ffffffffa0637f8a>]
zvol_write+0x9a/0x480 [zfs]
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622621]  [<ffffffffa04bc487>]
taskq_thread+0x237/0x4b0 [spl]
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622624]  [<ffffffff81097508>] ?
finish_task_switch+0x128/0x170
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622627]  [<ffffffff8109a800>] ?
wake_up_state+0x20/0x20
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622632]  [<ffffffffa04bc250>] ?
taskq_cancel_id+0x1f0/0x1f0 [spl]
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622635]  [<ffffffff8108b3d2>]
kthread+0xd2/0xf0
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622637]  [<ffffffff8108b300>] ?
kthread_create_on_node+0x1d0/0x1d0
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622640]  [<ffffffff8172c5bc>]
ret_from_fork+0x7c/0xb0
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622642]  [<ffffffff8108b300>] ?
kthread_create_on_node+0x1d0/0x1d0
Nov  4 12:24:11 rozrep2 kernel: [ 3722.622644] Code: 55 48 89 e5 b8 00 00
02 00 f0 0f c1 07 89 c2 c1 ea 10 66 39 c2 75 02 5d c3 83 e2 fe 0f b7 f2 b8
00 80 00 00 eb 0c 0f 1f 44 00 00 <f3> 90 83 e8 01 74 0a 0f b7 0f 66 39 ca
75 f1 5d c3 66 66 66 90
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20141105/5979022e/attachment.htm>


More information about the drbd-user mailing list