[DRBD-user] Problem with 3.5.3 and drbd 8.4.2

Holger Kiehl Holger.Kiehl at dwd.de
Fri Sep 14 13:00:35 CEST 2012

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


Hello,

Got the following error situation where I do not know why it happened. In
/var/log/messages I found the following:

    Sep 14 08:32:06 praktifix kernel: WARNING: at kernel/smp.c:461 smp_call_function_many+0x6c/0x1bb()
    Sep 14 08:32:06 praktifix kernel: Hardware name: PRIMERGY RX300 S4
    Sep 14 08:32:06 praktifix kernel: Modules linked in: drbd(O) coretemp ipmi_devintf ipmi_si bonding binfmt_misc video acpi_ipmi ipmi_msghandler ac nvram sr_mod cdrom sg usbhid mgag200 fbcon ttm tileblit font bitblit softcursor drm_kms_helper drm i2c_algo_bit sysimgblt sysfillrect syscopyarea i5k_amb pata_acpi i2c_i801 ata_generic i2c_core i5000_edac ehci_hcd uhci_hcd usbcore usb_common [last unloaded: microcode]
    Sep 14 08:32:06 praktifix kernel: Pid: 4442, comm: drbd_r_r0 Tainted: G           O 3.5.3 #1
    Sep 14 08:32:06 praktifix kernel: Call Trace:
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff81060411>] ? smp_call_function_many+0x6c/0x1bb
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff8102ab0e>] warn_slowpath_common+0x80/0x99
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff8102ab3c>] warn_slowpath_null+0x15/0x17
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff81060411>] smp_call_function_many+0x6c/0x1bb
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff81024bd1>] ? leave_mm+0x43/0x43
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff81024bd1>] ? leave_mm+0x43/0x43
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff810605c2>] smp_call_function+0x20/0x24
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff810606a9>] on_each_cpu+0x16/0x32
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff81024aa3>] flush_tlb_all+0x17/0x19
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff8109f971>] __purge_vmap_area_lazy+0x122/0x17a
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff8109fa4b>] free_vmap_area_noflush+0x54/0x5b
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff810a05e9>] free_unmap_vmap_area+0x20/0x24
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff810a064a>] remove_vm_area+0x5d/0x71
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff810a076a>] __vunmap+0x38/0xb5
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff810a080d>] vunmap+0x26/0x28
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa00e9fb7>] ttm_bo_kunmap+0x55/0xa3 [ttm]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa00fc6a3>] mga_dirty_update+0x10b/0x122 [mgag200]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa00fc6e4>] mga_imageblit+0x2a/0x2f [mgag200]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa00ca7a4>] bit_putcs+0x44b/0x4b0 [bitblit]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa00cacf7>] ? bit_cursor+0x4ee/0x7f7 [bitblit]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa0103a74>] fbcon_putcs+0xa1/0x101 [fbcon]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa00ca359>] ? bit_clear+0xd6/0xd6 [bitblit]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa0105232>] fbcon_redraw+0xd8/0x16c [fbcon]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa0104494>] ? fbcon_cursor+0x127/0x150 [fbcon]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa00ca809>] ? bit_putcs+0x4b0/0x4b0 [bitblit]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa01071c5>] fbcon_scroll+0x687/0xc6c [fbcon]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff8102bc9f>] ? console_unlock+0x2e0/0x2ef
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff811ee543>] scrup+0x71/0xe8
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff811ee64e>] lf+0x2d/0x66
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff811f3119>] vt_console_print+0x1d9/0x304
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff8102afd5>] call_console_drivers+0x7b/0x8d
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff8102bc1f>] console_unlock+0x260/0x2ef
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff8102c435>] vprintk_emit+0x302/0x364
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff8102c97c>] printk_emit+0x88/0x8a
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff8104cd4b>] ? __wake_up+0x43/0x50
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff812fb18d>] ? netlink_broadcast_filtered+0x28e/0x2bb
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff81205d8b>] __dev_printk+0x1d2/0x1e4
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa01a5c82>] ? drbd_bcast_event+0xd7/0x11c [drbd]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa01a949f>] ? drbd_khelper+0x1cc/0x1ff [drbd]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff8120636a>] dev_printk+0xa9/0xab
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa018f551>] ? drbd_recv+0x26/0x15a [drbd]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa018f551>] ? drbd_recv+0x26/0x15a [drbd]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa018e4d1>] drbd_sync_handshake+0x34b/0x548 [drbd]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa0194d8d>] receive_state+0x3ce/0x75d [drbd]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa01908fc>] drbdd+0x9d/0x13a [drbd]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa019118c>] drbdd_init+0x79/0x98 [drbd]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa01a2b38>] drbd_thread_setup+0x97/0x13f [drbd]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff81377254>] kernel_thread_helper+0x4/0x10
    Sep 14 08:32:06 praktifix kernel:  [<ffffffffa01a2aa1>] ? drbd_bmio_clear_n_write+0x149/0x149 [drbd]
    Sep 14 08:32:06 praktifix kernel:  [<ffffffff81377250>] ? gs_change+0xb/0xb
    Sep 14 08:32:06 praktifix kernel: ---[ end trace 8b6e7b6ecbb1b906 ]---
    Sep 14 08:32:06 praktifix kernel: block drbd0: helper command: /sbin/drbdadm split-brain minor-0
    Sep 14 08:32:06 praktifix kernel: block drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
    Sep 14 08:32:06 praktifix kernel: d-con r0: conn( NetworkFailure -> Disconnecting )
    Sep 14 08:32:06 praktifix kernel: d-con r0: error receiving ReportState, e: -5 l: 0!
    Sep 14 08:32:06 praktifix kernel: d-con r0: Connection closed
    Sep 14 08:32:06 praktifix kernel: d-con r0: conn( Disconnecting -> StandAlone )
    Sep 14 08:32:06 praktifix kernel: d-con r0: receiver terminated
    Sep 14 08:32:06 praktifix kernel: d-con r0: Terminating receiver thread

Before this it was running kernel 3.2.x and drbd 8.4.1 for a long time
without any errors. Any clue why this happened?

If more information is needed please just ask.

Regards,
Holger



More information about the drbd-user mailing list