[Drbd-dev] DRBD Lockups: We did not send a P_BARRIER – drbd kernel thread blocked

Philipp Reisner philipp.reisner at linbit.com
Wed Apr 13 09:20:45 CEST 2022


Hello Michael,

Unfortunately, it is hard to tell what went wrong from the log
messages. Between 9.1.6 and 9.1.7-rc.2 we fixed several bugs that
led to the same symptom (= these log messages).
I do not know whether those fixes also cover what you see. My asks are:
 * Please verify with 9.1.7-rc.2 whether you can reproduce the issue.
 * If yes, try, step by step, to create a minimal reproducing
scenario and send it to us in a form that lets us reproduce it.

thanks in advance.

best regards,
 Phil



On Tue, Apr 12, 2022 at 11:36 AM Michael Hierweck <michael at hierweck.de> wrote:
>
> Dear DRBD Developers,
>
> we successfully used a rather simple IO stack with two nodes for many years:
>
> DRBD 8.x => KVM/QEMU => XFS
>
>
> In late 2020 we migrated to:
>
> DRBD 9.x => KVM/QEMU => XFS
>
>
> With both the 9.0 and 9.1 series of DRBD we experienced that the invocation of "fstrim" inside
> the VMs can lead to lockups of the corresponding DRBD device, especially when using HDD-based
> backing storage. We experienced the problem with NVMe-based backing as well, though. If this
> happens, the VM cannot perform IO anymore and the host has to be rebooted.
>
> [1664789.544862] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 43656ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1664832.555478] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 86668ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1664875.558517] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 129672ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1664900.134277] INFO: task kworker/u192:2:90254 blocked for more than 120 seconds.
> [1664900.134336]       Tainted: P           OE     5.10.0-0.bpo.12-amd64 #1 Debian
> 5.10.103-1~bpo10+1
> [1664900.134377] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [1664900.134415] task:kworker/u192:2  state:D stack:    0 pid:90254 ppid:     2 flags:0x00004000
> [1664900.134493] Workqueue: drbd21440_submit do_submit [drbd]
> [1664900.134496] Call Trace:
> [1664900.134509]  __schedule+0x2be/0x770
> [1664900.134520]  ? prepare_al_transaction_nonblock+0x202/0x320 [drbd]
> [1664900.134521]  schedule+0x3c/0xa0
> [1664900.134529]  do_submit+0x3c6/0x690 [drbd]
> [1664900.134539]  ? finish_wait+0x80/0x80
> [1664900.134545]  process_one_work+0x1aa/0x340
> [1664900.134547]  worker_thread+0x30/0x390
> [1664900.134550]  ? create_worker+0x1a0/0x1a0
> [1664900.134551]  kthread+0x116/0x130
> [1664900.134553]  ? __kthread_cancel_work+0x40/0x40
> [1664900.134558]  ret_from_fork+0x22/0x30
> [1664918.565217] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 172680ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1664961.571983] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 215688ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665004.578843] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 258696ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665020.962904] INFO: task kworker/u192:2:90254 blocked for more than 241 seconds.
> [1665020.962951]       Tainted: P           OE     5.10.0-0.bpo.12-amd64 #1 Debian
> 5.10.103-1~bpo10+1
> [1665020.962982] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [1665020.963008] task:kworker/u192:2  state:D stack:    0 pid:90254 ppid:     2 flags:0x00004000
> [1665020.963031] Workqueue: drbd21440_submit do_submit [drbd]
> [1665020.963033] Call Trace:
> [1665020.963045]  __schedule+0x2be/0x770
> [1665020.963057]  ? prepare_al_transaction_nonblock+0x202/0x320 [drbd]
> [1665020.963059]  schedule+0x3c/0xa0
> [1665020.963067]  do_submit+0x3c6/0x690 [drbd]
> [1665020.963075]  ? finish_wait+0x80/0x80
> [1665020.963081]  process_one_work+0x1aa/0x340
> [1665020.963084]  worker_thread+0x30/0x390
> [1665020.963086]  ? create_worker+0x1a0/0x1a0
> [1665020.963088]  kthread+0x116/0x130
> [1665020.963090]  ? __kthread_cancel_work+0x40/0x40
> [1665020.963094]  ret_from_fork+0x22/0x30
> [1665047.585837] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 301704ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665090.592449] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 344712ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665133.599392] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 387720ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665141.791772] INFO: task kworker/u192:2:90254 blocked for more than 362 seconds.
> [1665141.791827]       Tainted: P           OE     5.10.0-0.bpo.12-amd64 #1 Debian
> 5.10.103-1~bpo10+1
> [1665141.791865] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [1665141.791901] task:kworker/u192:2  state:D stack:    0 pid:90254 ppid:     2 flags:0x00004000
> [1665141.791927] Workqueue: drbd21440_submit do_submit [drbd]
> [1665141.791930] Call Trace:
> [1665141.791944]  __schedule+0x2be/0x770
> [1665141.791959]  ? prepare_al_transaction_nonblock+0x202/0x320 [drbd]
> [1665141.791962]  schedule+0x3c/0xa0
> [1665141.791973]  do_submit+0x3c6/0x690 [drbd]
> [1665141.791982]  ? finish_wait+0x80/0x80
> [1665141.791989]  process_one_work+0x1aa/0x340
> [1665141.791993]  worker_thread+0x30/0x390
> [1665141.791996]  ? create_worker+0x1a0/0x1a0
> [1665141.791999]  kthread+0x116/0x130
> [1665141.792001]  ? __kthread_cancel_work+0x40/0x40
> [1665141.792006]  ret_from_fork+0x22/0x30
> [1665176.606182] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 430728ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665219.613011] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 473736ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665262.619975] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 516744ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665262.620472] INFO: task kworker/u192:2:90254 blocked for more than 483 seconds.
> [1665262.620520]       Tainted: P           OE     5.10.0-0.bpo.12-amd64 #1 Debian
> 5.10.103-1~bpo10+1
> [1665262.620550] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [1665262.620576] task:kworker/u192:2  state:D stack:    0 pid:90254 ppid:     2 flags:0x00004000
> [1665262.620599] Workqueue: drbd21440_submit do_submit [drbd]
> [1665262.620601] Call Trace:
> [1665262.620613]  __schedule+0x2be/0x770
> [1665262.620624]  ? prepare_al_transaction_nonblock+0x202/0x320 [drbd]
> [1665262.620626]  schedule+0x3c/0xa0
> [1665262.620633]  do_submit+0x3c6/0x690 [drbd]
> [1665262.620642]  ? finish_wait+0x80/0x80
> [1665262.620648]  process_one_work+0x1aa/0x340
> [1665262.620651]  worker_thread+0x30/0x390
> [1665262.620653]  ? create_worker+0x1a0/0x1a0
> [1665262.620654]  kthread+0x116/0x130
> [1665262.620657]  ? __kthread_cancel_work+0x40/0x40
> [1665262.620661]  ret_from_fork+0x22/0x30
> [1665305.626858] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 559752ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665348.633505] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 602760ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665383.448906] INFO: task kworker/u192:2:90254 blocked for more than 604 seconds.
> [1665383.448944]       Tainted: P           OE     5.10.0-0.bpo.12-amd64 #1 Debian
> 5.10.103-1~bpo10+1
> [1665383.448965] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [1665383.448988] task:kworker/u192:2  state:D stack:    0 pid:90254 ppid:     2 flags:0x00004000
> [1665383.449009] Workqueue: drbd21440_submit do_submit [drbd]
> [1665383.449011] Call Trace:
> [1665383.449021]  __schedule+0x2be/0x770
> [1665383.449029]  ? prepare_al_transaction_nonblock+0x202/0x320 [drbd]
> [1665383.449030]  schedule+0x3c/0xa0
> [1665383.449036]  do_submit+0x3c6/0x690 [drbd]
> [1665383.449043]  ? finish_wait+0x80/0x80
> [1665383.449048]  process_one_work+0x1aa/0x340
> [1665383.449051]  worker_thread+0x30/0x390
> [1665383.449052]  ? create_worker+0x1a0/0x1a0
> [1665383.449053]  kthread+0x116/0x130
> [1665383.449054]  ? __kthread_cancel_work+0x40/0x40
> [1665383.449058]  ret_from_fork+0x22/0x30
> [1665391.640337] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 645768ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665434.647471] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 688776ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665477.654117] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 731784ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665504.277746] INFO: task kworker/u192:2:90254 blocked for more than 724 seconds.
> [1665504.277802]       Tainted: P           OE     5.10.0-0.bpo.12-amd64 #1 Debian
> 5.10.103-1~bpo10+1
> [1665504.277841] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [1665504.277877] task:kworker/u192:2  state:D stack:    0 pid:90254 ppid:     2 flags:0x00004000
> [1665504.277902] Workqueue: drbd21440_submit do_submit [drbd]
> [1665504.277904] Call Trace:
> [1665504.277916]  __schedule+0x2be/0x770
> [1665504.277927]  ? prepare_al_transaction_nonblock+0x202/0x320 [drbd]
> [1665504.277929]  schedule+0x3c/0xa0
> [1665504.277937]  do_submit+0x3c6/0x690 [drbd]
> [1665504.277946]  ? finish_wait+0x80/0x80
> [1665504.277951]  process_one_work+0x1aa/0x340
> [1665504.277954]  worker_thread+0x30/0x390
> [1665504.277956]  ? create_worker+0x1a0/0x1a0
> [1665504.277959]  kthread+0x116/0x130
> [1665504.277961]  ? __kthread_cancel_work+0x40/0x40
> [1665504.277965]  ret_from_fork+0x22/0x30
> [1665520.661045] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 774792ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665563.667635] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 817800ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665606.674604] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 860808ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665625.106375] INFO: task kworker/u192:2:90254 blocked for more than 845 seconds.
> [1665625.106422]       Tainted: P           OE     5.10.0-0.bpo.12-amd64 #1 Debian
> 5.10.103-1~bpo10+1
> [1665625.106454] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [1665625.106483] task:kworker/u192:2  state:D stack:    0 pid:90254 ppid:     2 flags:0x00004000
> [1665625.106560] Workqueue: drbd21440_submit do_submit [drbd]
> [1665625.106563] Call Trace:
> [1665625.106576]  __schedule+0x2be/0x770
> [1665625.106588]  ? prepare_al_transaction_nonblock+0x202/0x320 [drbd]
> [1665625.106590]  schedule+0x3c/0xa0
> [1665625.106598]  do_submit+0x3c6/0x690 [drbd]
> [1665625.106607]  ? finish_wait+0x80/0x80
> [1665625.106613]  process_one_work+0x1aa/0x340
> [1665625.106616]  worker_thread+0x30/0x390
> [1665625.106619]  ? create_worker+0x1a0/0x1a0
> [1665625.106620]  kthread+0x116/0x130
> [1665625.106623]  ? __kthread_cancel_work+0x40/0x40
> [1665625.106627]  ret_from_fork+0x22/0x30
> [1665649.681308] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 903816ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665692.688072] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 946824ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665735.695099] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 989832ms >
> ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665745.935255] INFO: task kworker/u192:2:90254 blocked for more than 966 seconds.
> [1665745.935309]       Tainted: P           OE     5.10.0-0.bpo.12-amd64 #1 Debian
> 5.10.103-1~bpo10+1
> [1665745.935347] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [1665745.935382] task:kworker/u192:2  state:D stack:    0 pid:90254 ppid:     2 flags:0x00004000
> [1665745.935410] Workqueue: drbd21440_submit do_submit [drbd]
> [1665745.935413] Call Trace:
> [1665745.935426]  __schedule+0x2be/0x770
> [1665745.935440]  ? prepare_al_transaction_nonblock+0x202/0x320 [drbd]
> [1665745.935443]  schedule+0x3c/0xa0
> [1665745.935454]  do_submit+0x3c6/0x690 [drbd]
> [1665745.935463]  ? finish_wait+0x80/0x80
> [1665745.935470]  process_one_work+0x1aa/0x340
> [1665745.935474]  worker_thread+0x30/0x390
> [1665745.935477]  ? create_worker+0x1a0/0x1a0
> [1665745.935480]  kthread+0x116/0x130
> [1665745.935482]  ? __kthread_cancel_work+0x40/0x40
> [1665745.935487]  ret_from_fork+0x22/0x30
> [1665778.701745] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1032840ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665821.708739] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1075848ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665864.715639] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1118856ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665866.763823] INFO: task kworker/u192:2:90254 blocked for more than 1087 seconds.
> [1665866.763881]       Tainted: P           OE     5.10.0-0.bpo.12-amd64 #1 Debian
> 5.10.103-1~bpo10+1
> [1665866.763923] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [1665866.763952] task:kworker/u192:2  state:D stack:    0 pid:90254 ppid:     2 flags:0x00004000
> [1665866.764033] Workqueue: drbd21440_submit do_submit [drbd]
> [1665866.764035] Call Trace:
> [1665866.764050]  __schedule+0x2be/0x770
> [1665866.764061]  ? prepare_al_transaction_nonblock+0x202/0x320 [drbd]
> [1665866.764063]  schedule+0x3c/0xa0
> [1665866.764071]  do_submit+0x3c6/0x690 [drbd]
> [1665866.764080]  ? finish_wait+0x80/0x80
> [1665866.764086]  process_one_work+0x1aa/0x340
> [1665866.764089]  worker_thread+0x30/0x390
> [1665866.764091]  ? create_worker+0x1a0/0x1a0
> [1665866.764093]  kthread+0x116/0x130
> [1665866.764095]  ? __kthread_cancel_work+0x40/0x40
> [1665866.764099]  ret_from_fork+0x22/0x30
> [1665907.722451] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1161864ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665950.729230] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1204872ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1665987.592801] INFO: task kworker/u192:2:90254 blocked for more than 1208 seconds.
> [1665987.592858]       Tainted: P           OE     5.10.0-0.bpo.12-amd64 #1 Debian
> 5.10.103-1~bpo10+1
> [1665987.592898] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [1665987.592933] task:kworker/u192:2  state:D stack:    0 pid:90254 ppid:     2 flags:0x00004000
> [1665987.592961] Workqueue: drbd21440_submit do_submit [drbd]
> [1665987.592964] Call Trace:
> [1665987.592976]  __schedule+0x2be/0x770
> [1665987.592990]  ? prepare_al_transaction_nonblock+0x202/0x320 [drbd]
> [1665987.592992]  schedule+0x3c/0xa0
> [1665987.593003]  do_submit+0x3c6/0x690 [drbd]
> [1665987.593014]  ? finish_wait+0x80/0x80
> [1665987.593021]  process_one_work+0x1aa/0x340
> [1665987.593024]  worker_thread+0x30/0x390
> [1665987.593027]  ? create_worker+0x1a0/0x1a0
> [1665987.593029]  kthread+0x116/0x130
> [1665987.593032]  ? __kthread_cancel_work+0x40/0x40
> [1665987.593036]  ret_from_fork+0x22/0x30
> [1665993.735884] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1247880ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666036.742693] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1290888ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666079.749605] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1333896ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666122.756349] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1376904ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666165.763367] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1419912ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666208.770063] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1462920ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666251.776837] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1505928ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666294.787710] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1548940ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666337.790495] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1591944ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666380.797514] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1634952ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666423.804168] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1677960ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666466.811307] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1720968ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666509.817810] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1763976ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666552.824635] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1806984ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666595.831476] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1849992ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666638.838345] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1893000ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666681.845180] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1936008ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666724.852146] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 1979016ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666767.858782] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 2022024ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666810.865844] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 2065032ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666853.872441] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 2108040ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666896.879251] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 2151048ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666939.886162] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 2194056ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1666982.893049] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 2237064ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1667025.900021] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 2280072ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1667068.906827] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 2323080ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1667111.913419] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 2366088ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1667154.920471] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 2409096ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1667197.927280] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 2452104ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1667240.933904] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 2495112ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
> [1667283.941107] drbd vm2144/0 drbd21440 host101020: We did not send a P_BARRIER for 2538120ms
>  > ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?
>
> [...]
>
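
A side note on reading the warnings quoted above: each message compares
the time without a P_BARRIER against the product it prints itself,
ko-count * timeout, where timeout is given in units of 0.1 s. The
following is only a minimal sketch of that arithmetic. The helper names
(PATTERN, barrier_threshold_ms, parse_warning) are made up for this
illustration and are not part of DRBD, and the sample line is the first
warning from the log with its wrapped halves joined back together.

```python
import re

# Matches the warning text as it appears in the quoted log, once the
# line that the mail client wrapped has been joined again.
PATTERN = re.compile(
    r"We did not send a P_BARRIER for (?P<elapsed>\d+)ms > "
    r"ko-count \((?P<ko>\d+)\) \* timeout \((?P<timeout>\d+) \* 0\.1s\)"
)


def barrier_threshold_ms(ko_count: int, timeout_tenths: int) -> int:
    """Threshold in ms: ko-count * timeout, timeout in units of 0.1 s."""
    return ko_count * timeout_tenths * 100


def parse_warning(line: str):
    """Return (elapsed_ms, threshold_ms) for a warning line, else None."""
    m = PATTERN.search(line)
    if m is None:
        return None
    threshold = barrier_threshold_ms(int(m["ko"]), int(m["timeout"]))
    return int(m["elapsed"]), threshold


line = (
    "[1664789.544862] drbd vm2144/0 drbd21440 host101020: "
    "We did not send a P_BARRIER for 43656ms > "
    "ko-count (7) * timeout (60 * 0.1s); drbd kernel thread blocked?"
)
elapsed, threshold = parse_warning(line)
print(elapsed, threshold, elapsed - threshold)  # 43656 42000 1656
```

With ko-count 7 and timeout 60 the threshold works out to 42000 ms, so
the first warning in the log was printed about 1.7 s after it was
crossed.
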
> We already tried to analyze the DRBD code in order to develop (and provide) a patch.
>
> Can we provide more information or run some tests in order to be able to identify the root cause?
>
> Thanks in advance,
>
> Michael
>
> _______________________________________________
> drbd-dev mailing list
> drbd-dev at lists.linbit.com
> https://lists.linbit.com/mailman/listinfo/drbd-dev

