[DRBD-user] "PingAck not received" messages

Matthew Bloch matthew at bytemark.co.uk
Tue May 22 12:45:12 CEST 2012

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


On 22/05/12 08:16, Felix Frank wrote:
> On 05/21/2012 06:56 PM, Matthew Bloch wrote:
>> Thanks for the accounts Pascal and Felix, though Felix I'm pretty
>> certain Debian/lenny's kernel had a virtio bug that does cause its
>> network to break and require a "rmmod virtio_net; modprobe virtio_net"
>> to fix.  That's nothing to do with drbd, and your problem may be
>> entirely separate from that as well :)
> 
> Right - I agree that guest issues could not conceivably cause DRBD
> issues on the host. I had wrongly inferred that you were DRBDing from
> inside a guest.

Indeed, I started logging this command every second, and e.g. this kind
of event is typical every few hours:

  dd if=/dev/zero of=/dev/drbd13 conv=fdatasync bs=1M count=1 2>&1 | \
    grep copied

2012-05-22 02:17:00 W 1048576 bytes (1.0 MB) copied, 0.0115253 s, 91.0 MB/s
2012-05-22 02:17:01 W 1048576 bytes (1.0 MB) copied, 0.011519 s, 91.0 MB/s
2012-05-22 02:17:02 W 1048576 bytes (1.0 MB) copied, 0.0116563 s, 90.0 MB/s
2012-05-22 02:17:03 W 1048576 bytes (1.0 MB) copied, 1.1898 s, 881 kB/s
2012-05-22 02:17:05 W 1048576 bytes (1.0 MB) copied, 28.3202 s, 37.0 kB/s
2012-05-22 02:17:35 W 1048576 bytes (1.0 MB) copied, 0.0127468 s, 82.3 MB/s
2012-05-22 02:17:36 W 1048576 bytes (1.0 MB) copied, 0.0113499 s, 92.4 MB/s
2012-05-22 02:17:37 W 1048576 bytes (1.0 MB) copied, 0.0112707 s, 93.0 MB/s

And in the kernel log:

May 22 02:17:11 v3a kernel: [1341064.126449] block drbd13:
[drbd13_worker/797] sock_sendmsg time expired, ko = 4294967295
May 22 02:17:17 v3a kernel: [1341070.129829] block drbd13:
[drbd13_worker/797] sock_sendmsg time expired, ko = 4294967294
May 22 02:17:23 v3a kernel: [1341076.133170] block drbd13:
[drbd13_worker/797] sock_sendmsg time expired, ko = 4294967293
May 22 02:17:29 v3a kernel: [1341082.133592] block drbd13:
[drbd13_worker/797] sock_sendmsg time expired, ko = 4294967292

Curiously, the "v3a" host (on which I'm running this test) just shows
these disconnects, it's the "v3b" host that gives the "PingAck not
received" messages.  But not in this instance.

-- 
Matthew



More information about the drbd-user mailing list