Hi
We observed that drbd was completely locked (in a kernel thread?). All
drbdsetup commands blocked and were never executed. This happened after a
short network failure. The drbd log is below. Some other running
processes, such as squid, were also locked: they stopped working and
could not be killed or restarted.
We are using drbd-0.7.14
11:24:33 kernel drbd0: PingAck did not arrive in time.
11:24:33 kernel drbd0: drbd0_asender [1814]: cstate Connected --> NetworkFailure
11:24:33 kernel drbd0: asender terminated
11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate NetworkFailure --> BrokenPipe
11:24:33 kernel drbd0: short read expecting header on sock: r=-512
11:24:33 kernel drbd0: worker terminated
11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate BrokenPipe --> Unconnected
11:24:33 kernel drbd0: Connection lost.
11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate Unconnected --> WFConnection
11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate WFConnection --> WFReportParams
11:24:33 kernel drbd0: Handshake successful: DRBD Network Protocol version 74
11:24:33 kernel drbd0: Connection established.
11:24:33 kernel drbd0: I am(P): 1:00000002:00000001:00000020:00000012:10
11:24:33 kernel drbd0: Peer(S): 1:00000002:00000001:0000001f:00000012:01
11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate WFReportParams --> WFBitMapS
11:24:33 kernel drbd0: Primary/Unknown --> Primary/Secondary
11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate WFBitMapS --> SyncSource
11:24:33 kernel drbd0: Resync started as SyncSource (need to sync 624 KB [156 bits set]).
11:24:33 kernel drbd0: Resync done (total 1 sec; paused 0 sec; 624 K/sec)
11:24:33 kernel drbd0: drbd0_worker [13681]: cstate SyncSource --> Connected
11:29:37 kernel drbd0: [kupdated/6] sock_sendmsg time expired, ko = 4294967295
11:29:40 kernel drbd0: PingAck did not arrive in time.
11:29:40 kernel drbd0: drbd0_asender [13686]: cstate Connected --> NetworkFailure
11:29:40 kernel drbd0: asender terminated
11:29:40 kernel drbd0: drbd0_receiver [1799]: cstate NetworkFailure --> BrokenPipe
11:29:40 kernel drbd0: short read expecting header on sock: r=-512
11:29:40 kernel drbd0: short sent UnplugRemote size=8 sent=-1001
11:29:40 kernel drbd0: worker terminated
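
In case it helps anyone reading the log: below is a minimal Python sketch that rejoins lines wrapped by the mail client and lists the cstate transitions in order. It is not part of drbd. The function names (unwrap, cstate_transitions) are made up for illustration, and the pattern only assumes the line layout visible in the excerpt above.

```python
import re

# Sample copied from the log above, including the mail client's line wrapping.
SAMPLE = """\
11:24:33 kernel drbd0: PingAck did not arrive in time.
11:24:33 kernel drbd0: drbd0_asender [1814]: cstate Connected -->
NetworkFailure
11:24:33 kernel drbd0: asender terminated
11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate NetworkFailure -->
BrokenPipe
"""

# A log entry starts with an HH:MM:SS timestamp; anything else is a wrapped tail.
TIMESTAMP = re.compile(r"^\d{2}:\d{2}:\d{2} ")

# Layout assumed from the excerpt: time, "kernel", device, thread [pid], old --> new.
CSTATE = re.compile(
    r"^(\d{2}:\d{2}:\d{2}) kernel (\w+): (\w+) \[(\d+)\]: cstate (\w+) --> (\w+)$"
)


def unwrap(text):
    """Rejoin lines that were wrapped (they have no leading timestamp)."""
    entries = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if TIMESTAMP.match(line) or not entries:
            entries.append(line)
        else:
            entries[-1] += " " + line
    return entries


def cstate_transitions(text):
    """Return (time, thread, old_state, new_state) for each cstate change."""
    result = []
    for entry in unwrap(text):
        match = CSTATE.match(entry)
        if match:
            time, _device, thread, _pid, old, new = match.groups()
            result.append((time, thread, old, new))
    return result


if __name__ == "__main__":
    for transition in cstate_transitions(SAMPLE):
        print("%s  %-16s %s -> %s" % transition)
```

Run against the full log, this should make it easier to see which thread drove each state change before the hang.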
Has anyone seen this before? Could this have been fixed in a more recent drbd version?
Thanks and regards
--
Wim Ceulemans
R&D Engineer
------------------------------------------------------
Able NV Tel: +32(0)15 50.44.00
Dellingstraat 28b Fax: +32(0)15.50.44.09
B-2800 Mechelen
Belgium mailto:wim.ceulemans at able.be
http://www.axsguard.com http://www.doITsafe.net
aXs GUARD - internet communication appliance
------------------------------------------------------