Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Hi We observed that drbd was completely locked (in kernel thread?). All drbdsetup commands blocked and were not executed. This was after a short network failure. The drbd log is below. Some other running processes like squid where also locked, didn't work anymore and could not be killed and restarted. We are using drbd-0.7.14 11:24:33 kernel drbd0: PingAck did not arrive in time. 11:24:33 kernel drbd0: drbd0_asender [1814]: cstate Connected --> NetworkFailure 11:24:33 kernel drbd0: asender terminated 11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate NetworkFailure --> BrokenPipe 11:24:33 kernel drbd0: short read expecting header on sock: r=-512 11:24:33 kernel drbd0: worker terminated 11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate BrokenPipe --> Unconnected 11:24:33 kernel drbd0: Connection lost. 11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate Unconnected --> WFConnection 11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate WFConnection --> WFReportParams 11:24:33 kernel drbd0: Handshake successful: DRBD Network Protocol version 74 11:24:33 kernel drbd0: Connection established. 11:24:33 kernel drbd0: I am(P): 1:00000002:00000001:00000020:00000012:10 11:24:33 kernel drbd0: Peer(S): 1:00000002:00000001:0000001f:00000012:01 11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate WFReportParams --> WFBitMapS 11:24:33 kernel drbd0: Primary/Unknown --> Primary/Secondary 11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate WFBitMapS --> SyncSource 11:24:33 kernel drbd0: Resync started as SyncSource (need to sync 624 KB [156 bits set]). 11:24:33 kernel drbd0: Resync done (total 1 sec; paused 0 sec; 624 K/sec) 11:24:33 kernel drbd0: drbd0_worker [13681]: cstate SyncSource --> Connected 11:29:37 kernel drbd0: [kupdated/6] sock_sendmsg time expired, ko = 4294967295 11:29:40 kernel drbd0: PingAck did not arrive in time. 11:29:40 kernel drbd0: drbd0_asender [13686]: cstate Connected --> NetworkFailure 11:29:40 kernel drbd0: asender terminated 11:29:40 kernel drbd0: drbd0_receiver [1799]: cstate NetworkFailure --> BrokenPipe 11:29:40 kernel drbd0: short read expecting header on sock: r=-512 11:29:40 kernel drbd0: short sent UnplugRemote size=8 sent=-1001 11:29:40 kernel drbd0: worker terminated Anyone seen this before? Could this be fixed in a more recent drbd version? Thanks and regards -- Wim Ceulemans R&D Engineer ------------------------------------------------------ Able NV Tel: +32(0)15 50.44.00 Dellingstraat 28b Fax: +32(0)15.50.44.09 B-2800 Mechelen Belgium mailto:wim.ceulemans at able.be http://www.axsguard.com http://www.doITsafe.net aXs GUARD - internet communication appliance ------------------------------------------------------ -- Wim Ceulemans R&D Engineer ------------------------------------------------------ Able NV Tel: +32(0)15 50.44.00 Dellingstraat 28b Fax: +32(0)15.50.44.09 B-2800 Mechelen Belgium mailto:wim.ceulemans at able.be http://www.axsguard.com http://www.doITsafe.net aXs GUARD - internet communication appliance ------------------------------------------------------ -- --------------------------------------------------- Able: 1996-2006: already 10 safe years in YOUR company! aXs GUARD has completed security and anti-virus checks on this e-mail (http://www.axsguard.com) --------------------------------------------------- Able NV: ond.nr 0457.938.087 RPR Mechelen