[DRBD-user] drbd locked and blocks other processes

Wim Ceulemans wim.ceulemans at able.be
Wed Jul 12 09:01:33 CEST 2006


Hi

We observed that drbd was completely locked up (in a kernel thread?).
All drbdsetup commands blocked and never executed. This happened after a
short network failure. The drbd log is below. Some other running
processes, such as squid, were also locked up: they stopped working and
could not be killed or restarted.
We are using drbd-0.7.14.

11:24:33 kernel drbd0: PingAck did not arrive in time.
11:24:33 kernel drbd0: drbd0_asender [1814]: cstate Connected --> NetworkFailure
11:24:33 kernel drbd0: asender terminated
11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate NetworkFailure --> BrokenPipe
11:24:33 kernel drbd0: short read expecting header on sock: r=-512
11:24:33 kernel drbd0: worker terminated
11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate BrokenPipe --> Unconnected
11:24:33 kernel drbd0: Connection lost.
11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate Unconnected --> WFConnection
11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate WFConnection --> WFReportParams
11:24:33 kernel drbd0: Handshake successful: DRBD Network Protocol version 74
11:24:33 kernel drbd0: Connection established.
11:24:33 kernel drbd0: I am(P): 1:00000002:00000001:00000020:00000012:10
11:24:33 kernel drbd0: Peer(S): 1:00000002:00000001:0000001f:00000012:01
11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate WFReportParams --> WFBitMapS
11:24:33 kernel drbd0: Primary/Unknown --> Primary/Secondary
11:24:33 kernel drbd0: drbd0_receiver [1799]: cstate WFBitMapS --> SyncSource
11:24:33 kernel drbd0: Resync started as SyncSource (need to sync 624 KB [156 bits set]).
11:24:33 kernel drbd0: Resync done (total 1 sec; paused 0 sec; 624 K/sec)
11:24:33 kernel drbd0: drbd0_worker [13681]: cstate SyncSource --> Connected
11:29:37 kernel drbd0: [kupdated/6] sock_sendmsg time expired, ko = 4294967295
11:29:40 kernel drbd0: PingAck did not arrive in time.
11:29:40 kernel drbd0: drbd0_asender [13686]: cstate Connected --> NetworkFailure
11:29:40 kernel drbd0: asender terminated
11:29:40 kernel drbd0: drbd0_receiver [1799]: cstate NetworkFailure --> BrokenPipe
11:29:40 kernel drbd0: short read expecting header on sock: r=-512
11:29:40 kernel drbd0: short sent UnplugRemote size=8 sent=-1001
11:29:40 kernel drbd0: worker terminated
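
To make the sequence of connection state changes in a log like the one
above easier to follow, the cstate transitions can be pulled out with a
small script. This is only a minimal sketch and not part of DRBD: it
assumes the "HH:MM:SS kernel drbdN: thread [pid]: cstate A --> B" line
format seen in this log, treats any line that does not start with a
timestamp as a mail-wrapped continuation of the previous entry, and the
names unwrap and cstate_transitions are made up for illustration.

```python
import re

# Assumed formats, taken from the log excerpt in this message only.
TIMESTAMP = re.compile(r"^\d{2}:\d{2}:\d{2} ")
TRANSITION = re.compile(
    r"^(\d{2}:\d{2}:\d{2}) kernel (\w+): (\w+) \[(\d+)\]: "
    r"cstate (\w+) --> (\w+)$"
)


def unwrap(lines):
    """Join mail-wrapped continuation lines onto the preceding log entry."""
    entries = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if TIMESTAMP.match(line) or not entries:
            entries.append(line)          # a new log entry
        else:
            entries[-1] += " " + line     # continuation of the previous one
    return entries


def cstate_transitions(lines):
    """Return (time, thread, pid, old_state, new_state) for each transition."""
    found = []
    for entry in unwrap(lines):
        match = TRANSITION.match(entry)
        if match:
            time, _device, thread, pid, old, new = match.groups()
            found.append((time, thread, int(pid), old, new))
    return found


if __name__ == "__main__":
    import sys

    # Usage: python cstate.py < drbd.log
    for time, thread, pid, old, new in cstate_transitions(sys.stdin):
        print(f"{time} {thread}[{pid}]: {old} -> {new}")
```

Run against the log above, this would list only the state changes, which
makes it easier to see that the second failure at 11:29:40 stops after
BrokenPipe and never gets back to WFConnection.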

Has anyone seen this before? Could this have been fixed in a more recent
drbd version?

Thanks and regards

-- 
Wim Ceulemans
R&D Engineer
------------------------------------------------------
Able NV                            Tel: +32(0)15 50.44.00
Dellingstraat 28b               Fax: +32(0)15.50.44.09
B-2800 Mechelen
Belgium                   mailto:wim.ceulemans at able.be
http://www.axsguard.com        http://www.doITsafe.net

   aXs GUARD - internet communication appliance
------------------------------------------------------
