Hello<br>I am using drbd 0.7.21 & it got stuck in broken pipe state . Looks like drbd worker thread is not able to exit . Please suggest what should I do ? <br><br>Syslog at Primary Node:<br> Dec 12 19:29:21 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74<br>
Dec 12 19:29:21 kernel: drbd0: Connection established.<br> Dec 12 19:29:21 kernel: drbd0: I am(P): 1:00000002:00000001:00000005:00000004:10<br> Dec 12 19:29:21 kernel: drbd0: Peer(S): 1:00000002:00000001:00000004:00000004:00<br>
Dec 12 19:29:21 kernel: drbd0: drbd0_receiver [16730]: cstate WFReportParams --> WFBitMapS<br> Dec 12 19:29:21 kernel: drbd0: Primary/Unknown --> Primary/Secondary<br> Dec 12 19:29:21 kernel: drbd0: drbd0_receiver [16730]: cstate WFBitMapS --> SyncSource<br>
Dec 12 19:29:21 kernel: drbd0: Resync started as SyncSource (need to sync 16 KB [4 bits set]).<br> Dec 12 19:29:21 kernel: drbd0: Resync done (total 1 sec; paused 0 sec; 16 K/sec)<br> Dec 12 19:29:21 kernel: drbd0: drbd0_worker [24592]: cstate SyncSource --> Connected<br>
Dec 12 19:29:22 kernel: drbd1: drbd1_receiver [16738]: cstate WFConnection --> WFReportParams<br> Dec 12 19:29:22 kernel: drbd1: Handshake successful: DRBD Network Protocol version 74<br> Dec 12 19:29:22 kernel: drbd1: Connection established.<br>
Dec 12 19:29:22 kernel: drbd1: I am(P): 1:00000002:00000001:00000006:00000002:10<br> Dec 12 19:29:22 kernel: drbd1: Peer(S): 1:00000002:00000001:00000005:00000002:00<br> Dec 12 19:29:22 kernel: drbd1: drbd1_receiver [16738]: cstate WFReportParams --> WFBitMapS<br>
Dec 12 19:29:22 kernel: drbd1: Primary/Unknown --> Primary/Secondary<br> Dec 12 19:29:22 kernel: drbd1: drbd1_receiver [16738]: cstate WFBitMapS --> SyncSource<br> Dec 12 19:29:22 kernel: drbd1: Resync started as SyncSource (need to sync 1488 KB [372 bits set]).<br>
Dec 12 19:29:22 kernel: drbd1: Resync done (total 1 sec; paused 0 sec; 1488 K/sec)<br> Dec 12 19:29:22 kernel: drbd1: drbd1_worker [24593]: cstate SyncSource --> Connected<br> <br>Note : Here Secondary gone for reboot :<br>
Dec 12 19:55:14 kernel: drbd0: sock was shut down by peer<br> Dec 12 19:55:14 kernel: drbd0: drbd0_receiver [16730]: cstate Connected --> BrokenPipe<br> Dec 12 19:55:14 kernel: drbd0: short read expecting header on sock: r=0<br>
Dec 12 19:55:14 kernel: drbd0: meta connection shut down by peer.<br> Dec 12 19:55:14 kernel: drbd0: worker terminated<br> Dec 12 19:55:14 kernel: drbd0: asender terminated<br> Dec 12 19:55:14 kernel: drbd0: drbd0_receiver [16730]: cstate BrokenPipe --> Unconnected<br>
Dec 12 19:55:14 kernel: drbd0: Connection lost.<br> Dec 12 19:55:14 kernel: drbd0: drbd0_receiver [16730]: cstate Unconnected --> WFConnection<br> Dec 12 19:55:14 kernel: drbd1: sock was reset by peer<br>
Dec 12 19:55:14 kernel: drbd1: meta connection shut down by peer.<br> Dec 12 19:55:14 kernel: drbd1: sock_sendmsg returned -32<br> Dec 12 19:55:14 kernel: drbd1: drbd1_receiver [16738]: cstate Connected --> BrokenPipe<br>
Dec 12 19:55:14 kernel: drbd1: short read expecting header on sock: r=-104<br> Dec 12 19:55:14 kernel: drbd1: asender terminated<br> Dec 12 19:55:14 kernel: drbd1: drbd1_worker [24593]: cstate BrokenPipe --> BrokenPipe<br>
Dec 12 19:55:14 kernel: drbd1: short sent UnplugRemote size=8 sent=0<br><br>Syslog at Secondary Node:<br> Dec 12 19:29:21 kernel: drbd0: Handshake successful: DRBD Network Protocol version 74<br> Dec 12 19:29:21 kernel: drbd0: Connection established.<br>
Dec 12 19:29:21 kernel: drbd0: I am(S): 1:00000002:00000001:00000004:00000004:00<br> Dec 12 19:29:21 kernel: drbd0: Peer(P): 1:00000002:00000001:00000005:00000004:10<br> Dec 12 19:29:21 kernel: drbd0: drbd0_receiver [16754]: cstate WFReportParams --> WFBitMapT<br>
Dec 12 19:29:21 kernel: drbd0: Secondary/Unknown --> Secondary/Primary<br> Dec 12 19:29:21 kernel: drbd0: drbd0_receiver [16754]: cstate WFBitMapT --> SyncTarget<br> Dec 12 19:29:21 kernel: drbd0: Resync started as SyncTarget (need to sync 16 KB [4 bits set]).<br>
Dec 12 19:29:21 kernel: drbd0: Resync done (total 1 sec; paused 0 sec; 16 K/sec)<br> Dec 12 19:29:21 kernel: drbd0: drbd0_worker [16732]: cstate SyncTarget --> Connected<br> Dec 12 19:29:22 kernel: drbd1: drbd1_receiver [16762]: cstate WFConnection --> WFReportParams<br>
Dec 12 19:29:22 kernel: drbd1: Handshake successful: DRBD Network Protocol version 74<br> Dec 12 19:29:22 kernel: drbd1: Connection established.<br> Dec 12 19:29:22 kernel: drbd1: I am(S): 1:00000002:00000001:00000005:00000002:00<br>
Dec 12 19:29:22 kernel: drbd1: Peer(P): 1:00000002:00000001:00000006:00000002:10<br> Dec 12 19:29:22 kernel: drbd1: drbd1_receiver [16762]: cstate WFReportParams --> WFBitMapT<br> Dec 12 19:29:22 kernel: drbd1: Secondary/Unknown --> Secondary/Primary<br>
<br>Note : Now system is going for reboot:<br> Dec 12 19:55:14 kernel: drbd0: drbdsetup [2393]: cstate Connected --> Unconnected<br> Dec 12 19:55:14 kernel: drbd0: drbd0_receiver [16754]: cstate Unconnected --> BrokenPipe<br>
Dec 12 19:55:14 kernel: drbd0: short read expecting header on sock: r=-512<br> Dec 12 19:55:14 kernel: drbd0: worker terminated<br> Dec 12 19:55:14 kernel: drbd0: asender terminated<br> Dec 12 19:55:14 kernel: drbd0: drbd0_receiver [16754]: cstate BrokenPipe --> StandAlone<br>
Dec 12 19:55:14 kernel: drbd0: Connection lost.<br> Dec 12 19:55:14 kernel: drbd0: receiver terminated<br> Dec 12 19:55:14 kernel: drbd0: drbdsetup [2393]: cstate StandAlone --> StandAlone<br> Dec 12 19:55:14 kernel: drbd0: drbdsetup [2393]: cstate StandAlone --> Unconfigured<br>
Dec 12 19:55:14 kernel: drbd0: worker terminated<br> Dec 12 19:55:14 kernel: drbd1: drbdsetup [2398]: cstate Connected --> Unconnected<br> Dec 12 19:55:14 kernel: drbd1: drbd1_receiver [16762]: cstate Unconnected --> BrokenPipe<br>
Dec 12 19:55:14 kernel: drbd1: short read expecting header on sock: r=-512<br> Dec 12 19:55:14 kernel: drbd1: worker terminated<br> Dec 12 19:55:14 kernel: drbd1: asender terminated<br> Dec 12 19:55:14 kernel: drbd1: drbd1_receiver [16762]: cstate BrokenPipe --> StandAlone<br>
Dec 12 19:55:14 kernel: drbd1: Connection lost.<br> Dec 12 19:55:14 kernel: drbd1: receiver terminated<br> Dec 12 19:55:14 kernel: drbd1: drbdsetup [2398]: cstate StandAlone --> StandAlone<br> Dec 12 19:55:14 kernel: drbd1: drbdsetup [2398]: cstate StandAlone --> Unconfigured<br>
Dec 12 19:55:14 kernel: drbd1: worker terminated<br><br><br>Any valuable input will be helpful.<br><br>Thanks in advance.<br><br>Regards,<br>Anil <br><br>