[DRBD-user] ...sock was shut down by peer

Ralf Gross Ralf-Lists at ralfgross.de
Wed Dec 20 14:08:28 CET 2006

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


Hi,

I finally got my new server and transferd my drbd.conf from the
working test system to the new server.

ubuntu edgy 64bit
drbd 0.7.20
kernel 2.6.17-10
  
Server 1 VU0EM005: Server with Areca Raid with three 1,5TB volumes
Server 2 VU0EM003: HP DL380G4 with ext. RAID Array and threee 1,5TB volumes

The servers are connected by a EtherChannel and linux bonding.

I had to change some parts (address), but basically it's the same
config as before. 

But now the two nodes can't connect anymore. I'm a bit lost because
the network connection is fine, I can ping both hosts. The only thing
that has changed since by test is the EtherChannel/Bonding between the
two server.

Logfile of host VU0EM003:

Dec 20 13:45:48  drbd: module cleanup done.564] drbd1: worker
terminated
Dec 20 13:45:48  drbd: initialised. Version: 0.7.20 (api:79/proto:74)
[29103]: cstate BrokenPipe --> Unconnected
Dec 20 13:45:48  drbd: SVN Revision: 2260 build by root at VU0EM003,
2006-11-15 13:19:00
Dec 20 13:45:48  drbd: registered as block device major 1471_receiver
[29103]: cstate Unconnected --> WFConnection
Dec 20 13:45:49  drbd0: resync bitmap: bits=365654016 words=5713344
Dec 20 13:45:49  drbd0: size = 1394 GB (1462616064 KB)
Dec 20 13:46:03  drbd0: 1394 GB marked out-of-sync by on disk bit-map.
Dec 20 13:46:03  drbd0: Found 6 transactions (324 active extents) in
activity log.
Dec 20 13:46:03 : cstate Unconfigured --> StandAlone
Dec 20 13:46:03  drbd1: resync bitmap: bits=365654016 words=5713344
Dec 20 13:46:03  drbd1: size = 1394 GB (1462616064 KB)
Dec 20 13:46:16  drbd1: 1394 GB marked out-of-sync by on disk bit-map.
Dec 20 13:46:16  drbd1: Found 6 transactions (324 active extents) in
activity log.
Dec 20 13:46:16 : cstate Unconfigured --> StandAlone
Dec 20 13:46:16  drbd2: resync bitmap: bits=365654016 words=5713344
Dec 20 13:46:16  drbd2: size = 1394 GB (1462616064 KB)
Dec 20 13:46:29  drbd2: 1394 GB marked out-of-sync by on disk bit-map.
Dec 20 13:46:29  drbd2: Found 6 transactions (324 active extents) in
activity log.
Dec 20 13:46:29 : cstate Unconfigured --> StandAlone
Dec 20 13:46:29 : cstate StandAlone --> Unconnected
Dec 20 13:46:29 : cstate Unconnected --> WFConnection
Dec 20 13:46:29 : cstate StandAlone --> Unconnected
Dec 20 13:46:29 : cstate Unconnected --> WFConnection
Dec 20 13:46:29 : cstate StandAlone --> Unconnected
Dec 20 13:46:29 : cstate Unconnected --> WFConnection
Dec 20 13:46:30 : cstate WFConnection --> WFReportParams
Dec 20 13:46:30  drbd2: Handshake successful: DRBD Network Protocol
version 74
Dec 20 13:46:30  drbd2: Connection established.
Dec 20 13:46:30  drbd2: I am(S):
1:00000003:00000001:00000005:00000001:00
Dec 20 13:46:30  drbd2: Peer(S):
1:00000002:00000001:00000004:00000001:00
Dec 20 13:46:30 : cstate WFReportParams --> WFBitMapS
Dec 20 13:46:30 : cstate WFBitMapS --> BrokenPipe
Dec 20 13:46:30  drbd2: asender terminated
Dec 20 13:46:30  drbd2: Secondary/Unknown --> Secondary/Secondary
Dec 20 13:46:30  drbd2: sock was shut down by peer
Dec 20 13:46:30 : cstate BrokenPipe --> BrokenPipe
Dec 20 13:46:30  drbd2: worker terminated
Dec 20 13:46:30 : cstate BrokenPipe --> Unconnected
Dec 20 13:46:30  drbd2: Connection lost.
Dec 20 13:46:30 : cstate Unconnected --> WFConnection
Dec 20 13:46:32 : cstate WFConnection --> WFReportParams
Dec 20 13:46:32  drbd1: Handshake successful: DRBD Network Protocol
version 74
Dec 20 13:46:32  drbd1: Connection established.
Dec 20 13:46:32  drbd1: I am(S):
1:00000003:00000001:00000005:00000001:00
Dec 20 13:46:32  drbd1: Peer(S):
1:00000002:00000001:00000004:00000001:00
Dec 20 13:46:32 : cstate WFReportParams --> WFBitMapS
Dec 20 13:46:32 : cstate WFBitMapS --> BrokenPipe
Dec 20 13:46:32  drbd1: Secondary/Unknown --> Secondary/Secondary
Dec 20 13:46:32  drbd1: asender terminated
Dec 20 13:46:32  drbd1: sock was shut down by peer
Dec 20 13:46:32 : cstate BrokenPipe --> BrokenPipe
Dec 20 13:46:32  drbd1: worker terminated
Dec 20 13:46:32 : cstate BrokenPipe --> Unconnected
Dec 20 13:46:32  drbd1: Connection lost.
Dec 20 13:46:32  drbd: module cleanup done.
Dec 20 13:45:48  drbd: initialised. Version: 0.7.20 (api:79/proto:74)
Dec 20 13:45:48  drbd: SVN Revision: 2260 build by root at VU0EM003,
2006-11-15 13:19:00
Dec 20 13:45:48  drbd: registered as block device major 147
Dec 20 13:45:49  drbd0: resync bitmap: bits=365654016 words=5713344
Dec 20 13:45:49  drbd0: size = 1394 GB (1462616064 KB)
Dec 20 13:46:03  drbd0: 1394 GB marked out-of-sync by on disk bit-map.
Dec 20 13:46:03  drbd0: Found 6 transactions (324 active extents) in
activity log.
Dec 20 13:46:03 : cstate Unconfigured --> StandAlone
Dec 20 13:46:03  drbd1: resync bitmap: bits=365654016 words=5713344
Dec 20 13:46:03  drbd1: size = 1394 GB (1462616064 KB)
Dec 20 13:46:16  drbd1: 1394 GB marked out-of-sync by on disk bit-map.
Dec 20 13:46:16  drbd1: Found 6 transactions (324 active extents) in
activity log.
Dec 20 13:46:16 : cstate Unconfigured --> StandAlone
Dec 20 13:46:16  drbd2: resync bitmap: bits=365654016 words=5713344
Dec 20 13:46:16  drbd2: size = 1394 GB (1462616064 KB)
Dec 20 13:46:29  drbd2: 1394 GB marked out-of-sync by on disk bit-map.
Dec 20 13:46:29  drbd2: Found 6 transactions (324 active extents) in
activity log.
Dec 20 13:46:29 : cstate Unconfigured --> StandAlone
Dec 20 13:46:29 : cstate StandAlone --> Unconnected
Dec 20 13:46:29 : cstate Unconnected --> WFConnection
Dec 20 13:46:29 : cstate StandAlone --> Unconnected
Dec 20 13:46:29 : cstate Unconnected --> WFConnection
Dec 20 13:46:29 : cstate StandAlone --> Unconnected
Dec 20 13:46:29 : cstate Unconnected --> WFConnection
Dec 20 13:46:30 : cstate WFConnection --> WFReportParams
Dec 20 13:46:30  drbd2: Handshake successful: DRBD Network Protocol
version 74
Dec 20 13:46:30  drbd2: Connection established.
Dec 20 13:46:30  drbd2: I am(S):
1:00000003:00000001:00000005:00000001:00
Dec 20 13:46:30  drbd2: Peer(S):
1:00000002:00000001:00000004:00000001:00
Dec 20 13:46:30 : cstate WFReportParams --> WFBitMapS
Dec 20 13:46:30 : cstate WFBitMapS --> BrokenPipe
Dec 20 13:46:30  drbd2: asender terminated
Dec 20 13:46:30  drbd2: Secondary/Unknown --> Secondary/Secondary
Dec 20 13:46:30  drbd2: sock was shut down by peer
Dec 20 13:46:30 : cstate BrokenPipe --> BrokenPipe
Dec 20 13:46:30  drbd2: worker terminated
Dec 20 13:46:30 : cstate BrokenPipe --> Unconnected
Dec 20 13:46:30  drbd2: Connection lost.
Dec 20 13:46:30 : cstate Unconnected --> WFConnection
Dec 20 13:46:32 : cstate WFConnection --> WFReportParams
Dec 20 13:46:32  drbd1: Handshake successful: DRBD Network Protocol
version 74
Dec 20 13:46:32  drbd1: Connection established.
Dec 20 13:46:32  drbd1: I am(S):
1:00000003:00000001:00000005:00000001:00
Dec 20 13:46:32  drbd1: Peer(S):
1:00000002:00000001:00000004:00000001:00
Dec 20 13:46:32 : cstate WFReportParams --> WFBitMapS
Dec 20 13:46:32 : cstate WFBitMapS --> BrokenPipe
Dec 20 13:46:32  drbd1: Secondary/Unknown --> Secondary/Secondary
Dec 20 13:46:32  drbd1: asender terminated
Dec 20 13:46:32  drbd1: sock was shut down by peer
Dec 20 13:46:32 : cstate BrokenPipe --> BrokenPipe
Dec 20 13:46:32  drbd1: worker terminated
Dec 20 13:46:32 : cstate BrokenPipe --> Unconnected
Dec 20 13:46:32  drbd1: Connection lost.
Dec 20 13:46:32 : cstate Unconnected --> WFConnection


Log of host VU0EM005

Dec 20 13:45:25 drbd: initialised. Version: 0.7.20 (api:79/proto:74)
Dec 20 13:45:25 drbd: SVN Revision: 2260 build by root at VU0EM005,
2006-12-20 11:19:31
Dec 20 13:45:25 drbd: registered as block device major 147
Dec 20 13:45:26 drbd0: resync bitmap: bits=366210812 words=5722044
Dec 20 13:45:26 drbd0: size = 1396 GB (1464843248 KB)
Dec 20 13:45:31 drbd0: 0 KB marked out-of-sync by on disk bit-map.
Dec 20 13:45:31 drbd0: No usable activity log found.
Dec 20 13:45:31 drbd0: drbdsetup [8259]: cstate Unconfigured -->
StandAlone
Dec 20 13:45:31 drbd1: resync bitmap: bits=366210812 words=5722044
Dec 20 13:45:31 drbd1: size = 1396 GB (1464843248 KB)
Dec 20 13:45:36 drbd1: 0 KB marked out-of-sync by on disk bit-map.
Dec 20 13:45:36 drbd1: No usable activity log found.
Dec 20 13:45:36 drbd1: drbdsetup [8261]: cstate Unconfigured -->
StandAlone
Dec 20 13:45:36 drbd2: resync bitmap: bits=366210044 words=5722032
Dec 20 13:45:36 drbd2: size = 1396 GB (1464840176 KB)
Dec 20 13:45:40 drbd2: 0 KB marked out-of-sync by on disk bit-map.
Dec 20 13:45:40 drbd2: No usable activity log found.
Dec 20 13:45:40 drbd2: drbdsetup [8263]: cstate Unconfigured -->
StandAlone
Dec 20 13:45:40 drbd0: drbdsetup [8268]: cstate StandAlone -->
Unconnected
Dec 20 13:45:40 drbd0: drbd0_receiver [8269]: cstate Unconnected -->
WFConnection
Dec 20 13:45:40 drbd1: drbdsetup [8270]: cstate StandAlone -->
Unconnected
Dec 20 13:45:40 drbd1: drbd1_receiver [8271]: cstate Unconnected -->
WFConnection
Dec 20 13:45:40 drbd2: drbdsetup [8272]: cstate StandAlone -->
Unconnected
Dec 20 13:45:40 drbd2: drbd2_receiver [8273]: cstate Unconnected -->
WFConnection
Dec 20 13:45:42 drbd0: drbd0_receiver [8269]: cstate WFConnection -->
WFReportParams
Dec 20 13:45:42 drbd0: Handshake successful: DRBD Network Protocol
version 74
Dec 20 13:45:42 drbd0: drbd0_receiver [8269]: cstate WFReportParams
--> StandAlone
Dec 20 13:45:42 drbd0: worker terminated
Dec 20 13:45:42 drbd0: asender terminated
Dec 20 13:45:42 drbd0: drbd0_receiver [8269]: cstate StandAlone -->
StandAlone
Dec 20 13:45:42 drbd0: Connection lost.
Dec 20 13:45:42 drbd0: receiver terminated
Dec 20 13:46:28 drbd2: drbd2_receiver [8273]: cstate WFConnection -->
WFReportParams
Dec 20 13:46:28 drbd2: Handshake successful: DRBD Network Protocol
version 74
Dec 20 13:46:28 drbd2: drbd2_receiver [8273]: cstate WFReportParams
--> StandAlone
Dec 20 13:46:28 drbd2: worker terminated
Dec 20 13:46:28 drbd2: asender terminated
Dec 20 13:46:28 drbd2: drbd2_receiver [8273]: cstate StandAlone -->
StandAlone
Dec 20 13:46:28 drbd2: Connection lost.
Dec 20 13:46:28 drbd2: receiver terminated
Dec 20 13:46:29 drbd1: drbd1_receiver [8271]: cstate WFConnection -->
WFReportParams
Dec 20 13:46:29 drbd1: Handshake successful: DRBD Network Protocol
version 74
Dec 20 13:46:29 drbd1: drbd1_receiver [8271]: cstate WFReportParams
--> StandAlone
Dec 20 13:46:29 drbd1: worker terminated
Dec 20 13:46:29 drbd1: asender terminated
Dec 20 13:46:29 drbd1: drbd1_receiver [8271]: cstate StandAlone -->
StandAlone
Dec 20 13:46:29 drbd1: Connection lost.
Dec 20 13:46:29 drbd1: receiver terminated




I already tuned my config file for the HP cciss device, I don't know
if this is neccessary, because only the meta data is on this device. 

global {
  minor-count 5;
  dialog-refresh 5;
}

resource r0 {
  protocol C;
  incon-degr-cmd "echo'!DRBD! pri on icon-degr' | wall; sleep 60; halt
-f";

 on VU0EM003 {
  device /dev/drbd0;
  disk /dev/sda;
  address 10.60.7.190:7788;
  meta-disk /dev/cciss/c0d0p3[0];
 }

 on VU0EM005 {
  device /dev/drbd0;
  disk /dev/sdf;
  address 10.60.7.189:7788;
  meta-disk /dev/sda3[0];
 }

 disk {
  on-io-error detach;
 }

 net {
  sndbuf-size      1M;
  max-buffers      20480;
  max-epoch-size   16384;
  unplug-watermark 20480;
  ko-count 4;
  on-disconnect reconnect;
 }

 syncer {
  rate 100M;
  group 1;
  al-extents 257;
 }

 startup {
  wfc-timeout 0;
  degr-wfc-timeout 120;
 }
}

resource r1 {
  protocol C;
  incon-degr-cmd "echo'!DRBD! pri on icon-degr' | wall; sleep 60; halt
-f";

 on VU0EM003 {
  device /dev/drbd1;
  disk /dev/sdb;
  address 10.60.7.190:7789;
  meta-disk /dev/cciss/c0d0p3[1];
 }

 on VU0EM005 {
  device /dev/drbd1;
  disk /dev/sdg;
  address 10.60.7.189:7789;
  meta-disk /dev/sda3[1];
 }

 disk {
  on-io-error detach;
 }

 net {
  sndbuf-size      1M;
  max-buffers      20480;
  max-epoch-size   16384;
  unplug-watermark 20480;
  ko-count 4;
  on-disconnect reconnect;
 }

 syncer {
  rate 100M;
  group 1;
  al-extents 257;
 }

 startup {
  wfc-timeout 0;
  degr-wfc-timeout 120;
 }
}

resource r2 {
  protocol C;
  incon-degr-cmd "echo'!DRBD! pri on icon-degr' | wall; sleep 60; halt
-f";

 on VU0EM003 {
  device /dev/drbd2;
  disk /dev/sdc;
  address 10.60.7.190:7790;
  meta-disk /dev/cciss/c0d0p3[2];
 }

 on VU0EM005 {
  device /dev/drbd2;
  disk /dev/sdh;
  address 10.60.7.189:7790;
  meta-disk /dev/sda3[2];
 }

 disk {
  on-io-error detach;
 }

 net {
  sndbuf-size      1M;
  max-buffers      20480;
  max-epoch-size   16384;
  unplug-watermark 20480;
  ko-count 4;
  on-disconnect reconnect;
 }

 syncer {
  rate 100M;
  group 1;
  al-extents 257;
 }

 startup {
  wfc-timeout 0;
  degr-wfc-timeout 120;
 }
}





More information about the drbd-user mailing list