Hi,
I'm using DRBD 8.3 on Red Hat 5.5, and when I try to sync the two nodes,
the sync stalls indefinitely. I looked for help but couldn't find any.
rpm -qa |grep drbd
drbd83-8.3.8-1.el5.centos
kmod-drbd83-8.3.8-1.el5.centos
rpm -qa |grep kernel
kernel-headers-2.6.18-194.el5
kernel-devel-2.6.18-194.el5
kernel-2.6.18-194.el5
cat /proc/drbd
version: 8.3.8 (api:88/proto:86-94)
GIT-hash: d78846e52224fd00562f7c225bcc25b2d422321d build by mockbuild at builder10.centos.org, 2010-06-04 08:04:09
 0: cs:SyncSource ro:Primary/Secondary ds:UpToDate/Inconsistent C r----
    ns:83712000 nr:0 dw:0 dr:83712000 al:0 bm:5109 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:209292364
    [>....................] sync'ed: 0.1% (204384/204384)M delay_probe: 31064
    stalled
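For anyone comparing this status against their own, the key:value tokens in the /proc/drbd output above can be pulled apart with a few lines of Python. This is only a minimal sketch: the two sample strings are copied from the output above, and the helper name parse_fields is made up for illustration. In DRBD 8.3, cs is the connection state, ro the local/peer roles, ds the local/peer disk states, and oos the amount of out-of-sync data in KiB.

```python
import re

# Sample lines copied from the /proc/drbd output in this message.
STATUS_LINE = "0: cs:SyncSource ro:Primary/Secondary ds:UpToDate/Inconsistent C r----"
COUNTER_LINE = ("ns:83712000 nr:0 dw:0 dr:83712000 al:0 bm:5109 lo:0 pe:0 "
                "ua:0 ap:0 ep:1 wo:b oos:209292364")


def parse_fields(text):
    """Return a dict of key:value tokens such as cs:SyncSource (hypothetical helper)."""
    # A token is word characters, a colon, then non-space characters.
    # The bare minor number "0:" is skipped because a space follows its colon.
    return dict(re.findall(r"(\w+):(\S+)", text))


status = parse_fields(STATUS_LINE)
counters = parse_fields(COUNTER_LINE)

local_role, peer_role = status["ro"].split("/")
local_disk, peer_disk = status["ds"].split("/")
out_of_sync_mib = int(counters["oos"]) // 1024  # oos is reported in KiB

print(status["cs"], local_role, peer_role, local_disk, peer_disk, out_of_sync_mib)
```

Here the local node is the Primary with an UpToDate disk, the peer is an Inconsistent Secondary, and nearly the whole device is still out of sync.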
drbd driver loaded OK; device status:
version: 8.3.8 (api:88/proto:86-94)
GIT-hash: d78846e52224fd00562f7c225bcc25b2d422321d build by mockbuild at builder10.centos.org, 2010-06-04 08:04:09
m:res cs ro ds p mounted fstype
stalled
... sync'ed: 0.1% (204384/204384)M delay_probe:
0:cdrs SyncSource Primary/Secondary UpToDate/Inconsistent C
[/var/log/messages on server1]
Jul 12 18:27:47 LMS1 kernel: block drbd0: peer( Secondary -> Unknown ) conn( SyncSource -> TearDown )
Jul 12 18:27:47 LMS1 kernel: block drbd0: meta connection shut down by peer.
Jul 12 18:27:47 LMS1 kernel: block drbd0: asender terminated
Jul 12 18:27:47 LMS1 kernel: block drbd0: Terminating asender thread
Jul 12 18:27:47 LMS1 kernel: block drbd0: Connection closed
Jul 12 18:27:47 LMS1 kernel: block drbd0: conn( TearDown -> Unconnected )
Jul 12 18:27:47 LMS1 kernel: block drbd0: receiver terminated
Jul 12 18:27:47 LMS1 kernel: block drbd0: Restarting receiver thread
Jul 12 18:27:47 LMS1 kernel: block drbd0: receiver (re)started
Jul 12 18:27:47 LMS1 kernel: block drbd0: conn( Unconnected -> WFConnection )
Jul 12 18:28:48 LMS1 kernel: block drbd0: Handshake successful: Agreed network protocol version 94
Jul 12 18:28:48 LMS1 kernel: block drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
Jul 12 18:28:48 LMS1 kernel: block drbd0: conn( WFConnection -> WFReportParams )
Jul 12 18:28:48 LMS1 kernel: block drbd0: Starting asender thread (from drbd0_receiver [12058])
Jul 12 18:28:48 LMS1 kernel: block drbd0: data-integrity-alg: <not-used>
Jul 12 18:28:48 LMS1 kernel: block drbd0: drbd_sync_handshake:
Jul 12 18:28:48 LMS1 kernel: block drbd0: self D015F68F0A997E7F:3FE1237214F4A834:0000000000000004:0000000000000000 bits:52323091 flags:0
Jul 12 18:28:48 LMS1 kernel: block drbd0: peer 3FE1237214F4A834:0000000000000000:0000000000000000:0000000000000000 bits:52323091 flags:0
Jul 12 18:28:48 LMS1 kernel: block drbd0: uuid_compare()=1 by rule 70
Jul 12 18:28:48 LMS1 kernel: block drbd0: Becoming sync source due to disk states.
Jul 12 18:28:48 LMS1 kernel: block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS )
Jul 12 18:28:49 LMS1 kernel: block drbd0: conn( WFBitMapS -> SyncSource )
Jul 12 18:28:49 LMS1 kernel: block drbd0: Began resync as SyncSource (will sync 209292364 KB [52323091 bits set]).
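The resync size in the last log line can be cross-checked against the bitmap count from the handshake. The sketch below assumes one bitmap bit covers a 4 KiB block. That granularity is an assumption, although it matches the two numbers in the log exactly. The function name resync_kib is hypothetical.

```python
BITS_SET = 52323091      # "bits:" value from the drbd_sync_handshake lines
KIB_PER_BIT = 4          # assumption: each bitmap bit tracks one 4 KiB block
LOGGED_KIB = 209292364   # "will sync 209292364 KB" from the log


def resync_kib(bits, kib_per_bit=KIB_PER_BIT):
    """Amount of data, in KiB, that a resync of `bits` dirty bits must transfer."""
    return bits * kib_per_bit


total_kib = resync_kib(BITS_SET)
total_gib = total_kib / (1024 * 1024)

print(total_kib == LOGGED_KIB)          # the bit count accounts for the logged size
print(f"{total_gib:.1f} GiB to resync")
```

Both nodes report the same bit count, so the figure corresponds to a full resync of roughly 200 GiB, not a partial one.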
[/var/log/messages on server2]
Jul 12 18:16:02 LMS2 kernel: device eth0 entered promiscuous mode
Jul 12 18:16:21 LMS2 kernel: device eth0 left promiscuous mode
Jul 12 18:16:43 LMS2 kernel: device eth0 entered promiscuous mode
Jul 12 18:16:52 LMS2 kernel: device eth0 left promiscuous mode
Jul 12 18:27:58 LMS2 kernel: block drbd0: peer( Primary -> Unknown ) conn( SyncTarget -> Disconnecting ) pdsk( UpToDate -> DUnknown )
Jul 12 18:27:58 LMS2 kernel: block drbd0: short read expecting header on sock: r=-512
Jul 12 18:27:58 LMS2 kernel: block drbd0: asender terminated
Jul 12 18:27:58 LMS2 kernel: block drbd0: Terminating asender thread
Jul 12 18:27:58 LMS2 kernel: block drbd0: Connection closed
Jul 12 18:27:58 LMS2 kernel: block drbd0: conn( Disconnecting -> StandAlone )
Jul 12 18:27:58 LMS2 kernel: block drbd0: receiver terminated
Jul 12 18:27:58 LMS2 kernel: block drbd0: Terminating receiver thread
Jul 12 18:28:38 LMS2 kernel: block drbd0: conn( StandAlone -> Unconnected )
Jul 12 18:28:38 LMS2 kernel: block drbd0: Starting receiver thread (from drbd0_worker [14427])
Jul 12 18:28:38 LMS2 kernel: block drbd0: receiver (re)started
Jul 12 18:28:38 LMS2 kernel: block drbd0: conn( Unconnected -> WFConnection )
Jul 12 18:28:59 LMS2 kernel: block drbd0: Handshake successful: Agreed network protocol version 94
Jul 12 18:28:59 LMS2 kernel: block drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
Jul 12 18:28:59 LMS2 kernel: block drbd0: conn( WFConnection -> WFReportParams )
Jul 12 18:28:59 LMS2 kernel: block drbd0: Starting asender thread (from drbd0_receiver [24260])
Jul 12 18:28:59 LMS2 kernel: block drbd0: data-integrity-alg: <not-used>
Jul 12 18:28:59 LMS2 kernel: block drbd0: drbd_sync_handshake:
Jul 12 18:28:59 LMS2 kernel: block drbd0: self 3FE1237214F4A834:0000000000000000:0000000000000000:0000000000000000 bits:52323091 flags:0
Jul 12 18:28:59 LMS2 kernel: block drbd0: peer D015F68F0A997E7F:3FE1237214F4A834:0000000000000004:0000000000000000 bits:52323091 flags:0
Jul 12 18:28:59 LMS2 kernel: block drbd0: uuid_compare()=-1 by rule 50
Jul 12 18:28:59 LMS2 kernel: block drbd0: Becoming sync target due to disk states.
Jul 12 18:28:59 LMS2 kernel: block drbd0: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate )
Jul 12 18:28:59 LMS2 kernel: block drbd0: conn( WFBitMapT -> WFSyncUUID )
Jul 12 18:28:59 LMS2 kernel: block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0
Jul 12 18:28:59 LMS2 kernel: block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0 exit code 0 (0x0)
Jul 12 18:28:59 LMS2 kernel: block drbd0: conn( WFSyncUUID -> SyncTarget )
Jul 12 18:28:59 LMS2 kernel: block drbd0: Began resync as SyncTarget (will sync 209292364 KB [52323091 bits set]).
[drbd.conf]
global {
    usage-count no;
}
common {
    syncer {
        rate 500M; # in MBytes
    }
    net {
        cram-hmac-alg sha1;
        shared-secret "password";
        after-sb-0pri discard-zero-changes;
        after-sb-1pri discard-secondary;
        after-sb-2pri disconnect;
    }
}
resource cdrs {
    protocol C;
    on LMS1 {
        device /dev/drbd0;
        disk /dev/cciss/c0d1p1;
        address 192.168.1.1:7790;
        meta-disk internal;
    }
    on LMS2 {
        device /dev/drbd0;
        disk /dev/cciss/c0d1p1;
        address 192.168.1.2:7790;
        meta-disk internal;
    }
}