Hi All,
We're encountering a resync problem with 8.3.6. After a resync, the
target node transitions to UpToDate, which the peer sees, but then
another state transition follows that reports the pdsk state as
UpToDate -> Inconsistent. The fault occurred after we lost connectivity
to our peer, which was then rebooted; the resync began after that.
Here's the config (same for all resources):
/sbin/drbdsetup /dev/drbd16 show
disk {
    size                0s _is_default; # bytes
    on-io-error         detach;
    fencing             dont-care _is_default;
    max-bio-bvecs       0 _is_default;
}
net {
    timeout             60 _is_default; # 1/10 seconds
    max-epoch-size      2048 _is_default;
    max-buffers         2048 _is_default;
    unplug-watermark    128 _is_default;
    connect-int         10 _is_default; # seconds
    ping-int            10 _is_default; # seconds
    sndbuf-size         0 _is_default; # bytes
    rcvbuf-size         0 _is_default; # bytes
    ko-count            2;
    allow-two-primaries;
    after-sb-0pri       discard-zero-changes;
    after-sb-1pri       violently-as0p;
    after-sb-2pri       violently-as0p;
    rr-conflict         violently;
    ping-timeout        20; # 1/10 seconds
}
syncer {
    rate                30720k; # bytes/second
    after               15;
    al-extents          709;
}
protocol C;
_this_host {
    device              minor 16;
    disk                "/dev/disk-drbd16";
    meta-disk           internal;
    address             ipv4 169.254.84.220:8916;
}
_remote_host {
    address             ipv4 169.254.214.196:8916;
}
Below are the log snippets from both sides. I have full logs if needed;
I tried sending them to the list, but even zipped I can't get them
across.
(Source)
Feb 6 01:57:13 node0 kernel: block drbd16: Starting asender thread (from drbd16_receiver [4790])
Feb 6 01:57:13 node0 kernel: block drbd16: data-integrity-alg: <not-used>
Feb 6 01:57:13 node0 kernel: block drbd16: drbd_sync_handshake:
Feb 6 01:57:13 node0 kernel: block drbd16: self 93E3C6B459596F95:3F4D478748F24FE7:6E1F4F316DBF9291:0000000000000006 bits:0 flags:0
Feb 6 01:57:13 node0 kernel: block drbd16: peer 3F4D478748F24FE6:0000000000000000:6E1F4F316DBF9290:0000000000000006 bits:0 flags:0
Feb 6 01:57:13 node0 kernel: block drbd16: uuid_compare()=1 by rule 70
Feb 6 01:57:13 node0 kernel: block drbd16: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk( DUnknown -> UpToDate ) peer_isp( 0 -> 1 )
Feb 6 01:57:13 node0 kernel: block drbd16: conn( WFBitMapS -> PausedSyncS ) pdsk( UpToDate -> Inconsistent )
Feb 6 01:57:13 node0 kernel: block drbd16: Began resync as PausedSyncS (will sync 0 KB [0 bits set]).
Feb 6 01:57:14 node0 kernel: block drbd16: aftr_isp( 1 -> 0 )
Feb 6 01:57:15 node0 kernel: block drbd16: Resync done (total 2 sec; paused 0 sec; 0 K/sec)
Feb 6 01:57:15 node0 kernel: block drbd16: conn( PausedSyncS -> Connected ) pdsk( Inconsistent -> UpToDate )
Feb 6 01:57:15 node0 kernel: block drbd16: pdsk( UpToDate -> Inconsistent ) peer_isp( 1 -> 0 )
(Target)
Feb 6 01:57:13 node1 kernel: block drbd16: Starting asender thread (from drbd16_receiver [18186])
Feb 6 01:57:13 node1 kernel: block drbd16: data-integrity-alg: <not-used>
Feb 6 01:57:13 node1 kernel: block drbd16: drbd_sync_handshake:
Feb 6 01:57:13 node1 kernel: block drbd16: self 3F4D478748F24FE6:0000000000000000:6E1F4F316DBF9290:0000000000000006 bits:0 flags:0
Feb 6 01:57:13 node1 kernel: block drbd16: peer 93E3C6B459596F95:3F4D478748F24FE7:6E1F4F316DBF9291:0000000000000006 bits:0 flags:0
Feb 6 01:57:13 node1 kernel: block drbd16: uuid_compare()=-1 by rule 50
Feb 6 01:57:13 node1 kernel: block drbd16: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate ) peer_isp( 0 -> 1 )
Feb 6 01:57:13 node1 kernel: block drbd16: conn( WFBitMapT -> WFSyncUUID )
Feb 6 01:57:13 node1 kernel: block drbd16: helper command: /usr/lib/spine/bin/avance_drbd_helper before-resync-target minor-16
Feb 6 01:57:13 node1 kernel: block drbd16: helper command: /usr/lib/spine/bin/avance_drbd_helper before-resync-target minor-16 exit code 0 (0x0)
Feb 6 01:57:13 node1 kernel: block drbd16: conn( WFSyncUUID -> PausedSyncT ) disk( UpToDate -> Inconsistent )
Feb 6 01:57:13 node1 kernel: block drbd16: Began resync as PausedSyncT (will sync 0 KB [0 bits set]).
Feb 6 01:57:14 node1 kernel: block drbd16: aftr_isp( 1 -> 0 )
Feb 6 01:57:15 node1 kernel: block drbd16: Resync done (total 2 sec; paused 0 sec; 0 K/sec)
Feb 6 01:57:15 node1 kernel: block drbd16: conn( PausedSyncT -> Connected ) disk( Inconsistent -> UpToDate )
Feb 6 01:57:15 node1 kernel: block drbd16: helper command: /usr/lib/spine/bin/avance_drbd_helper after-resync-target minor-16
Feb 6 01:57:15 node1 kernel: block drbd16: helper command: /usr/lib/spine/bin/avance_drbd_helper after-resync-target minor-16 exit code 0 (0x0)
Feb 6 01:57:15 node1 kernel: block drbd16: peer_isp( 1 -> 0 )
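For anyone scanning the full logs for the same symptom, here is a minimal Python sketch that flags any line where a state field leaves UpToDate after "Resync done" has been logged. It assumes one kernel message per line and the "name( old -> new )" transition format seen in the snippets above; the function names and the SAMPLE list are hypothetical and not part of DRBD.

```python
import re

# Matches one state change such as "pdsk( UpToDate -> Inconsistent )".
TRANSITION = re.compile(r"(\w+)\(\s*(\S+)\s*->\s*(\S+)\s*\)")


def transitions(line):
    """Return a list of (field, old, new) tuples found in one log line."""
    return TRANSITION.findall(line)


def downgrades_after_resync(lines, field="pdsk"):
    """Return lines where `field` leaves UpToDate after 'Resync done' was seen."""
    resync_done = False
    hits = []
    for line in lines:
        if "Resync done" in line:
            resync_done = True
            continue
        for name, old, new in transitions(line):
            if resync_done and name == field and old == "UpToDate" and new != "UpToDate":
                hits.append(line)
    return hits


# Four lines taken from the node0 (source) snippet above.
SAMPLE = [
    "Feb 6 01:57:13 node0 kernel: block drbd16: conn( WFBitMapS -> PausedSyncS ) pdsk( UpToDate -> Inconsistent )",
    "Feb 6 01:57:15 node0 kernel: block drbd16: Resync done (total 2 sec; paused 0 sec; 0 K/sec)",
    "Feb 6 01:57:15 node0 kernel: block drbd16: conn( PausedSyncS -> Connected ) pdsk( Inconsistent -> UpToDate )",
    "Feb 6 01:57:15 node0 kernel: block drbd16: pdsk( UpToDate -> Inconsistent ) peer_isp( 1 -> 0 )",
]

if __name__ == "__main__":
    for hit in downgrades_after_resync(SAMPLE):
        print(hit)
```

The pdsk downgrade at 01:57:13 is ignored because it is the expected one at the start of the resync; only the final line, logged after "Resync done", is reported. Passing field="disk" would apply the same check to the target's own disk state.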
We're trying to reproduce it now but haven't had any success so far. Any
ideas? Thanks.
Peter