Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Hi all I'm using drbd 8.3.0 with fence configured for my resources and I'd like to use discard-zero-changes for after-sb-0pri and consensus for after-sb-1pri; the problem is that when a node fence the other looks like that they both create a new UUID and the discard-zero-changes doesn't work. Is my analysis correct? Is there something wrong in my configuration? Thank you for your help. common { protocol C; net { after-sb-0pri discard-zero-changes; after-sb-1pri consensus; after-sb-2pri disconnect; } handlers { outdate-peer "/usr/lib/drbd/obliterate-peer.sh"; } } resource mystorage { startup { become-primary-on both; } net { allow-two-primaries; } disk { fencing resource-and-stonith; } on node1 { device /dev/drbd0; disk /dev/md3; address 10.0.0.1:7788; flexible-meta-disk internal; } on node2 { device /dev/drbd0; disk /dev/md3; address 10.0.0.2:7788; flexible-meta-disk internal; } } # cat /proc/drbd version: 8.3.0 (api:88/proto:86-89) GIT-hash: 9ba8b93e24d842f0dd3fb1f9b90e8348ddb95829 build by federico at nethesis.it, 2009-02-20 15:23:22 0: cs:Connected ro:Primary/Primary ds:UpToDate/UpToDate C r--- ns:0 nr:8192 dw:8192 dr:236 al:0 bm:4 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:0 [ --- node1 --- ] Feb 26 09:57:17 node1 kernel: drbd0: PingAck did not arrive in time. Feb 26 09:57:17 node1 kernel: drbd0: peer( Primary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) susp( 0 -> 1 ) Feb 26 09:57:17 node1 kernel: drbd0: asender terminated Feb 26 09:57:17 node1 kernel: drbd0: Terminating asender thread Feb 26 09:57:17 node1 kernel: drbd0: short read expecting header on sock: r=-512 Feb 26 09:57:17 node1 kernel: drbd0: Creating new current UUID Feb 26 09:57:17 node1 /usr/lib/drbd/obliterate-peer.sh: Local node ID: 1 / Remote node: node2 Feb 26 09:57:17 node1 kernel: drbd0: Connection closed Feb 26 09:57:17 node1 kernel: drbd0: helper command: /sbin/drbdadm fence-peer minor-0 Feb 26 09:57:29 node1 openais[1922]: [TOTEM] The token was lost in the OPERATIONAL state. Feb 26 09:57:29 node1 openais[1922]: [TOTEM] Receive multicast socket recv buffer size (288000 bytes). Feb 26 09:57:29 node1 openais[1922]: [TOTEM] Transmit multicast socket send buffer size (219136 bytes). Feb 26 09:57:29 node1 openais[1922]: [TOTEM] entering GATHER state from 2. Feb 26 09:57:30 node1 fence_node[3561]: Fence of "node2" was successful Feb 26 09:57:30 node1 kernel: drbd0: helper command: /sbin/drbdadm fence-peer minor-0 exit code 7 (0x700) Feb 26 09:57:30 node1 kernel: drbd0: fence-peer helper returned 7 (peer was stonithed) Feb 26 09:57:30 node1 kernel: drbd0: pdsk( DUnknown -> Outdated ) Feb 26 09:57:30 node1 kernel: drbd0: susp( 1 -> 0 ) Feb 26 09:57:30 node1 kernel: drbd0: conn( NetworkFailure -> Unconnected ) Feb 26 09:57:30 node1 kernel: drbd0: receiver terminated Feb 26 09:57:30 node1 kernel: drbd0: Restarting receiver thread Feb 26 09:57:30 node1 kernel: drbd0: receiver (re)started Feb 26 09:57:30 node1 kernel: drbd0: conn( Unconnected -> WFConnection ) [ --- node2 --- ] Feb 26 09:57:19 node2 kernel: drbd0: PingAck did not arrive in time. Feb 26 09:57:19 node2 kernel: drbd0: peer( Primary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) susp( 0 -> 1 ) Feb 26 09:57:19 node2 kernel: drbd0: asender terminated Feb 26 09:57:19 node2 kernel: drbd0: Terminating asender thread Feb 26 09:57:19 node2 kernel: drbd0: short read expecting header on sock: r=-512 Feb 26 09:57:19 node2 kernel: drbd0: Creating new current UUID Feb 26 09:57:19 node2 kernel: drbd0: Connection closed Feb 26 09:57:19 node2 kernel: drbd0: helper command: /sbin/drbdadm fence-peer minor-0 Feb 26 09:57:20 node2 /usr/lib/drbd/obliterate-peer.sh: Local node ID: 2 / Remote node: node1 [ --- node2 fenced --- ] [ --- boot node2 ---] Feb 26 09:58:32 node2 kernel: drbd0: drbd_sync_handshake: Feb 26 09:58:32 node2 kernel: drbd0: self CA3C3A2ADBB72A1E:32FD37CBFD965ABB:FB2B6776FC9D45FA:0002D187F14A403B Feb 26 09:58:32 node2 kernel: drbd0: peer 09BF6602E2EFF0BD:32FD37CBFD965ABB:FB2B6776FC9D45FA:0002D187F14A403B Feb 26 09:58:32 node2 kernel: drbd0: uuid_compare()=100 by rule 9 Feb 26 09:58:32 node2 kernel: drbd0: Split-Brain detected, dropping connection! Feb 26 09:58:32 node2 kernel: drbd0: self CA3C3A2ADBB72A1E:32FD37CBFD965ABB:FB2B6776FC9D45FA:0002D187F14A403B Feb 26 09:58:32 node2 kernel: drbd0: peer 09BF6602E2EFF0BD:32FD37CBFD965ABB:FB2B6776FC9D45FA:0002D187F14A403B Feb 26 09:58:32 node2 kernel: drbd0: helper command: /sbin/drbdadm split-brain minor-0 Feb 26 09:58:32 node2 kernel: drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0) Feb 26 09:58:32 node2 kernel: drbd0: conn( WFReportParams -> Disconnecting ) Feb 26 09:58:32 node2 kernel: drbd0: error receiving ReportState, l: 4! Feb 26 09:58:32 node2 kernel: drbd0: asender terminated Feb 26 09:58:32 node2 kernel: drbd0: Terminating asender thread Feb 26 09:58:32 node2 kernel: drbd0: Connection closed Feb 26 09:58:32 node2 kernel: drbd0: conn( Disconnecting -> StandAlone ) Feb 26 09:58:32 node2 kernel: drbd0: receiver terminated Feb 26 09:58:32 node2 kernel: drbd0: Terminating receiver thread -- Federico.