Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
hI: i am from china,i have a some serious problem with my drbd,we did a ha for our mysql database with drbd ,everything is ok when we did not use NIC bond,but when we bond the nic with mode 1(active-backup),the link between the drbd perfom strange,just disconect-unknown-connect,following is the log: ------------------------------ Mar 27 13:57:58 mysql03 kernel: block drbd0: updated sync UUID B0A60E84594BD05F:9C83E27AB8FC9A75:9C82E27AB8FC9A75:9C81E27AB8FC9A75 Mar 27 13:57:58 mysql03 kernel: block drbd0: Digest mismatch, buffer modified by upper layers during write: 999570272s +4096 Mar 27 13:57:58 mysql03 kernel: drbd clusterdb_res: sock was shut down by peer Mar 27 13:57:58 mysql03 kernel: drbd clusterdb_res: peer( Secondary -> Unknown ) conn( SyncSource -> BrokenPipe ) Mar 27 13:57:58 mysql03 kernel: drbd clusterdb_res: short read (expected size 16) Mar 27 13:57:58 mysql03 kernel: drbd clusterdb_res: asender terminated Mar 27 13:57:58 mysql03 kernel: drbd clusterdb_res: Terminating drbd_a_clusterd Mar 27 13:57:58 mysql03 kernel: drbd clusterdb_res: Connection closed Mar 27 13:57:58 mysql03 kernel: drbd clusterdb_res: conn( BrokenPipe -> Unconnected ) Mar 27 13:57:58 mysql03 kernel: drbd clusterdb_res: receiver terminated Mar 27 13:57:58 mysql03 kernel: drbd clusterdb_res: Restarting receiver thread Mar 27 13:57:58 mysql03 kernel: drbd clusterdb_res: receiver (re)started Mar 27 13:57:58 mysql03 kernel: drbd clusterdb_res: conn( Unconnected -> WFConnection ) Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: Handshake successful: Agreed network protocol version 101 Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: Agreed to support TRIM on protocol level Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: Peer authenticated using 20 bytes HMAC Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: conn( WFConnection -> WFReportParams ) Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: Starting asender thread (from drbd_r_clusterd [60474]) Mar 27 13:57:59 mysql03 kernel: block drbd0: drbd_sync_handshake: Mar 27 13:57:59 mysql03 kernel: block drbd0: self B0A60E84594BD05F:9C83E27AB8FC9A75:9C82E27AB8FC9A75:9C81E27AB8FC9A75 bits:155141533 flags:0 Mar 27 13:57:59 mysql03 kernel: block drbd0: peer 9C83E27AB8FC9A74:0000000000000000:86B283665E3AEF84:86B183665E3AEF85 bits:155141256 flags:0 Mar 27 13:57:59 mysql03 kernel: block drbd0: uuid_compare()=1 by rule 70 Mar 27 13:57:59 mysql03 kernel: block drbd0: Becoming sync source due to disk states. Mar 27 13:57:59 mysql03 kernel: block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) Mar 27 13:57:59 mysql03 kernel: block drbd0: send bitmap stats [Bytes(packets)]: plain 0(0), RLE 952(1), total 952; compression: 100.0% Mar 27 13:57:59 mysql03 kernel: block drbd0: receive bitmap stats [Bytes(packets)]: plain 0(0), RLE 952(1), total 952; compression: 100.0% Mar 27 13:57:59 mysql03 kernel: block drbd0: helper command: /sbin/drbdadm before-resync-source minor-0 Mar 27 13:57:59 mysql03 kernel: block drbd0: helper command: /sbin/drbdadm before-resync-source minor-0 exit code 0 (0x0) Mar 27 13:57:59 mysql03 kernel: block drbd0: conn( WFBitMapS -> SyncSource ) Mar 27 13:57:59 mysql03 kernel: block drbd0: Began resync as SyncSource (will sync 620566148 KB [155141537 bits set]). Mar 27 13:57:59 mysql03 kernel: block drbd0: updated sync UUID B0A60E84594BD05F:9C84E27AB8FC9A75:9C83E27AB8FC9A75:9C82E27AB8FC9A75 Mar 27 13:57:59 mysql03 kernel: block drbd0: Digest mismatch, buffer modified by upper layers during write: 999570424s +4096 Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: sock was shut down by peer Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: peer( Secondary -> Unknown ) conn( SyncSource -> BrokenPipe ) Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: short read (expected size 16) Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: asender terminated Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: Terminating drbd_a_clusterd Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: Connection closed Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: conn( BrokenPipe -> Unconnected ) Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: receiver terminated Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: Restarting receiver thread Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: receiver (re)started Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: conn( Unconnected -> WFConnection ) Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: Handshake successful: Agreed network protocol version 101 Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: Agreed to support TRIM on protocol level Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: Peer authenticated using 20 bytes HMAC Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: conn( WFConnection -> WFReportParams ) Mar 27 13:57:59 mysql03 kernel: drbd clusterdb_res: Starting asender thread (from drbd_r_clusterd [60474]) Mar 27 13:57:59 mysql03 kernel: block drbd0: drbd_sync_handshake: --------------------------------------------------------- and this hanppend again and again we can not find where the problem is ,i have read the DRBD'S user guide,it recommand to use bond with mode active-backup,i want to know is it the problem for our situation? the following is our config: resource clusterdb_res { device /dev/drbd0; disk /dev/sda3; meta-disk internal; on mysql03 { address 10.10.10.70:7791; } on mysql01 { address 10.10.10.71:7791; } } [root at mysql03 drbd.d]# cat global_common.conf global { usage-count yes; } common { protocol C; handlers { pri-on-incon-degr "/usr/lib/drbd/notify-pri-on-incon-degr.sh; /usr/lib/drbd/notify-emergency-reboot.sh; echo b > /proc/sysrq-trigger ; reboot -f"; pri-lost-after-sb "/usr/lib/drbd/notify-pri-lost-after-sb.sh; /usr/lib/drbd/notify-emergency-reboot.sh; echo b > /proc/sysrq-trigger ; reboot -f"; local-io-error "/usr/lib/drbd/notify-io-error.sh; /usr/lib/drbd/notify-emergency-shutdown.sh; echo o > /proc/sysrq-trigger ; halt -f"; fence-peer "/usr/lib/drbd/crm-fence-peer.sh"; #split-brain "/usr/lib/drbd/notify-split-brain.sh root"; #out-of-sync "/usr/lib/drbd/notify-out-of-sync.sh root"; after-resync-target "/usr/lib/drbd/crm-unfence-peer.sh"; } startup { } disk { on-io-error detach; } net { cram-hmac-alg sha1; shared-secret "UcfGroup"; data-integrity-alg sha1; after-sb-0pri disconnect; after-sb-1pri disconnect; after-sb-2pri disconnect; } syncer { #on-no-data-accessible io-error; #verify-alg sha1; rate 1000M; } } -----------------------next is the bond information----- [root at mysql03 bonding]# cat bond0 Ethernet Channel Bonding Driver: v3.6.0 (September 26, 2009) Bonding Mode: fault-tolerance (active-backup) Primary Slave: None Currently Active Slave: eth0 MII Status: up MII Polling Interval (ms): 100 Up Delay (ms): 0 Down Delay (ms): 0 Slave Interface: eth0 MII Status: up Speed: 1000 Mbps Duplex: full Link Failure Count: 0 Permanent HW addr: ac:16:2d:88:28:08 Slave queue ID: 0 Slave Interface: eth4 MII Status: up Speed: 1000 Mbps Duplex: full Link Failure Count: 0 Permanent HW addr: f4:ce:46:a5:81:94 Slave queue ID: 0 ------------------------------------------ 10: bond0: <BROADCAST,MULTICAST,MASTER,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP link/ether ac:16:2d:88:28:08 brd ff:ff:ff:ff:ff:ff inet 10.10.10.70/24 brd 10.10.10.255 scope global bond0 inet 10.10.10.100/24 brd 10.10.10.255 scope global secondary bond0 inet6 fe80::ae16:2dff:fe88:2808/64 scope link valid_lft forever preferred_lft forever --------------------------------------release info-------------------------------------------------- [root at mysql03 bonding]# uname -a Linux mysql03 2.6.32-431.5.1.el6.x86_64 #1 SMP Wed Feb 12 00:41:43 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux [root at mysql03 bonding]# drbdadm -V DRBDADM_BUILDTAG=GIT-hash:\ 599f286440bd633d15d5ff985204aff4bccffadd\ build\ by\ mockbuild at Build64R6\,\ 2013-10-25\ 09:52:00 DRBDADM_API_VERSION=1 DRBD_KERNEL_VERSION_CODE=0x080404 DRBDADM_VERSION_CODE=0x080404 DRBDADM_VERSION=8.4.4 ------------------------------------------------------------- so ,i am so appreciate if you can give me so clue to solve the problem,if you want more information about our config ,just let me know, 3q waiting online ~~~ -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20140328/050e1c97/attachment.htm>