On 03/06/2012 05:39 PM, Ivan Pavlenko wrote:
> Hi ALL,
>
> I've found a rather strange thing. It looks like a server is trying to
> use the wrong network interface for the DRBD connection. Could you
> explain to me how this is possible? See details below:
>
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> entering GATHER state from 11.
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> Creating commit token because I am the rep.
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> Storing new sequence id for ring 1518
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> entering COMMIT state.
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> entering RECOVERY state.
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> position [0] member 10.102.1.55:
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> previous ring seq 5396 rep 10.10.24.10
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [TOTEM] aru 63
> high delivered 63 received flag 1
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [TOTEM] Did
> not need to originate any messages in recovery.
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> Sending initial ORF token
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [CLM ] CLM
> CONFIGURATION CHANGE
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [CLM ] New
> Configuration:
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> r(0) ip(10.102.1.55)
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> Members Left:
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> r(0) ip(10.10.24.10)
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> Members Joined:
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [CLM ] CLM
> CONFIGURATION CHANGE
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [CLM ] New
> Configuration:
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> r(0) ip(10.102.1.55)
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> Members Left:
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> Members Joined:
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [SYNC ] This
> node is within the primary component and will provide service.
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> entering OPERATIONAL state.
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [CLM ] got
> nodejoin message 10.102.1.55
> Mar 7 11:30:35 infplsm018 <daemon.notice> openais[3142]: [CPG ] got
> joinlist message from node 2
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> entering GATHER state from 11.
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> Storing new sequence id for ring 151c
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> entering COMMIT state.
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> entering RECOVERY state.
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> position [0] member 10.10.24.10:
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> previous ring seq 5400 rep 10.10.24.10
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [TOTEM] aru c
> high delivered c received flag 1
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> position [1] member 10.102.1.55:
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> previous ring seq 5400 rep 10.102.1.55
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [TOTEM] aru d
> high delivered d received flag 1
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [TOTEM] Did
> not need to originate any messages in recovery.
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ] CLM
> CONFIGURATION CHANGE
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ] New
> Configuration:
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> r(0) ip(10.102.1.55)
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> Members Left:
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> Members Joined:
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ] CLM
> CONFIGURATION CHANGE
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ] New
> Configuration:
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> r(0) ip(10.10.24.10)
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> r(0) ip(10.102.1.55)
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> Members Left:
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> Members Joined:
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ] CLM
> CONFIGURATION CHANGE
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ] New
> Configuration:
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> r(0) ip(10.10.24.10)
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> r(0) ip(10.102.1.55)
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> Members Left:
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> Members Joined:
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ]
> r(0) ip(10.10.24.10)
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [SYNC ] This
> node is within the primary component and will provide service.
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [TOTEM]
> entering OPERATIONAL state.
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ] got
> nodejoin message 10.10.24.10
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CLM ] got
> nodejoin message 10.102.1.55
> Mar 7 11:32:03 infplsm018 <daemon.notice> openais[3142]: [CPG ] got
> joinlist message from node 2
> Mar 7 11:32:08 infplsm018 <kern.err> kernel: dlm: got connection from 1
>
> My interfaces:
>
> [root@infplsm018 ~]# ifconfig eth1
> eth1      Link encap:Ethernet  HWaddr 00:50:56:83:02:1D
>           inet addr:10.102.1.55  Bcast:10.255.255.255  Mask:255.0.0.0
>           inet6 addr: fe80::250:56ff:fe83:21d/64 Scope:Link
>           UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
>           RX packets:8644 errors:0 dropped:0 overruns:0 frame:0
>           TX packets:328 errors:0 dropped:0 overruns:0 carrier:0
>           collisions:0 txqueuelen:1000
>           RX bytes:1566503 (1.4 MiB)  TX bytes:60377 (58.9 KiB)
>
> [root@infplsm018 ~]# ifconfig eth2
> eth2      Link encap:Ethernet  HWaddr 00:50:56:83:02:54
>           inet addr:10.10.24.11  Bcast:10.10.24.255  Mask:255.255.255.0
>           inet6 addr: fe80::250:56ff:fe83:254/64 Scope:Link
>           UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
>           RX packets:130333 errors:0 dropped:0 overruns:0 frame:0
>           TX packets:685561 errors:0 dropped:0 overruns:0 carrier:0
>           collisions:0 txqueuelen:1000
>           RX bytes:21234140 (20.2 MiB)  TX bytes:1013834449 (966.8 MiB)
>
> [root@infplsm018 ~]#
>
> My config files:
>
> [root@infplsm018 ~]# cat /etc/drbd.conf
> #
> # please have a a look at the example configuration file in
> # /usr/share/doc/drbd83/drbd.conf
> #
> include "/etc/drbd.d/global_common.conf";
> include "/etc/drbd.d/r0.res";
> [root@infplsm018 ~]#
> [root@infplsm018 ~]# cat /etc/drbd.d/r0.res
> resource r0 {
>     on infplsm017 {
>         device    /dev/drbd1;
>         disk      /dev/sdb1;
>         address   10.10.24.10:7789;
>         meta-disk internal;
>     }
>     on infplsm018 {
>         device    /dev/drbd1;
>         disk      /dev/sdb1;
>         address   10.10.24.11:7789;
>         meta-disk internal;
>     }
> }
> [root@infplsm018 ~]#
>
> Does anybody have any idea why my server (only this one; the second is
> fine!) tries to use the wrong interface?
>
>
> Thank you in advance,
> Ivan
I think you are confusing totem's (corosync's) communication (the log
file entries) with DRBD.
Can you paste your cluster config to confirm?
Digimer
PS - I am tired and may be missing something obvious. :P
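A quick way to compare the two sets of addresses is sketched below in
Python. It only uses values copied from the quoted ifconfig output, the
r0.res file and the openais log; the variable names and the helper
interfaces_covering() are made up for illustration, and this is not
something the cluster stack itself runs. It checks which interface
networks contain the totem address and the DRBD addresses, and whether
the two networks overlap.

```python
import ipaddress

# Interface addresses and netmasks, copied from the quoted ifconfig output.
interfaces = {
    "eth1": ipaddress.ip_interface("10.102.1.55/255.0.0.0"),
    "eth2": ipaddress.ip_interface("10.10.24.11/255.255.255.0"),
}
eth1 = interfaces["eth1"]
eth2 = interfaces["eth2"]

# DRBD addresses from r0.res (port 7789 omitted).
drbd_local = ipaddress.ip_address("10.10.24.11")  # infplsm018
drbd_peer = ipaddress.ip_address("10.10.24.10")   # infplsm017

# Address this node reports for itself in the [TOTEM]/[CLM] log lines.
totem_local = ipaddress.ip_address("10.102.1.55")


def interfaces_covering(addr, ifaces):
    """Return the names of interfaces whose network contains addr."""
    return [name for name, iface in ifaces.items() if addr in iface.network]


print("eth1 network:", eth1.network)
print("eth2 network:", eth2.network)
print("networks overlap:", eth1.network.overlaps(eth2.network))
print("totem local on:", interfaces_covering(totem_local, interfaces))
print("DRBD local on:", interfaces_covering(drbd_local, interfaces))
print("DRBD peer reachable via:", interfaces_covering(drbd_peer, interfaces))
```

With the values as posted, the totem address lives on eth1 and the DRBD
address on eth2, so the log lines do not by themselves show DRBD using
eth1. The 255.0.0.0 mask on eth1 does make its network contain
10.10.24.0/24, though, which may be worth a second look.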
--
Digimer
E-Mail: digimer at alteeve.com
Papers and Projects: https://alteeve.com