[DRBD-user] communication constantly terminated, always re-syncing

Raoul Bhatia [IPAX] r.bhatia at ipax.at
Fri Mar 4 14:18:50 CET 2011

Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.


On 03/02/2011 06:42 PM, Cory Coager wrote:
> On 02/25/2011 10:01 AM, Coager, Cory wrote:
>>> there should be more information before the fencing handler being
>>> called.
>>>
>>> cheers,
>>> raoul
> 
> Still looking for help from anyone out there.
> 
> node1 dmesg: http://pastebin.com/LeRtm89H
> node1 /var/log/messages: http://pastebin.com/u3hL8Nxa
> node2 dmesg: http://pastebin.com/vymSpqep
> node2 /var/log/messages: http://pastebin.com/fyUe4zzS

hi,

is this an *unfiltered*, *unmodified* /var/log/messages?

where did this messages go:
> Feb 24 11:24:38 node1 crm-fence-peer.sh[30361]: invoked for postgres
> Feb 24 11:24:39 node1 crm-fence-peer.sh[30361]: Call cib_query failed (-41): Remote node did not respond
> Feb 24 11:24:41 node1 cib: [30431]: info: write_cib_contents: Archived previous version as /var/lib/heartbeat/crm/cib-39.raw
> Feb 24 11:24:41 node1 kernel: [505827.918922] block drbd0: helper command: /sbin/drbdadm fence-peer minor-0 exit code 4 (0x400)

i actually wanted to see /var/log/syslog - perhaps there are not
logged at /var/log/messages.


basically, what i wood do:


analyse what happens around such an event:
> Feb 25 10:00:06 node1 kernel: [587152.376002] block drbd0: sock was shut down by peer
> Feb 25 10:00:06 node1 kernel: [587152.376011] block drbd0: peer( Secondary -> Unknown ) conn( SyncSource -> BrokenPipe )
> Feb 25 10:00:06 node1 kernel: [587152.376032] block drbd0: asender terminated
> Feb 25 10:00:06 node1 kernel: [587152.376034] block drbd0: Terminating asender thread
> Feb 25 10:00:06 node1 kernel: [587152.392944] block drbd0: Connection closed
> Feb 25 10:00:06 node1 kernel: [587152.392948] block drbd0: conn( BrokenPipe -> Unconnected )
> Feb 25 10:00:06 node1 kernel: [587152.392954] block drbd0: receiver terminated
> ...
> Feb 25 10:00:06 node1 kernel: [587152.689967] block drbd0: Becoming sync source due to disk states.
> Feb 25 10:00:06 node1 kernel: [587152.689971] block drbd0: peer( Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS )
> Feb 25 10:00:06 node1 kernel: [587152.988008] block drbd0: conn( WFBitMapS -> SyncSource )
> Feb 25 10:00:06 node1 kernel: [587152.988023] block drbd0: Began resync as SyncSource (will sync 203224292 KB [50806073 bits set]).


on *both* nodes, via /var/log/syslog (where hopefully everything
is correctly logged)

cheers,
raoul
-- 
____________________________________________________________________
DI (FH) Raoul Bhatia M.Sc.          email.          r.bhatia at ipax.at
Technischer Leiter

IPAX - Aloy Bhatia Hava OG          web.          http://www.ipax.at
Barawitzkagasse 10/2/2/11           email.            office at ipax.at
1190 Wien                           tel.               +43 1 3670030
FN 277995t HG Wien                  fax.            +43 1 3670030 15
____________________________________________________________________



More information about the drbd-user mailing list