>>From: drbd-user-bounces at lists.linbit.com [mailto:drbd-user-bounces at lists.linbit.com] On Behalf Of Lars Ellenberg
>>On Wed, Sep 24, 2014 at 11:31:58PM +1000, Klint Gore wrote:
>>> Looks like it exists. Same file exists on both nodes (md5 matches).
>>> Is there a way to tell what version it is? Should there be other
>>> files as well?
>>
>>rpm -qf /usr/lib/ocf/resource.d/linbit/drbd
>
>[root@hans0 log]# rpm -qf /usr/lib/ocf/resource.d/linbit/drbd
>drbd84-utils-8.9.1-1.el7.elrepo.x86_64
>
>> do the resources listed by "drbdadm dump" match the resource names used in the pacemaker configuration?
>
>yes
>
>> do you get something different for "drbdadm -c /etc/drbd.conf dump"?
>
>They're the same
>
>[root@hans0 tmp]# drbdadm dump >admdump
>[root@hans0 tmp]# drbdadm -c /etc/drbd.conf dump >confdump
>[root@hans0 tmp]# ll *dump
>-rw-r--r--. 1 root root 3256 Sep 25 09:30 admdump
>-rw-r--r--. 1 root root 3256 Sep 25 09:30 confdump
>[root@hans0 tmp]# diff admdump confdump
>[root@hans0 tmp]#
>
>
>While trying things yesterday, I seem to have changed something, because there is now a new error in the log:
>
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ Could not establish cib_rw connection: Permission denied (13) ]
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ Error signing on to the CIB service: Transport endpoint is not connected ]
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: info: log_finished: finished - rsc:drbd_homeagbu action:notify call_id:117 pid:17296 exit-code:5 exec-time:15058ms queue-time:0ms
>
>Line 226 in ocf-shellfuncs is in the function ha_log and reads:
> 222 if
> 223 [ -n "$HA_LOGFILE" ]
> 224 then
> 225 : appending to $HA_LOGFILE
> 226 echo "$HA_LOGTAG: "`hadate`"${*}" >> $HA_LOGFILE
> 227 fi
>
>The permissions on /usr/lib/ocf/lib/heartbeat/* were all 644, so I changed them to 755 (owner and group are root). The permissions on /var/log/pacemaker.log are 660; owner is hacluster, group is haclient.
>I changed that to 666, but it doesn't seem to help.
>
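Since changing the mode bits did not help, it may be worth testing the append itself rather than reading the permissions. The following is only a minimal sketch: `can_append` is a hypothetical helper name, not part of ocf-shellfuncs, and it simply attempts the same kind of `>>` redirect that line 226 performs. It would need to be run as the same user and in the same environment as the failing agent to be meaningful. Note that the check creates the file if it does not exist yet.

```shell
#!/bin/sh
# can_append FILE (hypothetical helper): succeeds only if the current
# user can open FILE for appending, like the ">> $HA_LOGFILE" redirect.
can_append() {
    # Subshell: a failed redirect must not abort the calling script.
    # The ":" builtin writes nothing, so an existing file is unchanged.
    ( : >> "$1" ) 2>/dev/null
}

# Only run the check when a path is passed on the command line.
if [ -n "$1" ]; then
    if can_append "$1"; then
        echo "append OK: $1"
    else
        echo "append FAILED: $1"
    fi
fi
```

Saved under an arbitrary name such as check_append.sh, it could be called as `sh check_append.sh /var/log/pacemaker.log`.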
>And now it's giving me the 'not installed' message instead of 'not configured'.
>Sep 25 14:26:03 [19406] hans0.une.edu.au pengine: error: unpack_rsc_op: No further recovery can be attempted for drbd_homeagbu:0: stop action failed with 'not installed' (5)
>Sep 25 14:26:03 [19406] hans0.une.edu.au pengine: notice: unpack_rsc_op: Preventing master_drbd from re-starting on hans0: operation stop failed 'not installed' (rc=5)
>Sep 25 14:26:03 [19406] hans0.une.edu.au pengine: warning: unpack_rsc_op: Processing failed op stop for drbd_homeagbu:0 on hans0: not installed (5)
>
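For reading these messages: the numbers in parentheses are OCF resource agent return codes, where 5 is OCF_ERR_INSTALLED ("not installed") and 6 is OCF_ERR_CONFIGURED ("not configured"). As a rough aid, the mapping can be sketched as a small shell function. The name `ocf_rc_name` is made up for illustration and is not part of ocf-shellfuncs; the code names are the ones used by the resource agents of this era.

```shell
#!/bin/sh
# ocf_rc_name CODE (hypothetical helper): print the symbolic name of an
# OCF resource agent return code.
ocf_rc_name() {
    case "$1" in
        0) echo OCF_SUCCESS ;;
        1) echo OCF_ERR_GENERIC ;;
        2) echo OCF_ERR_ARGS ;;
        3) echo OCF_ERR_UNIMPLEMENTED ;;
        4) echo OCF_ERR_PERM ;;
        5) echo OCF_ERR_INSTALLED ;;
        6) echo OCF_ERR_CONFIGURED ;;
        7) echo OCF_NOT_RUNNING ;;
        8) echo OCF_RUNNING_MASTER ;;
        9) echo OCF_FAILED_MASTER ;;
        *) echo "UNKNOWN($1)" ;;
    esac
}

ocf_rc_name 5   # the code reported for the failed stop above
ocf_rc_name 6   # the code behind the earlier 'not configured' message
```

Pacemaker treats both 5 and 6 as hard errors, which fits the "No further recovery can be attempted" line in the log.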
>
>Attempting manual start on one of the drbd resources
>[root@hans0 log]# pcs resource debug-start drbd_homeagbu
>Operation start for drbd_homeagbu:0 (ocf:linbit:drbd) returned 0
> > stdout:
> > stdout:
> > stdout:
> > stderr: WARNING: You may be disappointed: This RA is intended for pacemaker 1.0 or better!
> > stderr: WARNING: homeagbu already Primary, demoting.
> > stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf secondary homeagbu
> > stderr: DEBUG: homeagbu: Exit code 0
> > stderr: DEBUG: homeagbu: Command output:
> > stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf adjust homeagbu
> > stderr: DEBUG: homeagbu: Exit code 0
> > stderr: DEBUG: homeagbu: Command output:
> > stderr: DEBUG: homeagbu: Calling /usr/sbin/crm_master -Q -l reboot -v 10000
> > stderr: DEBUG: homeagbu: Exit code 0
> > stderr: DEBUG: homeagbu: Command output:
>[root@hans0 log]# drbd-overview
> 1:homeagbu/0 Connected Secondary/Secondary UpToDate/UpToDate
> 2:backdesk/0 Connected Primary/Secondary UpToDate/UpToDate
> 3:genomics/0 Connected Primary/Secondary UpToDate/UpToDate
> 4:backserv/0 Connected Primary/Secondary UpToDate/UpToDate
> 5:agbudata/0 Connected Primary/Secondary UpToDate/UpToDate
>[root@hans0 log]# pcs resource debug-start drbd_homeagbu
>Operation start for drbd_homeagbu:0 (ocf:linbit:drbd) returned 0
> > stdout:
> > stdout:
> > stderr: WARNING: You may be disappointed: This RA is intended for pacemaker 1.0 or better!
> > stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf adjust homeagbu
> > stderr: DEBUG: homeagbu: Exit code 0
> > stderr: DEBUG: homeagbu: Command output:
> > stderr: DEBUG: homeagbu: Calling /usr/sbin/crm_master -Q -l reboot -v 10000
> > stderr: DEBUG: homeagbu: Exit code 0
> > stderr: DEBUG: homeagbu: Command output:
>[root@hans0 log]# less -S pacemaker.log
>[root@hans0 log]# drbd-overview
> 1:homeagbu/0 Connected Secondary/Secondary UpToDate/UpToDate
> 2:backdesk/0 Connected Primary/Secondary UpToDate/UpToDate
> 3:genomics/0 Connected Primary/Secondary UpToDate/UpToDate
> 4:backserv/0 Connected Primary/Secondary UpToDate/UpToDate
> 5:agbudata/0 Connected Primary/Secondary UpToDate/UpToDate
>
Have you managed to fix this issue? I have exactly the same problem as
you and I can't find a solution.
cheers,
Steve