>>From: drbd-user-bounces at lists.linbit.com [mailto:drbd-user-bounces at lists.linbit.com] On Behalf Of Lars Ellenberg
>>On Wed, Sep 24, 2014 at 11:31:58PM +1000, Klint Gore wrote:
>>> Looks like it exists. Same file exists on both nodes (md5 matches).
>>> Is there a way to tell what version it is? Should there be other
>>> files as well?
>>
>>rpm -qf /usr/lib/ocf/resource.d/linbit/drbd
>
>[root at hans0 log]# rpm -qf /usr/lib/ocf/resource.d/linbit/drbd
>drbd84-utils-8.9.1-1.el7.elrepo.x86_64
>
>> do the resources listed by "drbdadm dump" match the resource names used in the pacemaker configuration?
>
>yes
>
>> do you get something different for "drbdadm -c /etc/drbd.conf dump"?
>
>They're the same
>
>[root at hans0 tmp]# drbdadm dump >admdump
>[root at hans0 tmp]# drbdadm -c /etc/drbd.conf dump >confdump
>[root at hans0 tmp]# ll *dump
>-rw-r--r--. 1 root root 3256 Sep 25 09:30 admdump
>-rw-r--r--. 1 root root 3256 Sep 25 09:30 confdump
>[root at hans0 tmp]# diff admdump confdump
>[root at hans0 tmp]#
>
>
>In trying stuff yesterday, I seem to have caused a change in something that there's new error in the log
>
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ Could not establish cib_rw connection: Permission denied (13) ]
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ Error signing on to the CIB service: Transport endpoint is not connected ]
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: info: log_finished: finished - rsc:drbd_homeagbu action:notify call_id:117 pid:17296 exit-code:5 exec-time:15058ms queue-time:0ms
>
>Line 226 in ocf-shellfunc is in function ha_log and says
> 222 if
> 223 [ -n "$HA_LOGFILE" ]
> 224 then
> 225 : appending to $HA_LOGFILE
> 226 echo "$HA_LOGTAG: "`hadate`"${*}" >> $HA_LOGFILE
> 227 fi
>
>The permissions on /usr/lib/ocf/lib/heartbeat/* were all 644 so I changed them to 755 (owner and group is root). The permission on /var/log/pacemaker.log is 660, owner is hacluster, group is haclient.
>I changed that to 666 but it doesn't seem to help.
>
>And now it's giving me the not installed message instead of the not configured.
>Sep 25 14:26:03 [19406] hans0.une.edu.au pengine: error: unpack_rsc_op: No further recovery can be attempted for drbd_homeagbu:0: stop action failed with 'not installed' (5)
>Sep 25 14:26:03 [19406] hans0.une.edu.au pengine: notice: unpack_rsc_op: Preventing master_drbd from re-starting on hans0: operation stop failed 'not installed' (rc=5)
>Sep 25 14:26:03 [19406] hans0.une.edu.au pengine: warning: unpack_rsc_op: Processing failed op stop for drbd_homeagbu:0 on hans0: not installed (5)
>
>
>Attempting manual start on one of the drbd resources
>[root at hans0 log]# pcs resource debug-start drbd_homeagbu
>Operation start for drbd_homeagbu:0 (ocf:linbit:drbd) returned 0
> > stdout:
> > stdout:
> > stdout:
> > stderr: WARNING: You may be disappointed: This RA is intended for pacemaker 1.0 or better!
> > stderr: WARNING: homeagbu already Primary, demoting.
> > stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf secondary homeagbu
> > stderr: DEBUG: homeagbu: Exit code 0
> > stderr: DEBUG: homeagbu: Command output:
> > stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf adjust homeagbu
> > stderr: DEBUG: homeagbu: Exit code 0
> > stderr: DEBUG: homeagbu: Command output:
> > stderr: DEBUG: homeagbu: Calling /usr/sbin/crm_master -Q -l reboot -v 10000
> > stderr: DEBUG: homeagbu: Exit code 0
> > stderr: DEBUG: homeagbu: Command output:
>[root at hans0 log]# drbd-overview
> 1:homeagbu/0 Connected Secondary/Secondary UpToDate/UpToDate
> 2:backdesk/0 Connected Primary/Secondary UpToDate/UpToDate
> 3:genomics/0 Connected Primary/Secondary UpToDate/UpToDate
> 4:backserv/0 Connected Primary/Secondary UpToDate/UpToDate
> 5:agbudata/0 Connected Primary/Secondary UpToDate/UpToDate
>[root at hans0 log]# pcs resource debug-start drbd_homeagbu
>Operation start for drbd_homeagbu:0 (ocf:linbit:drbd) returned 0
> > stdout:
> > stdout:
> > stderr: WARNING: You may be disappointed: This RA is intended for pacemaker 1.0 or better!
> > stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf adjust homeagbu
> > stderr: DEBUG: homeagbu: Exit code 0
> > stderr: DEBUG: homeagbu: Command output:
> > stderr: DEBUG: homeagbu: Calling /usr/sbin/crm_master -Q -l reboot -v 10000
> > stderr: DEBUG: homeagbu: Exit code 0
> > stderr: DEBUG: homeagbu: Command output:
>[root at hans0 log]# less -S pacemaker.log
>[root at hans0 log]# drbd-overview
> 1:homeagbu/0 Connected Secondary/Secondary UpToDate/UpToDate
> 2:backdesk/0 Connected Primary/Secondary UpToDate/UpToDate
> 3:genomics/0 Connected Primary/Secondary UpToDate/UpToDate
> 4:backserv/0 Connected Primary/Secondary UpToDate/UpToDate
> 5:agbudata/0 Connected Primary/Secondary UpToDate/UpToDate
>

Have you managed to fix this issue? I have exactly the same problem and can't find a solution.

cheers,
Steve
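
For anyone repeating the two checks quoted above (comparing the "drbdadm dump" outputs and looking at the mode and ownership of /var/log/pacemaker.log), here is a minimal sketch that wraps them in shell functions. The function names show_perms and same_dump are made up for illustration and are not part of DRBD or Pacemaker; the sketch assumes GNU stat, as shipped on EL7. It only reports what is there and does not explain the "Permission denied" itself.

```shell
#!/bin/sh
# Sketch only: helper names are hypothetical, not from DRBD or Pacemaker.

# Print octal mode, owner, group and name of a file (GNU stat format string).
show_perms() {
    stat -c '%a %U %G %n' "$1"
}

# Exit 0 if the two files are byte-identical, non-zero otherwise.
same_dump() {
    cmp -s "$1" "$2"
}

# Possible usage on a cluster node (paths taken from the thread):
#   drbdadm dump > admdump
#   drbdadm -c /etc/drbd.conf dump > confdump
#   same_dump admdump confdump && echo "dumps match"
#   show_perms /var/log/pacemaker.log
```

Using cmp -s instead of diff keeps the comparison silent, so the result can be used directly in a condition.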