>From: drbd-user-bounces at lists.linbit.com [mailto:drbd-user-bounces at lists.linbit.com] On Behalf Of Lars Ellenberg
>On Wed, Sep 24, 2014 at 11:31:58PM +1000, Klint Gore wrote:
>> Looks like it exists. Same file exists on both nodes (md5 matches).
>> Is there a way to tell what version it is? Should there be other
>> files as well?
>
>rpm -qf /usr/lib/ocf/resource.d/linbit/drbd

[root@hans0 log]# rpm -qf /usr/lib/ocf/resource.d/linbit/drbd
drbd84-utils-8.9.1-1.el7.elrepo.x86_64

> do the resources listed by "drbdadm dump" match the resource names used in the pacemaker configuration?

yes

> do you get something different for "drbdadm -c /etc/drbd.conf dump"?

They're the same

[root@hans0 tmp]# drbdadm dump >admdump
[root@hans0 tmp]# drbdadm -c /etc/drbd.conf dump >confdump
[root@hans0 tmp]# ll *dump
-rw-r--r--. 1 root root 3256 Sep 25 09:30 admdump
-rw-r--r--. 1 root root 3256 Sep 25 09:30 confdump
[root@hans0 tmp]# diff admdump confdump
[root@hans0 tmp]#

While trying things yesterday, I seem to have changed something, because there is now a new error in the log:

Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ Could not establish cib_rw connection: Permission denied (13) ]
Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ Error signing on to the CIB service: Transport endpoint is not connected ]
Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: info: log_finished: finished - rsc:drbd_homeagbu action:notify call_id:117 pid:17296 exit-code:5 exec-time:15058ms queue-time:0ms

Line 226 in ocf-shellfuncs is in function ha_log and says

    222  if
    223    [ -n "$HA_LOGFILE" ]
    224  then
    225    : appending to $HA_LOGFILE
    226    echo "$HA_LOGTAG: "`hadate`"${*}" >> $HA_LOGFILE
    227  fi

The permissions on /usr/lib/ocf/lib/heartbeat/* were all 644, so I changed them to 755 (owner and group are root).
The permissions on /var/log/pacemaker.log are 660, owner is hacluster, group is haclient. I changed that to 666, but it doesn't seem to help.

And now it's giving me the "not installed" message instead of "not configured".

Sep 25 14:26:03 [19406] hans0.une.edu.au pengine: error: unpack_rsc_op: No further recovery can be attempted for drbd_homeagbu:0: stop action failed with 'not installed' (5)
Sep 25 14:26:03 [19406] hans0.une.edu.au pengine: notice: unpack_rsc_op: Preventing master_drbd from re-starting on hans0: operation stop failed 'not installed' (rc=5)
Sep 25 14:26:03 [19406] hans0.une.edu.au pengine: warning: unpack_rsc_op: Processing failed op stop for drbd_homeagbu:0 on hans0: not installed (5)

Attempting a manual start on one of the drbd resources:

[root@hans0 log]# pcs resource debug-start drbd_homeagbu
Operation start for drbd_homeagbu:0 (ocf:linbit:drbd) returned 0
 > stdout:
 > stdout:
 > stdout:
 > stderr: WARNING: You may be disappointed: This RA is intended for pacemaker 1.0 or better!
 > stderr: WARNING: homeagbu already Primary, demoting.
 > stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf secondary homeagbu
 > stderr: DEBUG: homeagbu: Exit code 0
 > stderr: DEBUG: homeagbu: Command output:
 > stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf adjust homeagbu
 > stderr: DEBUG: homeagbu: Exit code 0
 > stderr: DEBUG: homeagbu: Command output:
 > stderr: DEBUG: homeagbu: Calling /usr/sbin/crm_master -Q -l reboot -v 10000
 > stderr: DEBUG: homeagbu: Exit code 0
 > stderr: DEBUG: homeagbu: Command output:
[root@hans0 log]# drbd-overview
 1:homeagbu/0  Connected Secondary/Secondary UpToDate/UpToDate
 2:backdesk/0  Connected Primary/Secondary   UpToDate/UpToDate
 3:genomics/0  Connected Primary/Secondary   UpToDate/UpToDate
 4:backserv/0  Connected Primary/Secondary   UpToDate/UpToDate
 5:agbudata/0  Connected Primary/Secondary   UpToDate/UpToDate
[root@hans0 log]# pcs resource debug-start drbd_homeagbu
Operation start for drbd_homeagbu:0 (ocf:linbit:drbd) returned 0
 > stdout:
 > stdout:
 > stderr: WARNING: You may be disappointed: This RA is intended for pacemaker 1.0 or better!
 > stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf adjust homeagbu
 > stderr: DEBUG: homeagbu: Exit code 0
 > stderr: DEBUG: homeagbu: Command output:
 > stderr: DEBUG: homeagbu: Calling /usr/sbin/crm_master -Q -l reboot -v 10000
 > stderr: DEBUG: homeagbu: Exit code 0
 > stderr: DEBUG: homeagbu: Command output:
[root@hans0 log]# less -S pacemaker.log
[root@hans0 log]# drbd-overview
 1:homeagbu/0  Connected Secondary/Secondary UpToDate/UpToDate
 2:backdesk/0  Connected Primary/Secondary   UpToDate/UpToDate
 3:genomics/0  Connected Primary/Secondary   UpToDate/UpToDate
 4:backserv/0  Connected Primary/Secondary   UpToDate/UpToDate
 5:agbudata/0  Connected Primary/Secondary   UpToDate/UpToDate
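
The permission checks described above for /var/log/pacemaker.log could be repeated with something like the following. This is only a rough sketch: log_perms and can_append are hypothetical helper names, not part of Pacemaker or the resource agents, and the stat format string assumes GNU coreutils.

```shell
#!/bin/sh
# Hypothetical helpers for inspecting a log file; not part of Pacemaker
# or the OCF resource agents.

# Print "<octal mode> <owner> <group>" for a file (GNU stat syntax).
log_perms() {
    stat -c '%a %U %G' "$1"
}

# Return 0 if the calling user may write to the file, non-zero otherwise.
# Note: root passes this check regardless of the mode bits.
can_append() {
    [ -w "$1" ]
}

# Example use against the log file named in the error message:
#   logfile=/var/log/pacemaker.log
#   log_perms "$logfile"
#   can_append "$logfile" || echo "cannot append to $logfile as $(id -un)"
```

The result only describes the user who runs the check, so running it from a root shell may not say much about what the resource agent itself is allowed to do.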