[DRBD-user] operation monitor failed 'not configured' - how to tell what's not configured?

Steve steeeeeveee at gmx.net
Wed Oct 1 23:10:49 CEST 2014



 >>From: drbd-user-bounces at lists.linbit.com [mailto:drbd-user-bounces at lists.linbit.com] On Behalf Of Lars Ellenberg
 >>On Wed, Sep 24, 2014 at 11:31:58PM +1000, Klint Gore wrote:
 >>> Looks like it exists.  Same file exists on both nodes (md5 matches).
 >>> Is there a way to tell what version it is? Should there be other
 >>> files as well?
 >>
 >>rpm -qf /usr/lib/ocf/resource.d/linbit/drbd
 >
 >[root at hans0 log]# rpm -qf /usr/lib/ocf/resource.d/linbit/drbd
 >drbd84-utils-8.9.1-1.el7.elrepo.x86_64
 >
 >> do the resources listed by "drbdadm dump" match the resource names used in the pacemaker configuration?
 >
 >yes
 >
 >> do you get something different for "drbdadm -c /etc/drbd.conf dump"?
 >
 >They're the same
 >
 >[root at hans0 tmp]# drbdadm dump >admdump
 >[root at hans0 tmp]# drbdadm -c /etc/drbd.conf dump >confdump
 >[root at hans0 tmp]# ll *dump
 >-rw-r--r--. 1 root root 3256 Sep 25 09:30 admdump
 >-rw-r--r--. 1 root root 3256 Sep 25 09:30 confdump
 >[root at hans0 tmp]# diff admdump confdump
 >[root at hans0 tmp]#
 >
 >
 >In trying things yesterday, I seem to have changed something, because
 >there is now a new error in the log:
 >
 >Sep 25 09:48:06 [14956] hans0.une.edu.au       lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
 >Sep 25 09:48:06 [14956] hans0.une.edu.au       lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ Could not establish cib_rw connection: Permission denied (13) ]
 >Sep 25 09:48:06 [14956] hans0.une.edu.au       lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ Error signing on to the CIB service: Transport endpoint is not connected ]
 >Sep 25 09:48:06 [14956] hans0.une.edu.au       lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
 >Sep 25 09:48:06 [14956] hans0.une.edu.au       lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
 >Sep 25 09:48:06 [14956] hans0.une.edu.au       lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ]
 >Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd:     info: log_finished:    finished - rsc:drbd_homeagbu action:notify call_id:117 pid:17296 exit-code:5 exec-time:15058ms queue-time:0ms
 >
 >Line 226 in ocf-shellfuncs is in the function ha_log and says:
 >    222         if
 >    223           [ -n "$HA_LOGFILE" ]
 >    224         then
 >    225           : appending to $HA_LOGFILE
 >    226           echo "$HA_LOGTAG:     "`hadate`"${*}" >> $HA_LOGFILE
 >    227         fi
 >
 >The permissions on /usr/lib/ocf/lib/heartbeat/* were all 644, so I
 >changed them to 755 (owner and group are root). The permissions on
 >/var/log/pacemaker.log are 660; owner is hacluster, group is haclient.
 >I changed that to 666, but it doesn't seem to help.
 >
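The permission checks described above can be repeated in one go with
something like the following. This is only a rough sketch: the function
names show_path_perms and can_append are made up for illustration, and it
assumes GNU stat (coreutils), as shipped on EL7. It only looks at ordinary
file modes and ownership, nothing else.

```shell
# Hypothetical helper: print mode, owner:group and name for a file and
# for every parent directory above it (stops before "/").
show_path_perms() {
    p=$1
    while [ -n "$p" ] && [ "$p" != "/" ] && [ "$p" != "." ]; do
        stat -c '%a %U:%G %n' "$p"
        p=$(dirname "$p")
    done
}

# Hypothetical helper: succeed only if the path is a regular file that
# the current user is allowed to write to.
can_append() {
    [ -f "$1" ] && [ -w "$1" ]
}
```

For example, "show_path_perms /var/log/pacemaker.log" lists the log file
and then /var/log and /var, so a restrictive parent directory shows up as
well. Note that when run as root, can_append will normally succeed
regardless of the mode bits.
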
 >And now it's giving me the 'not installed' message instead of
 >'not configured'.
 >Sep 25 14:26:03 [19406] hans0.une.edu.au    pengine: error: unpack_rsc_op:   No further recovery can be attempted for drbd_homeagbu:0: stop action failed with 'not installed' (5)
 >Sep 25 14:26:03 [19406] hans0.une.edu.au    pengine: notice: unpack_rsc_op:   Preventing master_drbd from re-starting on hans0: operation stop failed 'not installed' (rc=5)
 >Sep 25 14:26:03 [19406] hans0.une.edu.au    pengine: warning: unpack_rsc_op:   Processing failed op stop for drbd_homeagbu:0 on hans0: not installed (5)
 >
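For reference, the numbers in these messages are the standard OCF resource
agent exit codes: 5 is "not installed" and 6 is "not configured". A small
lookup like the one below can help when reading exit-code values in
pacemaker.log. The function name ocf_rc_name is made up for this sketch;
the code-to-name mapping is the usual OCF one.

```shell
# Hypothetical helper: map an OCF resource agent exit code to its
# symbolic name.
ocf_rc_name() {
    case "$1" in
        0) echo "OCF_SUCCESS" ;;
        1) echo "OCF_ERR_GENERIC" ;;
        2) echo "OCF_ERR_ARGS" ;;
        3) echo "OCF_ERR_UNIMPLEMENTED" ;;
        4) echo "OCF_ERR_PERM" ;;
        5) echo "OCF_ERR_INSTALLED" ;;    # reported as 'not installed'
        6) echo "OCF_ERR_CONFIGURED" ;;   # reported as 'not configured'
        7) echo "OCF_NOT_RUNNING" ;;
        8) echo "OCF_RUNNING_MASTER" ;;
        9) echo "OCF_FAILED_MASTER" ;;
        *) echo "unknown ($1)" ;;
    esac
}
```

With this, "ocf_rc_name 5" prints OCF_ERR_INSTALLED, which matches the
exit-code:5 from the notify operation earlier in the log.
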
 >
 >Attempting manual start on one of the drbd resources
 >[root at hans0 log]# pcs resource debug-start drbd_homeagbu
 >Operation start for drbd_homeagbu:0 (ocf:linbit:drbd) returned 0
 > >  stdout:
 > >  stdout:
 > >  stdout:
 > >  stderr: WARNING: You may be disappointed: This RA is intended for pacemaker 1.0 or better!
 > >  stderr: WARNING: homeagbu already Primary, demoting.
 > >  stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf secondary homeagbu
 > >  stderr: DEBUG: homeagbu: Exit code 0
 > >  stderr: DEBUG: homeagbu: Command output:
 > >  stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf adjust homeagbu
 > >  stderr: DEBUG: homeagbu: Exit code 0
 > >  stderr: DEBUG: homeagbu: Command output:
 > >  stderr: DEBUG: homeagbu: Calling /usr/sbin/crm_master -Q -l reboot -v 10000
 > >  stderr: DEBUG: homeagbu: Exit code 0
 > >  stderr: DEBUG: homeagbu: Command output:
 >[root at hans0 log]# drbd-overview
 > 1:homeagbu/0  Connected Secondary/Secondary UpToDate/UpToDate
 > 2:backdesk/0  Connected Primary/Secondary UpToDate/UpToDate
 > 3:genomics/0  Connected Primary/Secondary UpToDate/UpToDate
 > 4:backserv/0  Connected Primary/Secondary UpToDate/UpToDate
 > 5:agbudata/0  Connected Primary/Secondary UpToDate/UpToDate
 >[root at hans0 log]# pcs resource debug-start drbd_homeagbu
 >Operation start for drbd_homeagbu:0 (ocf:linbit:drbd) returned 0
 > >  stdout:
 > >  stdout:
 > >  stderr: WARNING: You may be disappointed: This RA is intended for pacemaker 1.0 or better!
 > >  stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf adjust homeagbu
 > >  stderr: DEBUG: homeagbu: Exit code 0
 > >  stderr: DEBUG: homeagbu: Command output:
 > >  stderr: DEBUG: homeagbu: Calling /usr/sbin/crm_master -Q -l reboot -v 10000
 > >  stderr: DEBUG: homeagbu: Exit code 0
 > >  stderr: DEBUG: homeagbu: Command output:
 >[root at hans0 log]# less -S pacemaker.log
 >[root at hans0 log]# drbd-overview
 > 1:homeagbu/0  Connected Secondary/Secondary UpToDate/UpToDate
 > 2:backdesk/0  Connected Primary/Secondary UpToDate/UpToDate
 > 3:genomics/0  Connected Primary/Secondary UpToDate/UpToDate
 > 4:backserv/0  Connected Primary/Secondary UpToDate/UpToDate
 > 5:agbudata/0  Connected Primary/Secondary UpToDate/UpToDate
 >
Have you managed to fix this issue? I have exactly the same problem as
you, and I can't find a solution.


cheers,

Steve
