Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
For reference: the DRBD resource configuration files... > san1:~ # cat /etc/drbd.d/r*.res > resource r0 { > volume 0 { > device /dev/drbd0 ; > disk /dev/sda7 ; > meta-disk internal ; > } > volume 1 { > device /dev/drbd1 ; > disk /dev/sda8 ; > meta-disk internal ; > } > volume 2 { > device /dev/drbd2 ; > disk /dev/sda9 ; > meta-disk internal ; > } > volume 3 { > device /dev/drbd3 ; > disk /dev/sda10 ; > meta-disk internal ; > } > on san1 { > address 192.168.1.1:7789 ; > } > on san2 { > address 192.168.1.2:7789 ; > } > } > > resource r1 { > volume 0 { > device /dev/drbd1000 ; > disk /dev/sdb7 ; > meta-disk internal ; > } > volume 1 { > device /dev/drbd1001 ; > disk /dev/sdb8 ; > meta-disk internal ; > } > volume 2 { > device /dev/drbd1002 ; > disk /dev/sdb9 ; > meta-disk internal ; > } > volume 3 { > device /dev/drbd1003 ; > disk /dev/sdb10 ; > meta-disk internal ; > } > on san1 { > address 192.168.1.1:7790 ; > } > on san2 { > address 192.168.1.2:7790 ; > } > } TIA, Eric Pretorious Truckee, cA >________________________________ > From: Eric <epretorious at yahoo.com> >To: "drbd-user at lists.linbit.com" <drbd-user at lists.linbit.com> >Sent: Sunday, January 20, 2013 10:24 PM >Subject: Diagnosing a Failed Resource > > >I've configured corosync+pacemaker to managee a simple two-resource DRBD cluster: > > >> san1:~ # crm configure show | cat - >> node san1 \ >> attributes standby="off" >> node san2 \ >> attributes standby="off" >> primitive p_DRBD-r0 ocf:linbit:drbd \ >> params drbd_resource="r0" \ >> op monitor interval="60s" >> primitive p_DRBD-r1 ocf:linbit:drbd \ >> params drbd_resource="r1" \ >> op monitor interval="60s" >> primitive p_IP-1_253 ocf:heartbeat:IPaddr2 \ >> params ip="192.168.1.253" cidr_netmask="24" \ >> op monitor interval="30s" >> primitive p_IP-1_254 ocf:heartbeat:IPaddr2 \ >> params ip="192.168.1.254" cidr_netmask="24" \ >> op monitor interval="30s" >> primitive p_iSCSI-san1 ocf:heartbeat:iSCSITarget \ >> params iqn="iqn.2012-11.com.example.san1:sda" \ >> op monitor interval="10s" >> primitive p_iSCSI-san1_0 ocf:heartbeat:iSCSILogicalUnit \ >> params target_iqn="iqn.2012-11.com.example.san1:sda" lun="0" path="/dev/drbd0" \ >> op monitor interval="10s" >> primitive p_iSCSI-san1_1 ocf:heartbeat:iSCSILogicalUnit \ >> params target_iqn="iqn.2012-11.com.example.san1:sda" lun="1" path="/dev/drbd1" \ >> op monitor interval="10s" >> primitive p_iSCSI-san1_2 ocf:heartbeat:iSCSILogicalUnit \ >> params target_iqn="iqn.2012-11.com.example.san1:sda" lun="2" path="/dev/drbd2" \ >> op monitor interval="10s" >> primitive p_iSCSI-san1_3 ocf:heartbeat:iSCSILogicalUnit \ >> params target_iqn="iqn.2012-11.com.example.san1:sda" lun="3" path="/dev/drbd3" \ >> op monitor interval="10s" >> primitive p_iSCSI-san2 ocf:heartbeat:iSCSITarget \ >> params iqn="iqn.2012-11.com.example.san2:sda" \ >> op monitor interval="10s" >> primitive p_iSCSI-san2_0 ocf:heartbeat:iSCSILogicalUnit \ >> params target_iqn="iqn.2012-11.com.example.san2:sda" lun="0" path="/dev/drbd1000" \ >> op monitor interval="10s" >> primitive p_iSCSI-san2_1 ocf:heartbeat:iSCSILogicalUnit \ >> params target_iqn="iqn.2012-11.com.example.san2:sda" lun="1" path="/dev/drbd1001" \ >> op monitor interval="10s" >> primitive p_iSCSI-san2_2 ocf:heartbeat:iSCSILogicalUnit \ >> params target_iqn="iqn.2012-11.com.example.san2:sda" lun="2" path="/dev/drbd1002" \ >> op monitor interval="10s" >> primitive p_iSCSI-san2_3 ocf:heartbeat:iSCSILogicalUnit \ >> params target_iqn="iqn.2012-11.com.example.san2:sda" lun="3" path="/dev/drbd1003" \ >> op monitor interval="10s" >> group g_iSCSI-san1 p_iSCSI-san1 p_iSCSI-san1_0 p_iSCSI-san1_1 p_iSCSI-san1_2 p_iSCSI-san1_3 p_IP-1_254 >> group g_iSCSI-san2 p_iSCSI-san2 p_iSCSI-san2_0 p_iSCSI-san2_1 p_iSCSI-san2_2 p_iSCSI-san2_3 p_IP-1_253 >> ms ms_DRBD-r0 p_DRBD-r0 \ >> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true" >> ms ms_DRBD-r1 p_DRBD-r1 \ >> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true" >> location l_iSCSI-san1_and_DRBD-r0 p_IP-1_254 10240: san1 >> location l_iSCSI-san2_and_DRBD-r1 p_IP-1_253 10240: san2 >> colocation c_iSCSI_with_DRBD-r0 inf: g_iSCSI-san1 ms_DRBD-r0:Master >> colocation c_iSCSI_with_DRBD-r1 inf: g_iSCSI-san2 ms_DRBD-r1:Master >> order o_DRBD-r0_before_iSCSI-san1 inf: ms_DRBD-r0:promote g_iSCSI-san1:start >> order o_DRBD-r1_before_iSCSI-san2 inf: ms_DRBD-r1:promote g_iSCSI-san2:start >> property $id="cib-bootstrap-options" \ >> dc-version="1.1.7-77eeb099a504ceda05d648ed161ef8b1582c7daf" \ >> cluster-infrastructure="openais" \ >> expected-quorum-votes="2" \ >> stonith-enabled="false" \ >> no-quorum-policy="ignore" > > > >The cluster appears to be functioning correctly: > > > >> san1:~ # crm_mon -1 >> ============ >> Last updated: Sun Jan 20 22:20:17 2013 >> Last change: Sun Jan 20 21:59:15 2013 by root via crm_attribute on san1 >> Stack: openais >> Current DC: san1 - partition with quorum >> Version: 1.1.7-77eeb099a504ceda05d648ed161ef8b1582c7daf >> 2 Nodes configured, 2 expected votes >> 16 Resources configured. >> ============ >> >> Online: [ san1 san2 ] >> >> Master/Slave Set: ms_DRBD-r0 [p_DRBD-r0] >> Masters: [ san1 ] >> Slaves: [ san2 ] >> Resource Group: g_iSCSI-san1 >> p_iSCSI-san1 (ocf::heartbeat:iSCSITarget): Started san1 >> p_iSCSI-san1_0 (ocf::heartbeat:iSCSILogicalUnit): Started san1 >> p_iSCSI-san1_1 (ocf::heartbeat:iSCSILogicalUnit): Started san1 >> p_iSCSI-san1_2 (ocf::heartbeat:iSCSILogicalUnit): Started san1 >> p_iSCSI-san1_3 (ocf::heartbeat:iSCSILogicalUnit): Started san1 >> p_IP-1_254 (ocf::heartbeat:IPaddr2): Started san1 >> Master/Slave Set: ms_DRBD-r1 [p_DRBD-r1] >> Masters: [ san2 ] >> Slaves: [ san1 ] >> Resource Group: g_iSCSI-san2 >> p_iSCSI-san2 (ocf::heartbeat:iSCSITarget): Started san2 >> p_iSCSI-san2_0 (ocf::heartbeat:iSCSILogicalUnit): Started san2 >> p_iSCSI-san2_1 (ocf::heartbeat:iSCSILogicalUnit): Started san2 >> p_iSCSI-san2_2 (ocf::heartbeat:iSCSILogicalUnit): Started san2 >> p_iSCSI-san2_3 (ocf::heartbeat:iSCSILogicalUnit): Started san2 >> p_IP-1_253 (ocf::heartbeat:IPaddr2): Started san2 > >> san2:~ # crm_mon -1 >> ============ >> Last updated: Sun Jan 20 22:20:17 2013 >> Last change: Sun Jan 20 21:59:15 2013 by root via crm_attribute on san1 >> Stack: openais >> Current DC: san1 - partition with quorum >> Version: 1.1.7-77eeb099a504ceda05d648ed161ef8b1582c7daf >> 2 Nodes configured, 2 expected votes >> 16 Resources configured. >> ============ >> >> Online: [ san1 san2 ] >> >> Master/Slave Set: ms_DRBD-r0 [p_DRBD-r0] >> Masters: [ san1 ] >> Slaves: [ san2 ] >> Resource Group: g_iSCSI-san1 >> p_iSCSI-san1 (ocf::heartbeat:iSCSITarget): Started san1 >> p_iSCSI-san1_0 (ocf::heartbeat:iSCSILogicalUnit): Started san1 >> p_iSCSI-san1_1 (ocf::heartbeat:iSCSILogicalUnit): Started san1 >> p_iSCSI-san1_2 (ocf::heartbeat:iSCSILogicalUnit): Started san1 >> p_iSCSI-san1_3 (ocf::heartbeat:iSCSILogicalUnit): Started san1 >> p_IP-1_254 (ocf::heartbeat:IPaddr2): Started san1 >> Master/Slave Set: ms_DRBD-r1 [p_DRBD-r1] >> Masters: [ san2 ] >> Slaves: [ san1 ] >> Resource Group: g_iSCSI-san2 >> p_iSCSI-san2 (ocf::heartbeat:iSCSITarget): Started san2 >> p_iSCSI-san2_0 (ocf::heartbeat:iSCSILogicalUnit): Started san2 >> p_iSCSI-san2_1 (ocf::heartbeat:iSCSILogicalUnit): Started san2 >> p_iSCSI-san2_2 (ocf::heartbeat:iSCSILogicalUnit): Started san2 >> p_iSCSI-san2_3 (ocf::heartbeat:iSCSILogicalUnit): Started san2 >> p_IP-1_253 (ocf::heartbeat:IPaddr2): Started san2 > > >However, the two DRBD resources do not appear to be communicating: > > >> san1:~ # cat /proc/drbd >> version: 8.4.1 (api:1/proto:86-100) >> GIT-hash: 91b4c048c1a0e06777b5f65d312b38d47abaea80 build by phil at fat-tyre, 2011-12-20 12:43:15 >> 0: cs:StandAlone ro:Primary/Unknown ds:UpToDate/DUnknown r----- >> ns:0 nr:0 dw:0 dr:664 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:3259080 >> 1: cs:StandAlone ro:Primary/Unknown ds:UpToDate/DUnknown r----- >> ns:0 nr:0 dw:0 dr:664 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 >> 2: cs:StandAlone ro:Primary/Unknown ds:UpToDate/DUnknown r----- >> ns:0 nr:0 dw:0 dr:664 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 >> 3: cs:StandAlone ro:Primary/Unknown ds:UpToDate/DUnknown r----- >> ns:0 nr:0 dw:0 dr:664 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 >> >> 1000: cs:StandAlone ro:Secondary/Unknown ds:UpToDate/DUnknown r----- >> ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 >> 1001: cs:StandAlone ro:Secondary/Unknown ds:UpToDate/DUnknown r----- >> ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 >> 1002: cs:StandAlone ro:Secondary/Unknown ds:UpToDate/DUnknown r----- >> ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 >> 1003: cs:StandAlone ro:Secondary/Unknown ds:UpToDate/DUnknown r----- >> ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 > > >> san2:~ # cat /proc/drbd >> version: 8.4.1 (api:1/proto:86-100) >> GIT-hash: 91b4c048c1a0e06777b5f65d312b38d47abaea80 build by phil at fat-tyre, 2011-12-20 12:43:15 >> 0: cs:WFConnection ro:Secondary/Unknown ds:UpToDate/DUnknown C r----- >> ns:0 nr:0 dw:0 dr:664 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:140 >> 1: cs:WFConnection ro:Secondary/Unknown ds:UpToDate/DUnknown C r----- >> ns:0 nr:0 dw:0 dr:664 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 >> 2: cs:WFConnection ro:Secondary/Unknown ds:UpToDate/DUnknown C r----- >> ns:0 nr:0 dw:0 dr:664 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 >> 3: cs:WFConnection ro:Secondary/Unknown ds:UpToDate/DUnknown C r----- >> ns:0 nr:0 dw:0 dr:664 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 >> >> 1000: cs:WFConnection ro:Primary/Unknown ds:UpToDate/DUnknown C r----- >> ns:0 nr:0 dw:0 dr:664 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 >> 1001: cs:WFConnection ro:Primary/Unknown ds:UpToDate/DUnknown C r----- >> ns:0 nr:0 dw:0 dr:664 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 >> 1002: cs:WFConnection ro:Primary/Unknown ds:UpToDate/DUnknown C r----- >> ns:0 nr:0 dw:0 dr:664 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 >> 1003: cs:WFConnection ro:Primary/Unknown ds:UpToDate/DUnknown C r----- >> ns:0 nr:0 dw:0 dr:664 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0 > > >How can I begin to troubleshoot this error? > > >Eric Pretorious >Truckee, cA > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20130120/ee3bbfbc/attachment.htm>