[DRBD-user] Newbie: IDE Flush, system halt when drbdadm Primary all

Jason W. Allen jallen at mpgis.net
Thu Aug 4 18:25:27 CEST 2005



Thanks for the help.  I changed my drbd.conf: I commented out the
incon-degr-cmd line, and changed the meta-disks to /dev/sda3 and /dev/hda3
respectively.  Then I ran /etc/init.d/drbd start on both systems, followed
by drbdadm -- --do-what-I-say primary all.  Everything started fine.  But I'm
back to the original problem: I'm only getting 4K/sec for the sync.

So I enabled the second network cards on both machines and connected them
directly with a crossover cable.  Still the same problem:

/# cat /proc/drbd
version 0.7.10 (api:77/proto:74)
SVN Revision: 1743 build by phil at mescak, 2005-01-31 12:22:07
 0: cs:SyncSource st:Primary/Secondary Id:Consistant
    ns:3600 nr:0 dw:0 dr:3624 al:0 bm:0 lo:6 pe:6 ua:6 ap:0
        [>...................] sync'ed:  0.2% (4765/4769)M
        finish: 27:06:43 speed: 4 (4) K/sec
 1: cs:Unconfigured
 2: cs:Unconfigured
 3: cs:Unconfigured
 4: cs:Unconfigured
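
For comparing the rate across different cards or cables, the two progress
lines above can be pulled apart with a short script.  This is only a minimal
sketch and not part of DRBD: the function name parse_sync_status and the
dictionary keys are made up for illustration, and it assumes that the pair
in parentheses is remaining/total in MB and that the two speed figures are
the current value and the average in parentheses.

```python
import re

# Sample text copied from the /proc/drbd output above.
SAMPLE = """\
 0: cs:SyncSource st:Primary/Secondary Id:Consistant
    ns:3600 nr:0 dw:0 dr:3624 al:0 bm:0 lo:6 pe:6 ua:6 ap:0
        [>...................] sync'ed:  0.2% (4765/4769)M
        finish: 27:06:43 speed: 4 (4) K/sec
"""

# Assumption: "(a/b)M" means a MB remaining out of b MB total.
SYNC_RE = re.compile(r"sync'ed:\s*([\d.]+)%\s*\((\d+)/(\d+)\)M")
# Assumption: "speed: x (y) K/sec" is current speed, then average speed.
SPEED_RE = re.compile(
    r"finish:\s*(\S+)\s+speed:\s*([\d,]+)\s*\(([\d,]+)\)\s*K/sec"
)


def parse_sync_status(text):
    """Return the sync progress fields as a dict, or None if not syncing."""
    sync = SYNC_RE.search(text)
    speed = SPEED_RE.search(text)
    if not sync or not speed:
        return None
    return {
        "percent": float(sync.group(1)),
        "left_mb": int(sync.group(2)),
        "total_mb": int(sync.group(3)),
        "finish": speed.group(1),
        # Strip thousands separators in case larger values are printed with them.
        "speed_kps": int(speed.group(2).replace(",", "")),
        "avg_speed_kps": int(speed.group(3).replace(",", "")),
    }


if __name__ == "__main__":
    print(parse_sync_status(SAMPLE))
```

On a live system the text would come from reading /proc/drbd instead of the
SAMPLE string; logging the result every few seconds makes it easier to see
whether a change has any effect on the rate.
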

Is there something wrong with my config?  Or is this hardware related?  I
get the same numbers from the other network cards.

--Jason W. Allen

-----Original Message-----
From: drbd-user-bounces at lists.linbit.com
[mailto:drbd-user-bounces at lists.linbit.com] On Behalf Of Lars Ellenberg
Sent: Friday, July 29, 2005 2:11 PM
To: drbd-user at lists.linbit.com
Subject: Re: [DRBD-user] Newbie: IDE Flush, system halt when drbdadm Primary all


/ 2005-07-29 13:17:33 -0400
\ Jason W. Allen:
> I've finally seen the error of my ways, so after 15 years of being a Windows
> Admin, I've been seduced by the light side...
>
> I've only been working with Linux for a week now, so please bear with me.
>
> I have a lab setup of two virgin Debian Sarge systems.  I believe I was
> successful in getting drbd to load and run yesterday, but it was only
> running 4 K/s in the sync and only ever got to .6% done (even overnight).
> So I decided to rebuild my computers from the ground up and see if that
> changed anything.  On the computer I want as the primary, when I run
> 'drbdadm primary all' it returns:
>
> ioctl(,SET_STATE,) failed: Input/Output error
> Local replica is inconsistent (--do-what-I-say ?)
> Command '/sbin/drbdsetup /dev/drbd0 primary' terminated with exit code 21
> flushing ide devices: hda
> System halted.
>
> Message from syslod2 at localhost at Fri Jul 29 16:49:40 2005 ...
> localhost kernel: System halted.
> test1/etc/network# hub.c: new USB device 00:1d.1-1, assigned address 2
> hub.c: USB hub found
> hub.c: 2 ports detected
> drbd0: Ping Ack did not arrive in time.
> drbd0: drbd0_asender [3866]: cstate Connected --> NetworkFailure
> drbd0: asender terminated
> drbd0: drbd0_receiver [3859]: cstate NetworkFailure --> BrokenPipe
> drbd0: short read expecting header on sock: r=-512
> drbd0: worker terminated
> drbd0: drbd0_receiver [3859]: cstate BrokenPipe --> Unconnected
> drbd0: Connection Lost
> drbd0: drbd0_receiver [3859]: cstate UnConnected --> WFConnection
> hub.c: error resetting hub 2 - disconnecting
> usb.c: USB disconnect on device 00:1d.7-3 address 2
>
> At this point the network no longer works etc.
>
> I thought it may be a fluke, so I reinstalled the entire system from scratch
> and I get the exact same problem.  I have outlined the below steps, because
> I really have no idea where to start looking, since I'm not quite sure what
> I have done so far :-)
>
> 1. Build System from Debian Sarge iso Disk 1
> 2. # apt-get install vim ssh module-assistant dpatch kernel-headers-2.4.27-2
>    kernel-headers-2.4.27-2-386 drbd0.7-module-source drbd0.7-utils
> 3. edit /etc/network/interfaces to assign static IP to my NIC
> 4. # m-a prepare
> 5. # m-a a-i drbd0.7-module-source
> 6. edit /etc/drbd.conf:
>
> "    global {
>         minor-count 5
>       }
>
>       resource r0 {
>         protocol C;
>         incon-degr-cmd "halt -f";
                          ^^^^^^^

I'd recommend commenting out this line until you have it set up and
running reliably, or setting it to something like
"echo drbd: incon-degr-cmd: would have done halt -f | wall"
(just in case:
 wall is not built from bricks, nor does it have windows,
 and it does not make music either.
 it is "write a message to all (currently logged in (pseudo) terminal) users"
)

this command is intended to stop a stupid cluster manager from trying to
make an inconsistent replica of a degraded cluster primary.

when the peers have never seen each other before, and have never been primary
either, then they are in exactly this state,
and your attempt to promote one to primary goes directly into this
callback (and, as configured, halts the machine).

doh.

>         on test1 {
>           device /dev/drbd0;
>           disk /dev/sda2;
>           address 192.168.1.39:7789;
>           meta-disk internal;

did you know that we strongly recommend external meta data?

> 7. # scp /etc/drbd.conf 192.168.1.40:/etc/drbd.conf
> 8. # /etc/init.d/drbd start

  * go to other box, drbd start there, too.

> 9. # drbdadm primary all

no, but drbdadm -- --do-what-I-say primary all
(and you should be pretty sure that you mean what you say).
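
The two configuration remarks above (a halting incon-degr-cmd and internal
meta data) can be checked mechanically before the first promotion.  The
following is a minimal sketch, not a DRBD tool: check_drbd_conf is a
hypothetical helper, the sample text is the quoted config fragment with the
closing braces added, and comment handling is simplified (everything after
a '#' is dropped, even inside quotes).

```python
import re

# The config fragment quoted in this thread; closing braces added here.
SAMPLE_CONF = """\
resource r0 {
  protocol C;
  incon-degr-cmd "halt -f";
  on test1 {
    device /dev/drbd0;
    disk /dev/sda2;
    address 192.168.1.39:7789;
    meta-disk internal;
  }
}
"""

CMD_RE = re.compile(r'incon-degr-cmd\s+"([^"]*)"')
META_RE = re.compile(r"meta-disk\s+internal\b")


def check_drbd_conf(text):
    """Return (line number, message) pairs for the two settings to review."""
    warnings = []
    for lineno, raw in enumerate(text.splitlines(), start=1):
        # Simplification: treat everything after '#' as a comment.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        cmd = CMD_RE.match(line)
        if cmd:
            words = cmd.group(1).split()
            # Only flag commands that actually run halt, not ones that
            # merely mention it (such as the echo ... | wall variant).
            if words and words[0].rsplit("/", 1)[-1] == "halt":
                warnings.append((lineno, "incon-degr-cmd halts the machine"))
        if META_RE.match(line):
            warnings.append((lineno, "internal meta data in use"))
    return warnings


if __name__ == "__main__":
    for lineno, message in check_drbd_conf(SAMPLE_CONF):
        print("line %d: %s" % (lineno, message))
```

Run against the real file, the text would be read from /etc/drbd.conf on
each node; an empty result means neither of the two settings is active.
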

see also http://www.linux-ha.org/DRBD/QuickStart07

(btw, I personally dislike reiserfs,
 even though I used it in that example)


cheers, and have a nice weekend ...


--
: Lars Ellenberg                                  Tel +43-1-8178292-0  :
: LINBIT Information Technologies GmbH            Fax +43-1-8178292-82 :
: Schoenbrunner Str. 244, A-1120 Vienna/Europe   http://www.linbit.com :
__
please use the "List-Reply" function of your email client.