[DRBD-user] Re: Huge max latency

Tue Jan 9 12:58:24 CET 2007

/ 2007-01-08 23:39:37 -0600
\ Weilin Gong:
> Hi Lars,
> 
> Thanks so much for your help.
> 
> >* try again with a higher numruns, to get better statistics
> >
> >* try again with 0.7.22
> >* try again with drbd 8 (well, rcX, or svn trunk)
> >* try again with large "al-extends"
> >  and then again in a second run...
> >  
> I have not been able to run 0.7.22 and 8 yet due to our system constraint. I did try a large
> "al-extents" value (2570) with numruns 5 (it was 3 before), which resulted in at least the
> better max latency data:
> 
> ┌─────────────────┬─────┬─────┬────┬─────┬───────┬───────┬─────────┬───────┬───────┬─────┐
> │Identifier       │File │Blk  │Num │Rate │Maximum│Avg    │Max      │Lat% > │Lat% > │CPU  │
> │                 │Size │Size │Thr │     │CPU%   │Latency│Latency  │2s     │10s    │Eff  │
> ├─────────────────┼─────┼─────┼────┼─────┼───────┼───────┼─────────┼───────┼───────┼─────┤
> │Raw-device       │5000 │4096 │4   │32.19│28.27% │0.347  │5956.10  │0.00244│0.00000│114  │
> ├─────────────────┼─────┼─────┼────┼─────┼───────┼───────┼─────────┼───────┼───────┼─────┤
> │Drbd-disconnected│5000 │4096 │4   │31.52│29.85% │0.365  │5054.72  │0.00080│0.00000│106  │
> ├─────────────────┼─────┼─────┼────┼─────┼───────┼───────┼─────────┼───────┼───────┼─────┤
> │Drbd-connected   │5000 │4096 │4   │28.28│40.48% │0.394  │16223.82 │0.00122│0.00003│70   │
> └─────────────────┴─────┴─────┴────┴─────┴───────┴───────┴─────────┴───────┴───────┴─────┘
                                                                      ^^^^^^^

if you compare the percentage of latency > 2s, it actually decreases,
compared to the "raw" statistics.

so, if you envison a histogram of latency,
your raw device would look like

 |
 |    ...
 | .... .....
 |..        ...  .
 +--------------------------------------------->

and the effect of drbd would be
 |   .
 |  ...
 | .. ..
 |..   ..   .   .             .           .
 +------------------------------------------->

so it in fact seems to "compact" most of the requests,
but then has a few requests that have additional latency
because of transactional meta data updates.

> According the comments in drbd.conf, increasing "al-extents" would
> cause longer resync time if the primary crashes.  Now
> here are my questions:
> 
> 1) Is this the only side-effect of a large "al-extents" value?

yes. bigger al-extents --> less likely for a particular request to cause
additional meta data updates, but larger "active set" to be synced on
primary crash.

 make sure you use 0.7.22,
 we have had bugs in the apply-al-after-primary-crash code!

> 2) Does the activity log works the way similar to the ext3 file system
> journaling,  each transaction is written to the log until
> it is full.

not at all. different concept altogether.

its more like a "cache" where we remember which "extents" are "active".
so in case we recover from a primary crash, we know that any extent
_not_ mentioned in this cache may use the dirty bitmap directly (it was not active),
and any extent that _is_ in this cache is assumed to be completely
dirty, because we cannot possibly know in more detail.

whenever the set of active elements changes, we need a meta data
transaction. as long as your requests stay in the same area,
we don't. the configuratino parameter al-extents tunes the size of this
"set of active elements".

> >* what is the real storage device?  sdX ? mdX ? lvm something?
> >* what is the meta data storage device?
> 
> Our storage device is the SCSI disk sda. LVM is being considered.  The
> meta data resides on the same storage device (internal).

unless that "SCSI disk" is some better hw-raid box with lots of spindles,
you probably can decrease maximum latency by using external meta data on
some other disk (think seek time from data area to meta data area).

> One more thing. When running "tiobench" with DRBD in a connected mode,
> I noticed the throughput would increase ~4-6% if "jnettop" tool is
> turned on to monitor the interface carrying the traffic. Do we know
> what happens?

network voodoo.
 :->

-- 
: Lars Ellenberg                            Tel +43-1-8178292-0  :
: LINBIT Information Technologies GmbH      Fax +43-1-8178292-82 :
: Vivenotgasse 48, A-1120 Vienna/Europe    http://www.linbit.com :
__
please use the "List-Reply" function of your email client.