[DRBD-announce] drbd-8.3.5.tar.gz

Philipp Reisner philipp.reisner at linbit.com
Wed Oct 28 12:57:07 CET 2009


I am not proud to write another announcement e-mail only 12 days after
the previous one.

When Bug 258 strikes, DRBD's receiver thread is blocked, IO is frozen,
your only way out is to reboot the machine.

To trigger it, you need to drive DRBD into "starving on activity log",
and then interrupt the network connection.

  "starving on activity log" means, that maximal number of hot activity
  log entries is reached, and therefore new IO requests has to wait
  until a previous one is finished. You can watch this by:

  echo 1 > /sys/module/drbd/parameters/proc_details
  watch -n 1 cat /proc/drbd

  Look for "act_log: used:XX/YY". You trigger the bug when XX gets as
  high as YY and you have an interruption of you network connectivity
  in that moment.

  The likelihood gets increased by...
    * heavy write IO on your DRBD device.
    * the replication link has higher latency, i.e. "long distance".

It is really hard to trigger, but I recommend to upgrade from 8.3.3 or
8.3.4 to 8.3.5.

If you are mirroring via a long distance link, typically in a tree
node configuration I _URGE_ you to upgrade.

8.3.5 (api:86/proto:86-91)
 * Fixed a regression introduced shortly before 8.3.3, which might
   case a deadlock in DRBD's disconnect code path. (Bugz 258)
 * Fixed drbdsetup X resume-io which is needed for the recovery
   from the effects of broken fence-peer scripts. (Bugz 256)
 * Do not reduce master score of a current Primary on connection loss,
   to avoid unnecessary migrations
 * Do not display the usage count dialog for /etc/inti.d/drbd status


: Dipl-Ing Philipp Reisner
: LINBIT | Your Way to High Availability
: Tel: +43-1-8178292-50, Fax: +43-1-8178292-82
: http://www.linbit.com

DRBD(R) and LINBIT(R) are registered trademarks of LINBIT, Austria.

More information about the drbd-announce mailing list