Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
List, This morning I found the following error in /var/log/messages. I use heartbeat to manage drbd. I am running drbd 8.2.5. I was running a moderate level of load testing on my server, though the stats I see from the crash indicate the system was lightly loaded. This error caused heartbeat to think drbd had failed, and it restarted everything that depended upon drbd (including postgres). Everything came back up fine with the heartbeat restart of drbd. My questions are: 1) What does exit code 20 from drbdsetup mean? 2) The driver was definitely loaded, so what could cause the "no response from driver" message? Can a heavy system load cause some kind of timeout? Excerpt from messages: Aug 11 07:13:12 arc-stgsky-agg1 lrmd: [5233]: info: RA output: (rsc_drbd_7788:monitor:stderr) No response from the DRBD driver! Is the module loaded? Aug 11 07:13:12 arc-stgsky-agg1 lrmd: [5233]: info: RA output: (rsc_drbd_7788:monitor:stderr) Command '/sbin/drbdsetup /dev/drbd0 state' terminated with exit code 20 drbdadm aborting Aug 11 07:13:16 arc-stgsky-agg1 crmd: [5238]: info: process_lrm_event: LRM operation rsc_drbd_7788_monitor_120000 (call=477, rc=7) complete My current /proc/drbd: version: 8.2.5 (api:88/proto:86-88) GIT-hash: 9faf052fdae5ef0c61b4d03890e2d2eab550610c build by root at arc-stgsky-agg1.wsicorp.com, 2008-05-19 10:01:19 0: cs:Connected st:Primary/Secondary ds:UpToDate/UpToDate C r--- ns:593314408 nr:296596 dw:592558332 dr:124756728 al:2019770 bm:1535 lo:0 pe:0 ua:0 ap:0 resync: used:0/31 hits:65601 misses:191 starving:0 dirty:0 changed:191 act_log: used:0/257 hits:682916576 misses:2027491 starving:4 dirty:7717 changed:2019770 Thanks, Doug Knight -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.linbit.com/pipermail/drbd-user/attachments/20080811/337549d8/attachment.htm>