On Wed, Aug 31, 2016 at 3:49 PM, Mia Lueng <xiaozunvlg at gmail.com> wrote:
> Hi:
> I have a cluster with four drbd devices. I found oracle stopped
> timeout while drbd is in resync state.
> oracle is blocked like following:
>
> oracle 6869 6844 0.0 0.0 71424 12616 ? S 16:28
> 00:00:00 pipe_wait
> /oracle/app/oracle/dbhome_1/bin/sqlplus
> @/tmp/ora_ommbb_shutdown.sql
> oracle 6870 6869 0.0 0.1 4431856 26096 ? Ds 16:28
> 00:00:00 get_write_access oracleommbb
> (DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))
>
>
> drbd state
>
> 2016-08-30 16:33:32 Dump [/proc/drbd] ...
> =========================================
> version: 8.3.16 (api:88/proto:86-97)
> GIT-hash: bbf851ee755a878a495cfd93e1a76bf90dc79442 Makefile.in build
> by drbd at build 2012-06-07 16:03:04
> 0: cs:SyncSource ro:Primary/Secondary ds:UpToDate/Inconsistent B r-----
> ns:2777568 nr:0 dw:492604 dr:3305833 al:4761 bm:439 lo:31 pe:613
> ua:0 ap:31 ep:1 wo:d oos:4144796
> [======>.............] sync'ed: 35.7% (4044/6280)M
> finish: 0:10:19 speed: 6,680 (3,664) K/sec
> 1: cs:SyncSource ro:Secondary/Secondary ds:UpToDate/Inconsistent B r-----
> ns:3709600 nr:0 dw:854764 dr:7632085 al:7689 bm:3401 lo:38 pe:3299
> ua:38 ap:0 ep:1 wo:d oos:6204676
> [=======>............] sync'ed: 41.5% (6056/10340)M
> finish: 0:22:14 speed: 4,640 (10,016) K/sec
> 2: cs:SyncSource ro:Primary/Secondary ds:UpToDate/Inconsistent B r-----
> ns:3968883 nr:0 dw:127937 dr:5179641 al:190 bm:304 lo:1 pe:139 ua:0
> ap:7 ep:1 wo:d oos:2124792
> [============>.......] sync'ed: 66.3% (2072/6144)M
> finish: 0:06:12 speed: 5,692 (6,668) K/sec
> 3: cs:SyncSource ro:Primary/Secondary ds:UpToDate/Inconsistent B r-----
> ns:89737 nr:0 dw:439073 dr:2235186 al:724 bm:35 lo:0 pe:45 ua:0 ap:7
> ep:1 wo:d oos:8131104
> [>....................] sync'ed: 1.6% (7940/8064)M
> finish: 10:44:09 speed: 208 (204) K/sec (stalled)
>
> Is this a known bug and fixed in the further version?

Could you provide more details about what you mean by "cluster"?
Is DRBD under the control of a CRM such as Pacemaker? If so, are you
running DRBD in dual-primary mode?
Also, when does this state occur, and under what conditions (e.g. after
restarting a node)?
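
When collecting those details, it may help to capture the per-device state at the moment the shutdown hangs. Below is a minimal sketch, assuming the 8.3-style /proc/drbd layout shown in the quoted dump; the function name parse_proc_drbd and the SAMPLE text are only illustrative and not part of any DRBD tooling. It extracts the connection state, the numeric counters (for example pe = pending, ap = application pending, oos = out of sync) and whether DRBD marked the resync as "(stalled)".

```python
import re

# Device header, e.g. "3: cs:SyncSource ro:Primary/Secondary ds:UpToDate/Inconsistent B r-----"
DEVICE_RE = re.compile(r"^(\d+):\s+cs:(\S+)\s+ro:(\S+)\s+ds:(\S+)")
# Numeric counters such as "pe:45" or "oos:8131104" ("wo:d" is skipped: not numeric)
COUNTER_RE = re.compile(r"\b([a-z]{2,3}):(\d+)\b")


def parse_proc_drbd(text):
    """Return {minor: {"cs", "ro", "ds", "counters", "stalled"}} from a /proc/drbd dump."""
    devices = {}
    current = None
    for raw in text.splitlines():
        # Tolerate email quoting ("> ") and leading indentation.
        line = raw.lstrip("> ").strip()
        header = DEVICE_RE.match(line)
        if header:
            current = {
                "cs": header.group(2),
                "ro": header.group(3),
                "ds": header.group(4),
                "counters": {},
                "stalled": False,
            }
            devices[int(header.group(1))] = current
            continue
        if current is None:
            continue  # version / GIT-hash lines before the first device
        for key, value in COUNTER_RE.findall(line):
            current["counters"][key] = int(value)
        if "(stalled)" in line:
            current["stalled"] = True
    return devices


# Device 3 copied from the dump quoted above.
SAMPLE = """\
> 3: cs:SyncSource ro:Primary/Secondary ds:UpToDate/Inconsistent B r-----
> ns:89737 nr:0 dw:439073 dr:2235186 al:724 bm:35 lo:0 pe:45 ua:0 ap:7
> ep:1 wo:d oos:8131104
> [>....................] sync'ed: 1.6% (7940/8064)M
> finish: 10:44:09 speed: 208 (204) K/sec (stalled)
"""

if __name__ == "__main__":
    for minor, dev in sorted(parse_proc_drbd(SAMPLE).items()):
        c = dev["counters"]
        print(minor, dev["cs"], dev["ro"], "stalled" if dev["stalled"] else "running",
              "pe=%d ap=%d oos=%d" % (c.get("pe", 0), c.get("ap", 0), c.get("oos", 0)))
```

On a live node the same function could be fed the contents of /proc/drbd instead of SAMPLE; logging its output every few seconds around the Oracle shutdown would show whether ap and pe stay non-zero while the resync is stalled.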