Note: "permalinks" may not be as permanent as we would like,
direct links of old sources may well be a few messages off.
Heiko, any hint from you about actual crash itself (stack trace) as well as OS and software versions involved (xen/heartbeat/drbd etc.)? BR, Ivars Heiko wrote: > Hello, > > i am investigating why our server pairs reboot themselves from time to > time. > This is very annoing because these machines are in production and i always > have to fix mysql replications or drbd splitbrains after these reboots. > > We have 3 pairs that use a drbd/xen/heartbeat setup and 2 of these > pairs crash, > sometimes every 2 week sometimes only twice a year. > > I first thought it could be heartbeat, but I stopped the service on 1 > pair and we also had a crash. > Are there other people who had these kind of crashes? > I dont even know if it is a crash, i never can find anything in my > logfiles about problems, or about heartbeat that does a safety reboot. > > this is one drbd.conf entry: > > resource drbd_backend { > protocol C; > startup { > degr-wfc-timeout 120; # 2 minutes. > } > disk { > on-io-error detach; > } > net { > } > syncer { > rate 500M; > al-extents 257; > } > > on xen-B1.fra1 { > device /dev/drbd0; > disk /dev/md3; > address 172.20.2.1:7788 <http://172.20.2.1:7788>; > meta-disk internal; > } > on xen-A1.fra1 { > device /dev/drbd0; > disk /dev/md3; > address 172.20.1.1:7788 <http://172.20.1.1:7788>; > meta-disk internal; > } > } > > > this the ha.cf <http://ha.cf> > > debugfile /var/log/ha-debug > logfile /var/log/ha-log > logfacility local0 > keepalive 2 > deadtime 60 > #warntime 10 > initdead 120 > udpport 694 > ucast eth0 172.20.1.1 > ucast eth0 172.20.2.1 > auto_failback on > node xen-A1.fra1 > node xen-B1.fra1 > > > and this the xen config > > debugfile /var/log/ha-debug > logfile /var/log/ha-log > logfacility local0 > keepalive 2 > deadtime 60 > #warntime 10 > initdead 120 > udpport 694 > ucast eth0 172.20.1.1 > ucast eth0 172.20.2.1 > auto_failback on > node xen-A1.fra1 > node xen-B1.fra1 > > > can you please give me some assistance? > > greetings > > Rupert > ------------------------------------------------------------------------ > > _______________________________________________ > drbd-user mailing list > drbd-user at lists.linbit.com > http://lists.linbit.com/mailman/listinfo/drbd-user >