Our Lotus Domino Cluster Failover Problem

In certain computing circles, "clustering" is a dirty word. I've heard of situations where, far from providing business continuity peace of mind, it creates more work and greater risk than it would if it were not involved.

This is not the case with Domino clustering. Done properly, it is extremely reliable.

Our Problem
Recently, our cluster seems to have "picked up a slight flutter". Actually, I think that perhaps the rules behind it may have changed sometime around our 8.0 or 8.5 migration.

So, first I want to cover roughly what our cluster looks like. This is quite a simplistic view and there are servers missing; I'm concentrating on the problem area only.

We have an onsite and offsite clustered Lotus Domino server, both running Lotus Domino 8.5 HF 1021. We'll call them "Onsite" and "Offsite" for ease of reference. The servers are quite a distance apart because we're clustering for business continuity purposes.

The theory is that our onsite staff members should access the onsite server unless it is down. The majority of our agents also run on this server, as does an intranet, extranet and several web sites. It's a busy and powerful box.

We discovered recently that many of our clients have been using the offsite server but we don't know exactly why.

It seems that if you open a database for which you don't already have a desktop icon, then the Notes client will default to opening it from the offsite server. What has exacerbated this problem is that we upgraded our clients to 8.5.1 and blew away their desktops. Now, suddenly all the computers are trying to access everything off the offsite server.

The reasons?
We don't know, but we were thinking that it was either:

  • Alphabetic: Because "Offsite" sorts before "Onsite" alphabetically
or
  • Task Related: Because the Onsite server is much busier than the Offsite one.
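
To make the alphabetic theory concrete, here is a minimal Python sketch. It assumes (and this is only our guess, not confirmed Notes behaviour) that the client simply takes whichever server name sorts first, ignoring case. The function name `first_alphabetical` is made up for illustration.

```python
def first_alphabetical(server_names):
    """Return the server name that sorts first, ignoring case.

    This models the *hypothesis* only; it is not how the Notes
    client is known to choose a replica.
    """
    return min(server_names, key=str.casefold)


# "Offsite" sorts ahead of "Onsite" because "f" comes before "n".
print(first_alphabetical(["Onsite", "Offsite"]))
```

If this theory is right, renaming or re-ordering the servers would change which one gets picked, regardless of how busy either box is.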

Does anyone have any ideas as to how we could go about finding out?

Comments

Craig said…
We have a similar setup, but our server names are in the opposite alphabetical order. We'll call the offsite one "Notes" and the onsite one "Domino" (that'll hint at the age of the installation).

Folks' clients go to the onsite one first, so based on that I'd vote for the alphabetical explanation.

But it's most likely because computers hate us and enjoy making life hard. ;-)
Alex said…
A couple of thoughts:
1) Do you enforce the mail server by desktop policy?

a) There was an SPR addressed in 8.0.2 FP5: AJAS7PDKER
SPR# AJAS7PDKER - After restart, the top of the workspace icon stack does not honor the Mail file location set defined in the location document. This regression was introduced from 8.0.2

2) Have you tried server_restricted on the outside server, to make sure the users only access the inside server?
Keith Brooks said…
If the threshold is set too low on usage they will end up at the offsite.
If you use policies to push out the applications and such to be only on the onsite server then they should find it first.
Domino does lookup by alpha so it is possible you have multiple ways to do this.
You could also disable the offsite server from anyone using it by setting its threshold to maximum and stop people from accessing it, if you so desired.
Gavin Bollard said…
Thanks for your comments. We do enforce the mail server by desktop policy but we have a lot of other databases and these are our biggest failover problem.

I looked at server_restricted and although it looks interesting, it seems to be suggesting that replication with a restricted server will fail. Since we need the other server in case of DR, I need things replicated on it.

I'll have a look at the Server_Availability_Threshold notes.ini option.
Paul Mooney said…
First thing... alphabet comes to mind.

Second thing. Restrict user access on the server to stop users hitting the box; that will still let replication do its thing.

Last - check SAI and expansion factor / trans info range.

By all means email me if I can help.
P.
Unknown said…
We discovered a similar situation. It turned out that ours was caused by a feature introduced with the R7 client/server called Replication Triangulation. It works in a cluster, or whenever replicas exist on another server. If the primary server is down, the user either fails over to the cluster server or, using the replication history, finds a replica on any server in the domain that the user has access to. Since we are not a cluster shop (currently), we had to deny users access at the server level to the backup server in our DR site, where all the replicas exist. There are a few articles posted about this, and there are policy and notes.ini settings you can use to control it. Lotus calls it a feature enhancement to not force full replication from scratch on every server, but as you can see it has some negative effects in shops where we need to control server usage more strictly.
Chad Scott said…
Based on the description, my guess is that you have a Domain Catalog and that users have a catalog server defined in the Location document. In that scenario, and absent a workspace icon, the first database found in the catalog (alphabetical search) will be the one opened.

While you could use SERVER_RESTRICTED=2 to prevent users from accessing the DR server, this isn't ideal because access won't be seamless in the event the primary server is down. Instead, the better solution is to set SERVER_AVAILABILITY_THRESHOLD=100 on the DR server, which means it is always in a busy state and will only take user connections if no other server is available.
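
As a rough illustration of that idea, here is a simplified Python model. It is a sketch under stated assumptions, not Domino's actual failover algorithm: the `Server` class and `pick_server` function are hypothetical, a threshold of 0 is assumed to switch the busy check off, a threshold of 100 is treated as always busy, and anything in between counts as busy when the availability index falls below the threshold.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Server:
    name: str
    up: bool
    availability_index: int  # 0-100, higher means less loaded (modelled)
    threshold: int           # stands in for SERVER_AVAILABILITY_THRESHOLD

    def is_busy(self) -> bool:
        if self.threshold == 0:     # assumption: 0 disables the busy check
            return False
        if self.threshold >= 100:   # as described above: always busy
            return True
        return self.availability_index < self.threshold


def pick_server(servers: List[Server]) -> Optional[str]:
    """Prefer the first reachable server that is not busy;
    fall back to a busy one only if nothing else is reachable."""
    reachable = [s for s in servers if s.up]
    for server in reachable:
        if not server.is_busy():
            return server.name
    return reachable[0].name if reachable else None


onsite = Server("Onsite", up=True, availability_index=60, threshold=0)
offsite = Server("Offsite", up=True, availability_index=95, threshold=100)

# Offsite is listed first, but it is always busy, so Onsite is chosen.
print(pick_server([offsite, onsite]))

# With Onsite down, the busy Offsite server still takes the connection.
onsite.up = False
print(pick_server([offsite, onsite]))
```

The point of the model is that a permanently busy DR server stays usable for failover, which is the difference from restricting it outright.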
Randy Bye said…
I'll vote for alphabetical too. See this technote
http://www-01.ibm.com/support/docview.wss?rs=0&uid=swg21260389
Anonymous said…
Hello,

I agree with the previous post; the best way to solve your problem is to use SERVER_AVAILABILITY_THRESHOLD=100.

But you need to know that this parameter is actually bugged and doesn't work, as you can see in the following email I got from IBM:

"I have managed to reproduce the issue using 8.5 and 8.5.1 versions however Development is already aware of the situation and SPR # JSMN825TC8 is opened with the issue. We are expecting the issue to be resolved in 8.5.2 but the status of the SPR is still open and the only available workaround is to use the server_restricted=1 notes.ini value instead.

At the moment we have to wait for the specific SPRs resolution any progress can be checked from the Fixlist database or by directly calling HelpDesk. Please inform me If you require any further assistance from my side or should l conclude the PMR from now on.

Thanks in advance for your understanding "
