3.0 Post Office Agent Problems
For a list of error messages related to the Post Office Agent, see Post Office Agent Error Messages
in GroupWise 8 Troubleshooting 1: Error Messages.
This section suggests ways to fix the following problems:
Problem:
The POA does not start.
Possible Cause:
The /home switch is missing.
Possible Cause:
The /home switch points to an unavailable location.
Action:
Make sure the location specified by the /home startup switch is currently available to the POA. If the post office is located on a different server from where the POA will run, preparations for the connection between the POA and the post office are required:
-
If you are using the NetWare POA, you must use the /dn startup switch or the /user and /password startup switches to enable the POA to log in to the server where the post office is located, or you can provide the username and password in ConsoleOne, on the Post Office Settings page of the Post Office object.
-
If you are using the Linux POA, you must mount the file system where the post office is located to the server where the POA is running.
-
If you are using the Windows POA, you must create the drive mapping to the post office before starting the POA.
See Setting Up the GroupWise Agents
in the GroupWise 8 Installation Guide.
Possible Cause:
The server where the post office resides is down.
Action:
Check the status of the server where the post office resides.
Possible Cause:
The POA does not have sufficient rights to the post office directory.
Action:
Make sure the network rights in the post office are correct. Start the POA using the /rights switch to determine the specific problem the POA is encountering and make corrections as needed.
Possible Cause:
The post office database (wphost.db) is damaged.
Possible Cause:
The POA server has inadequate resources.
Possible Cause:
The POA is not installed correctly.
Possible Cause:
A remote document storage area is unavailable.
Action:
Make sure the remote server where the document storage area is located is running.
Action:
Make sure the POA has sufficient rights to log in.
Action:
Make sure the /user and /password or /dn switches have been provided in the POA startup file.
Action:
As a temporary workaround, you can start the POA with the /noconfig switch, so it can start without trying to access the remote document storage area.
Possible Cause:
Language files are missing.
Possible Cause:
A POA is already running on the server.
Action:
If you have defined multiple POAs for the same post office in ConsoleOne, the /name switch is required in each startup file to specify which POA configuration to use when you start each instance of the POA.
Possible Cause:
The POA encounters an error condition.
POA Shuts Down Unexpectedly
Problem:
The POA has been running smoothly, but stops unexpectedly.
Action:
If the POA agent console is still displayed, exit it. If the normal exit procedure does not work, use the system procedure for terminating a program.
-
If you are using the NetWare POA, use the NetWare unload command. If you have other POAs running on the same server, you should exit them before using the unload command. The unload command unloads all POAs running on the same server and might not enable them to terminate gracefully.
-
If you are using the Linux POA, kill the first POA process. You might need to use kill -9.
-
If you are using the Windows POA, close the POA window.
Possible Cause:
Occasionally, a badly damaged message file can cause the POA to shut down.
Action:
Check the contents of the POA input queue in the post office. For the location of the POA input queue in the post office, see Post Office Directory
in GroupWise 8 Troubleshooting 3: Message Flow and Directory Structure.
Move the message files out of the input queue subdirectories, start the POA, then copy the message files back in groups, watching the POA carefully to see if it shuts down on a particular message file. If it does, delete the problem message file so normal processing can resume.
Action:
Rename the post_office\wpcsout\ofs directory, then start the POA. This creates a new, empty input queue and the POA should run smoothly. Then copy the message files from the 0-7 priority subdirectories of wpcsout\ofs back into the correct priority subdirectory so the POA can process them. Copy them in small groups so the damaged message file can be identified and removed.
Possible Cause:
Occasionally, a damaged database in the post office can cause the POA to shut down.
Possible Cause:
Although increasing the number of POA threads from their default settings can, in many cases, improve POA performance, creating too many POA threads can have undesirable results.
Possible Cause:
On a NetWare server, Upgrade Low Priority Threads is set to On.
Action:
The POA might run more smoothly if you set Upgrade Low Priority Threads to Off. At the server console where the POA is running, type set upgrade low priority threads=off.
Possible Cause:
Another program on the server is interfering with the operation of the POA.
Action:
If the POA continues to be unstable, eliminate other programs running on the server. If the POA is stable when another specific program is not running on the same server with it, a conflict might exist between the two programs.
POA Statistics Box Shows Requests Pending
Problem:
GroupWise client users are using TCP/IP links to the post office. At the POA console, the Statistics box shows a large number of pending client/server requests.
POA Statistics Box Shows Users Timed Out
Problem:
GroupWise client users are using TCP/IP links to the post office. At the POA console, the Statistics box shows a large number of users timed out.
Action:
Having users timed out does not indicate a problem with the POA, but rather a problem with users. Users who have timed out are users for which the POA has closed the connection because the GroupWise client was no longer communicating. Timed out users might not be exiting GroupWise normally or might be having other problems with their workstations. The number of timed-out users might tend to increase on a daily basis during the hour after users leave to go home. This is not a problem.
POA Statistics Box Shows Undeliverable Users
Problem:
At the POA console, the Statistics box shows a large number of undeliverable users. Undeliverable users can be encountered using either mapped drive connections or TCP/IP connections. Undeliverable users are counted differently from undeliverable messages. For example, a single message could be addressed to 10 users; perhaps 9 users received the message successfully but 1 user was undeliverable.
Possible Cause:
If messages cannot be delivered to a particular user, that user might have a damaged user database (userxxx.db).
POA Statistics Box Shows Problem Messages
Problem:
At the POA console, the Statistics box shows that problem messages have been encountered.
POA Redirection List Shows Failed TCP/IP Connection
Problem:
At the POA console, client/server statistics show a failed TCP/IP connection.
Action:
Under Client/Server Statistics, use to list existing POAs and the IP addresses of the computers they are running on.
Action:
Under Client/Server Statistics, use to determine which connections are currently valid.
Action:
If a connection is listed as failed for a POA, use the ping command to see if the server is alive. If the server does not respond to the PING command, you must resolve the TCP/IP problem before the POA can use the link successfully.
Possible Cause:
TCP/IP is not functioning correctly on the POA server.
Action:
If the POA is running on a Windows server, make sure TCP/IP is correctly installed and set up on the server where the POA is running.
Possible Cause:
Multiple servers are trying to use the same IP address.
Action:
Check for conflicting IP addresses between those used by POA servers and those used by other servers. Only one server at a time can use the same IP address.
POA Message Transfer Status Box Shows Closed Link
Problem:
The POA is configured to communicate with the MTA by way of TCP/IP. However, the link between the POA and the MTA displays as Closed.
Action:
Check the last closure reason. This information can help you determine the source of the problem. Common last closure reasons include:
POA Starts Unwanted TCP/IP Thread
Problem:
Even though you have started the POA using the /notcpip startup switch or disabled the Enable TCP/IP option in the POA Agent Settings page in ConsoleOne, the POA still starts a TCP/IP thread.
Explanation:
When you select Client/Server Only or Client/Server and Direct as the access mode for a post office and use the /notcpip switch when starting a POA, that POA does not accept incoming client/server connections from GroupWise clients. However, it still starts a single TCP/IP handler thread if TCP/IP is configured on the server. The purpose of this TCP/IP thread is to notify any GroupWise clients connecting to another POA in the post office via TCP/IP they should reread the database because a new message has been delivered by a POA that is not using TCP/IP.
Action:
To totally disable TCP/IP processing in a post office, set the access mode for the post office to Direct Only and start all POAs servicing that post office with the /notcpip switch, or deselect Enable TCP/IP in the Agent Settings page in ConsoleOne.
POA Fails to Deliver Messages
Problem:
The POA is running but no messages are being delivered.
Possible Cause:
The post office is closed.
Possible Cause:
Message file processing has been turned off.
Action:
Make sure that message file processing for the POA has not been turned off using the /nomf, /nomfhigh, or /nomflow switches.
POA Fails to Respond to MTP Configuration Changes in ConsoleOne
Problem:
You change a POA link configuration or network address setting in ConsoleOne, but the POA does not respond to the change. For example, you change from a mapped or UNC link to a TCP/IP link between the POA and the MTA, or you move the POA to a different server and change its IP address. If the configuration change does not replicate successfully to the post office database, the MTA link to the post office becomes closed.
Action:
If you do not want to stop and restart the POA to open a closed post office link, change the configuration information back to what it was before you tried to change it. The MTA should then be able to open the post office link again. After communication between the POA and MTA is re-established, make the configuration changes again. Wait for the configuration changes to be replicated to the post office database (wphost.db), then start the POA in its new location.
Action:
Stop the POA, then start the POA using the /mtpinipaddr and /mtpinport switches to specify the new IP address and port the POA should use for Message Transfer Protocol (MTP) communication with the MTA. After the link is established and all administrative messages have been processed, you do not need to use these startup switches again.
POA Fails to Respond to Other Configuration Changes in ConsoleOne
Problem:
You change a POA configuration setting in ConsoleOne, but the POA does not respond to the change. For example, you change from direct access to client/server mode, or you want to turn off error mail to the administrator.
POA Fails to Update the Post Office Database Version
Problem:
You are updating the agent software on a server where both the POA and the MTA are running. The MTA successfully updates the database version for the domain but the POA fails to update the database version for the post office.
Possible Cause:
You let the Installation program start both agents automatically. As a result, the POA checked the domain version in the post office database before it had a chance to process the administrative message from the MTA that would update the domain version in the post office database.
Action:
Wait until the MTA has updated the domain database. For a large domain database, you might need to wait as much as 20 minutes or more. Verify that the database version has been updated by checking the Domain object’s Identification page in ConsoleOne, then stop and restart the POA to update the post office database. Check the Post Office object’s Identification page in ConsoleOne to verify that the database version has been updated.
Action:
If restarting the POA does not update the post office database version:
-
Compare the dates on the .dc files (gwpo.dc and ngwguard.dc) in the post office directory with the dates on the .dc files in the update source .
-
If the dates on the .dc files in the post office are older than the dates on the .dc files in the update source, copy the .dc files from the update source into the post office directory.
-
At the POA console, recover the post office database. See Recovering the Post Office Database Automatically or Immediately
in Post Office Agent
in the GroupWise 8 Administration Guide.
When the recovery process is finished, the database version should be updated.
-
View the Post Office object’s Identification page in ConsoleOne to verify that the database version has been updated.
POA Runs Repeated Recoveries on the Same Database
Problem:
The POA continually attempts to recover a particular database but never succeeds.
POA Starts in the Wrong Language
Problem:
You have installed the POA in more than one language and it is starting in a different language than you want.
Action:
Start the POA using the /language switch to specify the language.
POA Is Involved with Network Operating System or Hardware Problems
Problem:
The POA is interacting with the network operating system or hardware in an undesirable way.
Possible Cause:
If you just updated the NetWare POA software, you might not have unloaded the agent engine (gwenn5.nlm), resulting in a series of Loader cannot find public symbol: symbol errors on the server console.
Action:
Unload gwenn5.nlm, then start the NetWare POA, so that the newly installed agent engine is loaded along with it.
Possible Cause:
The NetWare POA server is running older software, resulting in TCP_HANDLER errors on the server console.
Action:
Make sure you are using the most current TCP/IP stack for NetWare.
Possible Cause:
The POA server is overburdened, resulting in SYN attacks.
Possible Cause:
The POA server is overburdened. CPU utilization on the server where the NetWare POA is running jumps to 100% and the POA seems to be taking all available resources. GroupWise and other objects on the server are out of sync with other servers.
Action:
Run DSREPAIR.