Interpreting the Cluster Log |
The entries in this section are the record of a failed JOIN operation. They record the attempts of the joining node to find a sponsor from the available addresses and node names.
The first entry is a state entry, reporting the [INIT] state of the node attempting to join.
378.380::1999/06/09-18:00:19.734 [INIT] Attempting to join cluster
CLUSTER1
In all the following entries, the state of the node is [JOIN].
The following nine entries record the spawning of threads that try to connect to the sponsor node and record the node's requests to join the cluster. The service always tries to join a cluster through the following:
378.380::1999/06/09-18:00:19.742 [JOIN] Spawning thread to connect to
sponsor 172.16.64.254
378.3c4::1999/06/09-18:00:19.742 [JOIN] Asking 172.16.64.254 to sponsor
us.
378.380::1999/06/09-18:00:19.742 [JOIN] Spawning thread to connect to
sponsor 172.16.64.2
378.3c8::1999/06/09-18:00:19.742 [JOIN] Asking 172.16.64.2 to sponsor
us.
378.380::1999/06/09-18:00:19.742 [JOIN] Spawning thread to connect to
sponsor 10.10.0.2
378.3cc::1999/06/09-18:00:19.742 [JOIN] Asking 10.10.0.2 to sponsor us.
378.380::1999/06/09-18:00:19.742 [JOIN] Spawning thread to connect to
sponsor NODE2
378.380::1999/06/09-18:00:19.742 [JOIN] Waiting for all connect threads
to terminate.
378.3d0::1999/06/09-18:00:19.742 [JOIN] Asking NODE2 to sponsor us.
The following entries report the failures (or successes) of the requests to join. Using the net helpmsg error number command, you can find out that "status=1722" means "The RPC server is unavailable." In other words, the node attempting to join cannot contact the Cluster service on the target node.
Note the consistency of the process ID in both the request entry and response entries. Note also the consistency of the thread ID for each thread generated to a given sponsor.
378.3c8::1999/06/09-18:00:46.600 [JOIN] Sponsor 172.16.64.2 is not
available (JoinVersion), status=1722.
378.3c4::1999/06/09-18:00:46.600 [JOIN] Sponsor 172.16.64.254 is not
available (JoinVersion), status=1722.
378.3cc::1999/06/09-18:00:46.600 [JOIN] Sponsor 10.10.0.2 is not
available (JoinVersion), status=1722.
378.3c8::1999/06/09-18:00:46.600 [JOIN] JoinVersion data for sponsor
172.16.64.2 is invalid, status 1722.
378.3c4::1999/06/09-18:00:46.600 [JOIN] JoinVersion data for sponsor
172.16.64.254 is invalid, status 1722.
378.3cc::1999/06/09-18:00:46.600 [JOIN] JoinVersion data for sponsor
10.10.0.2 is invalid, status 1722.
378.3d0::1999/06/09-18:00:46.615 [JOIN] Sponsor NODE2 is not available
(JoinVersion), status=1722.
378.3d0::1999/06/09-18:00:46.615 [JOIN] JoinVersion data for sponsor
NODE2 is invalid, status 1722.
All the threads are terminated; failure to find a sponsor for joining a cluster is reported; and, consequently, failure of the attempted join is reported:
378.380::1999/06/09-18:00:46.615 [JOIN] All connect threads have
terminated.
378.380::1999/06/09-18:00:46.615 [JOIN] Unable to connect to any sponsor
node.
378.380::1999/06/09-18:00:46.615 [INIT] Failed to join cluster, status
53
Next, the Cluster service on this node tries to form a cluster.