Status-At-A-Glance Help

Overview

Status-At-A-Glance provides the visual status of an Enstore system separated into 4 major sub-systems; the Enstore servers, the storage library, the network between the computers on which Enstore is running and any alarms that have been raised. In addition a more detailed status is given for each of the Enstore servers/movers that belong to the system. The storage library refers to one or more tape robots. Once a problem has been identified it is expected that the more complete and detailed view of the Enstore servers, the network and the alarms will be examined.

There are 4 levels of status represented on this page -

In general a Minor problem will indicate that the Enstore system is still operational as a whole but some piece of it has a problem which should be investigated. A Major problem will imply that the Enstore system is not operational. Minor problems may escalate to major ones and should not be ignored. When a server (or enstore) is marked with a star.gif it means that an unusual situation has been detected and will be monitored. Noone will be contacted until the situation has escalated to a problem.

In addition to showing the status of the system, this page will indicate when pieces of the Enstore system will be undergoing a scheduled outage by placing a checkmark.gif after the item and including some information about the outage.

Sometimes a server or will be known to be down. In this case, the item will have a line drawn through it.

The last section on the page is for informational purposes only. It lists the nodes in the Enstore system and the servers that run on each of them.

Enstore Overall Status

This section presents a summary of the information on the rest of the page. The following table defines what the different status levels mean for each of the sub-systems. Currently the colored ball next to the storage library can only be controlled manually.

Status Levels
Sub-Systemgreenball.gifyelball.gifredball.gifstar.gif
enstore All elements are marked greenball.gif
  • The Inquisitor is down
  • Some Movers are down (less than half) or marked yelball.gif for a Library Manager
  • At least one of the following is down : Alarm Server, Configuration Server, File Clerk, Logger, Volume Clerk, a Library Manager
  • More than one half of all Movers (for a Library Manager) are down
At least one element is marked star.gif
alarms No alarms have been raised One or more alarms have been raised    

A checkmark.gif next to one of these sub-systems, indicates the entire sub-system will be unavailable.

If enstore is crossed out, the entire sub-system (the servers/movers) can be considered to be down.

Enstore Individual Server Status

This section presents the status of each individual server in Enstore. Individual servers are checked to see if they are alive. A server is not reported as being dead until it has been found to be not alive for a configurable number of times (this number is set in the configuration file under the key system and allowed_down). If a server does not seem to be alive, it is marked with a star.gif until the number of times it is seen dead is greater than the number specified in the configuration file as mentioned above. In the following table, n represents this number.

So, for example, if there is an element in the configuration file -

configdict['system'] = { 'allowed_down' : {'log_server' : [1,10], 'library_manager' : [2,20], 'default' : [1,20] }}

then the log_server will be marked as a major problem (redball.gif) the first time it appears to be down. However, the following happens with respect to any library_manager -

Num times seen down in a rowmarker used
1star.gif
2redball.gif


In the following table n refers to the number set in the configuration file (in the above example, n = 2 for the library_manager).

Status Levels
Sub-System Elementgreenball.gifyelball.gifredball.gifstar.gif
Alarm Server alive   Not seen alive when checked n or more times Not seen alive when checked less than n times
Configuration Server alive   Not seen alive when checked n or more times Not seen alive when checked less than n times
File Clerk alive   Not seen alive when checked n or more times Not seen alive when checked less than n times
Logger alive   Not seen alive when checked n or more times Not seen alive when checked less than n times
Inquisitor alive   Not seen alive when checked n or more times Not seen alive when checked less than n times
Volume clerk alive   Not seen alive when checked n or more times Not seen alive when checked less than n times
All Library Managers alive
  • More than half of the associated Movers are marked redball.gif
  • In a pause state
  • In a locked state
  • In an ignore state.
Not seen alive when checked n or more times Not seen alive when checked less than n times
All Media Changers alive   Not seen alive when checked n or more times Not seen alive when checked less than n times
All Movers alive
  • In a DRAINING state
  • In an OFFLINE state
  • Not seen alive when checked n or more times
  • In an ERROR state
Not seen alive when checked less than n times



Legal Notices
Last modified: Thu Mar 21 11:10:14 CST 2002