There are three elements to a query: where to search, what to search for, and what information to view.
A query scope specifies the set of documents to be searched. Scopes are typically specified by a directory path on a storage volume, such as d:\docs. IIS publishes Web "sites," each of which corresponds to a virtual root that points to a collection of documents. This is illustrated in Figure 2.
Figure 2 - IIS Sites
"/Windows NT" and "/Applications" are two virtual roots on the server. The virtual roots map to physical storage locations on the local machine or another connected file server. Index Server indexes documents based on these virtual roots. An administrator can index all the virtual roots on a server or select a subset to index. Queries can be run against multiple virtual roots, against a single virtual root, or even against a single physical directory within a virtual root.
In addition to specifying the scope of a search by virtual directory, physical directory, and volume, Index Server allows query restrictions to be built by combining a number of "tests." Query restrictions available for use in any combination include:
Index Server also allows free-text queries, in which the user can enter any set of words or phrases, or even a complete sentence, as the query restriction. The system examines the text and identifies all the nouns and noun phrases. When a long or complicated phrase is entered, Index Server extracts two- and three-word combinations, along with all remaining content words, and discards "noise" words such as "a", "of", and "the".
For example, assume the user typed in the following free-text query:
"The Fulton County Grand Jury said Friday an investigation of Atlanta's recent primary election produced no evidence that any irregularities took place."
The system would identify the following words and noun phrases:
Words | Phrases
Fulton, county, grand, jury, Friday, investigation, Atlanta, recent, primary, election, produce, evidence, irregularity | Fulton county grand jury; primary election; grand jury; Atlanta's recent primary election
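The extraction step can be approximated with a short sketch. It is a deliberate simplification under stated assumptions: it only drops noise words and forms two- and three-word combinations from adjacent content words, and it does no part-of-speech analysis or stemming, so it over-generates compared with the table above (which keeps only nouns and noun phrases and reduces "produced" to "produce"). The NOISE_WORDS list and the extract_terms function are hypothetical names for this example.

```python
import re

# Small illustrative noise-word list (an assumption for this sketch).
NOISE_WORDS = {"a", "an", "and", "any", "no", "of", "that", "the"}


def extract_terms(text, noise_words=NOISE_WORDS):
    """Return (words, phrases) extracted from a free-text query.

    words:   content words, lowercased, in order of first appearance.
    phrases: two- and three-word combinations of adjacent content words.
    """
    tokens = re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())

    # Split the token stream into runs of content words; a noise word
    # ends the current run, so phrases never span a noise word.
    runs, current = [], []
    for token in tokens:
        if token in noise_words:
            if current:
                runs.append(current)
            current = []
        else:
            current.append(token)
    if current:
        runs.append(current)

    # dict.fromkeys removes duplicates while preserving order.
    words = list(dict.fromkeys(w for run in runs for w in run))

    phrases = []
    for run in runs:
        for size in (2, 3):
            for i in range(len(run) - size + 1):
                phrase = " ".join(run[i:i + size])
                if phrase not in phrases:
                    phrases.append(phrase)
    return words, phrases
```

Applied to the sample sentence, this sketch produces "grand jury" and "primary election" among its phrases, along with combinations such as "said friday" that a true noun-phrase identifier would reject.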
The words and phrases are combined into a restriction, weighted for proper ranking, and posted as a query against the corpus. Free-text queries make querying very simple for beginning users, because they do not need to learn any query language.
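One way to picture the combination and weighting step is sketched below. The weighting scheme is an assumption made purely for illustration (phrases count for more than single words, in proportion to their length). The source does not describe Index Server's actual ranking formula, and build_restriction, score, and rank are hypothetical names.

```python
import re


def build_restriction(words, phrases, word_weight=1.0, phrase_weight=2.0):
    """Combine words and phrases into a list of (term, weight) pairs."""
    # Assumption: a longer phrase is more specific, so its weight grows
    # with the number of words it contains.
    restriction = [(w.lower(), word_weight) for w in words]
    restriction += [
        (p.lower(), phrase_weight * len(p.split())) for p in phrases
    ]
    return restriction


def score(document_text, restriction):
    """Sum the weights of all restriction terms found in the document."""
    normalized = " ".join(re.findall(r"[a-z']+", document_text.lower()))
    padded = f" {normalized} "  # padding enforces whole-word matches
    return sum(w for term, w in restriction if f" {term} " in padded)


def rank(corpus, restriction):
    """Return matching document names, best match first.

    corpus maps a document name to its text; documents that match no
    term are omitted, and ties are broken by name.
    """
    scored = [(score(text, restriction), name) for name, text in corpus.items()]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for s, name in scored if s > 0]
```

With a restriction built from the words "grand" and "jury" and the phrase "grand jury", a document containing the full phrase outranks one that contains only "jury", which is the general effect weighting is meant to achieve.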