Index file for word searching (inverted file).
[data length for word1][documentID][score][documentID][score]...
[data length for word2][documentID][score][documentID][score]...
[data length for word3][documentID][score][documentID][score]...
    :
Index for 'seek'ing NMZ.i.
[position of word 1 in NMZ.i][position of word 2 in NMZ.i][position of word 3 in NMZ.i]...
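As a rough illustration of how such a seek table could be used, the sketch below looks up the stored position for a given word number. The 4-byte integer width, the byte order, the 0-based numbering, and the helper name `word_position` are all assumptions made for this example; the description above does not specify them.

```python
import struct

def word_position(ii_bytes, word_id, byteorder=">"):
    """Return the position in NMZ.i stored for word_id.

    Assumptions (not stated in this document): each position is a
    4-byte unsigned integer, word_id counts from 0, and the byte
    order is supplied by the caller.
    """
    size = 4
    start = word_id * size
    if word_id < 0 or start + size > len(ii_bytes):
        raise IndexError("word_id out of range")
    (pos,) = struct.unpack_from(byteorder + "I", ii_bytes, start)
    return pos

# A made-up table holding three positions: 0, 17 and 42.
table = struct.pack(">III", 0, 17, 42)
print(word_position(table, 1))  # 17
```

The returned value would then be passed to a file `seek` on NMZ.i before reading that word's record.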
List of words.
A simple line-oriented text. Sorted in ascending order. You can seek NMZ.ii by line number. (Note: line number = wordID)
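Because the list is sorted, a word can be located by binary search, and its line number then serves as the wordID. The following is a minimal sketch only: it assumes the lines are already loaded into a Python list, that the sort order agrees with Python's string comparison, and that numbering starts at 0. `find_word_id` is a hypothetical helper, not part of any tool described here.

```python
from bisect import bisect_left

def find_word_id(words, word):
    """Return the index of word in the sorted list, or None if it is absent."""
    i = bisect_left(words, word)
    if i < len(words) and words[i] == word:
        return i
    return None

words = ["apple", "banana", "cherry"]   # stand-in for the lines of the word list
print(find_word_id(words, "banana"))    # 1
print(find_word_id(words, "durian"))    # None
```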
Index for 'seek'ing NMZ.w
[position of word 1 in NMZ.w][position of word 2 in NMZ.w][position of word 3 in NMZ.w]...
List of files registered in index.
Each line records a document file that is registered in the index. However, a line beginning with '#' indicates a file deleted from the index, and a line beginning with '##' indicates a comment. Example:
/home/foo/bar1.html
/home/foo/bar2.html
/home/foo/bar3.html
## indexed: Sun, 08 Jan 2006 02:28:00 +0900
(an empty line)
# /home/foo/bar1.html
## deleted: Sun, 08 Jan 1998 12:34:56 +0900
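The line conventions above can be applied mechanically. The sketch below computes which files remain registered; treating a '#' line as cancelling an earlier entry with the same path is this example's interpretation, and `registered_files` is a hypothetical name.

```python
def registered_files(lines):
    """Return the paths still registered after applying deletions, in order."""
    files = []
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("##"):
            continue                      # empty line or comment
        if line.startswith("#"):
            deleted = line[1:].strip()    # '# path' marks a deleted file
            if deleted in files:
                files.remove(deleted)
            continue
        files.append(line)
    return files

example = [
    "/home/foo/bar1.html",
    "/home/foo/bar2.html",
    "/home/foo/bar3.html",
    "## indexed: Sun, 08 Jan 2006 02:28:00 +0900",
    "",
    "# /home/foo/bar1.html",
    "## deleted: Sun, 08 Jan 1998 12:34:56 +0900",
]
print(registered_files(example))
# ['/home/foo/bar2.html', '/home/foo/bar3.html']
```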
Index for phrase searching.
Each pair of words is converted to a 16-bit hash value. For phrase searching, all words in the phrase are first 'AND'ed and searched, and then the word order is checked by referring to NMZ.p. Note that the word order is recorded only for each two-word pair. So, to search "foo bar baz", documents including "foo bar" or "bar baz" are retrieved. Because hash values can collide, inappropriate documents may also be retrieved. Although phrase search is therefore inexact, it usually works fine.
                    |<------ data byte count (1) ------->|
[data byte count(1)][documentID including hash value \x0000]...
                    |<------ data byte count (2) ------->|
[data byte count(2)][documentID including hash value \x0001]...
...
[data byte count(n)][documentID including hash value \xffff]...
Index of index for phrase searching.
[position of \x0000 in NMZ.p][position of \x0001 in NMZ.p] ... [position of \xffff in NMZ.p]
Record information about time stamps and deleted documents.
Each file time stamp is recorded as a 32-bit value. The time stamps are used for sorting search results by date. Also, if the value is -1, the document is regarded as deleted.
[time stamp of documentID1][time stamp of documentID2]...
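To make the layout concrete, here is a small sketch that decodes such a sequence and skips deleted documents. It assumes signed 32-bit integers, a caller-supplied byte order, and 0-based document numbering; none of these details are given above, and the function names are hypothetical.

```python
import struct

DELETED = -1  # value that marks a deleted document

def read_timestamps(data, byteorder=">"):
    """Decode consecutive signed 32-bit time stamps from raw bytes."""
    count = len(data) // 4
    return list(struct.unpack(f"{byteorder}{count}i", data[:count * 4]))

def live_document_ids(timestamps):
    """Return the positions (0-based here) whose time stamp is not -1."""
    return [i for i, ts in enumerate(timestamps) if ts != DELETED]

# Made-up data: three documents, the second one deleted.
raw = struct.pack(">3i", 1136654880, -1, 884230496)
stamps = read_timestamps(raw)
live = live_document_ids(stamps)
print(live)                                                    # [0, 2]
print(sorted(live, key=lambda i: stamps[i], reverse=True))     # newest first: [0, 2]
```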
File to record field information.
Used for field-specified searching. A simple line-oriented text, grep'ed by the regular expression engine. A line number can be used as a documentID. Also used for displaying the search results.
A simple line-oriented text. (line number = documentID)
Since it is a line-oriented text, it can be edited with an editor or other tools. If you edit it, you should rebuild the NMZ.field.{subject,from,date,message-id,...}.i files with rfnmz.
Index for 'seek'ing NMZ.field.{subject,from,date,message-id,...}
[field position in documentID1][field position in documentID2]...
Configuration file for user access control.
Access control by IP address, host name and/or domain name.
'deny' defines hosts from which user access is denied, and 'allow' defines hosts from which user access is allowed. When a host is specified by IP address, prefix matching is used; when a host is specified by host name or domain name, suffix matching is used. 'all' indicates all hosts. The configuration is evaluated from the top. Example:
deny all
allow localhost
allow 123.123.123.
allow .example.jp
This configuration allows access from the localhost, from hosts with IP address 123.123.123.*, and from hosts with domain name *.example.jp. Access from other hosts is denied.
For the Apache web server, access control by host name and/or domain name requires the following directive in "httpd.conf".
HostnameLookups On
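The matching rules for the access control file can be sketched as follows. This is an illustration, not the actual implementation: it assumes that a later matching rule overrides an earlier one (which is consistent with the example, where 'deny all' comes first), that a pattern made only of digits and dots is an IP address, and that access is allowed when no rule matches. `is_allowed` is a hypothetical name.

```python
import re

def is_allowed(rules, ip, hostname=""):
    """Evaluate (action, pattern) rules from the top; the last match decides."""
    allowed = True  # assumption: no matching rule means access is allowed
    for action, pattern in rules:
        if pattern == "all":
            matched = True
        elif re.fullmatch(r"[0-9.]+", pattern):
            matched = ip.startswith(pattern)        # IP address: prefix matching
        else:
            matched = hostname.endswith(pattern)    # host/domain name: suffix matching
        if matched:
            allowed = (action == "allow")
    return allowed

rules = [
    ("deny", "all"),
    ("allow", "localhost"),
    ("allow", "123.123.123."),
    ("allow", ".example.jp"),
]
print(is_allowed(rules, "123.123.123.45"))               # True
print(is_allowed(rules, "10.0.0.1", "www.example.jp"))   # True
print(is_allowed(rules, "10.0.0.1", "www.example.com"))  # False
```

Note that the host name is only available to the matcher if the web server resolves it, which is why the host name lookup setting matters.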
Stores the data necessary to update the index.
File to specify the style of search results.
${field name} is replaced by the contents of the field. For example, ${title} is replaced by the contents of NMZ.field.title. ${namazu::counter} and ${namazu::score} have special meanings: they are replaced by the counter of the search result and its score, respectively.
By default, NMZ.result.normal and NMZ.result.short are provided. Users can freely create NMZ.result.*.
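The substitution described above can be sketched in a few lines. This is only an illustration of the ${...} replacement idea; the function name `render`, the template string, and the choice to replace unknown fields with an empty string are assumptions of this example.

```python
import re

def render(template, fields, counter, score):
    """Replace ${name} placeholders with field contents or the special values."""
    values = dict(fields)
    values["namazu::counter"] = str(counter)
    values["namazu::score"] = str(score)

    def substitute(match):
        # Unknown field names become an empty string (an assumption).
        return values.get(match.group(1), "")

    return re.sub(r"\$\{([^}]+)\}", substitute, template)

template = "${namazu::counter}. ${title} (score: ${namazu::score})"
print(render(template, {"title": "Hello"}, 1, 42))
# 1. Hello (score: 42)
```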
Header of search results.
Footer of search results.
Query description. Displayed when no keyword is given.
Tips for searching. Displayed when no document is retrieved.
Log file of index updating.
Lock file to prevent searching.
Lock file to prevent updating/making the same index simultaneously.
Log file for searched keywords.