htfuzzy
ht://Dig © 1995, 1996, 1997 Andrew Scherpbier andrew@contigo.com
Please see the file COPYING for license
information.
Synopsis
- htfuzzy [-c configfile][-v] algorithm
...
Description
- Htfuzzy creates indexes for different ``fuzzy'' search
algorithms. These indexes can then be used by the htsearch program.
Options
- -c configfile
- Use the specified configuration file instead of
the default.
- -v
- Verbose mode. Used once will provide progress
feedback, used more than once will overflow even
the biggest buffers. :-)
Algorithms
- Indexes for the following search algorithms can currently
be created:
- soundex
- Creates a slightly modified soundex key database.
Differences with the standard soundex algorithm
are:
- Keys are 6 digits.
- The first letter is also encoded.
- metaphone
- Creates a metaphone key database. This algorithm
is more specific to English, but will get fewer
"weird" matches than the soundex
algorithm.
- endings
- Creates two databases which can be used to match
common word endings. The creation of these
databases requires a list of affix rules and a
dictionary which uses those affix rules. The
format of the affix rules and dictionary files
are the ones used by the ispell
program. Included with the distribution are the
affix rules for English and a fairly small
English dictionary. Other languages can be
supported by getting the appropriate affix rules
and dictionaries. These are available for many
languages; check the ispell distribution for more
details.
- synonyms
- Creates a database of synonyms for words. It
reads a text database of synonyms and creates a
database that htsearch can then use. Each line of
the text database consists of words where the
first word will have the other words on that line
as synonyms.
Files
- CONFIG_DIR/htdig.conf
- The default configuration file.
See Also
- htdig, htmerge,
htsearch, Configuration file format, ispell.
Andrew Scherpbier
<andrew@contigo.com>
Last modified: Wed Jan 1 20:46:31 PST