Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/hunspell/mythes


https://github.com/hunspell/mythes

Last synced: 28 days ago
JSON representation

Awesome Lists containing this project

README

        

MyThes is a simple thesaurus that uses a structured
text data file and an index file with binary search
to lookup words and phrases and return information
on part of speech, meanings, and synonyms

MyThes was originall written to provide a thesaurus
for the OpenOffice.org project

The Main features of MyThes are:

1. written in C++ to make it easier to interface with
LibreOffice, OpenOffice, AbiWord, Pspell, etc

2. it is stateless, uses no static variables and
should be completely reentrant with no ifdefs

3. it compiles with -ansi and -pedantic and -Wall
with no warnigns so it shouldbe quite portable

4. it uses a simple perl program to read the structured
text file and create the index needed for binary
searching

5. it is very simple with *lots* of comments.
The main "smarts" are in the structure of the
text file that makes up the thesaurus data

6. It comes with a ready-to-go structured thesaurus
data file for en_US extracted from the WordNet-2.0 data.

Please see WordNet_license.txt and WordNet_readme.txt
for more information on the very useful project!

See http://www.danielnaber.de/wn2ooo/ for utilities to
regenerate an up to date English thesaurus from the most
recent WordNet data.

7. The source code has a BSD license (and no advertising clause)

MyThes comes with a simple example program that looks up some words and returns
meanings and synonyms.

To build it simply do the following:

unzip mythes.zip
cd mythes
./configure
make

To run the example program:
./example th_en_US_new.idx th_en_US_new.dat checkme.lst

To run the example program with stemming and morphological generation:
e.g. to check mouse, mice, rodents, eats, eaten, ate, eating etc. words
./example morph.idx morph.dat morph.lst morph.aff morph.dic

NOTE: this is only an example and test environment for dictionary developers,
full English stemming and morphological generation needs an improved
English Hunspell dictionary.

László Németh
Kevin Hendricks