Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/hunspell/mythes
https://github.com/hunspell/mythes
Last synced: 28 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/hunspell/mythes
- Owner: hunspell
- License: other
- Created: 2015-06-11T12:04:33.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2023-07-01T12:41:14.000Z (over 1 year ago)
- Last Synced: 2024-08-04T04:07:21.110Z (4 months ago)
- Language: C++
- Size: 4.65 MB
- Stars: 36
- Watchers: 11
- Forks: 4
- Open Issues: 2
-
Metadata Files:
- Readme: README
- License: COPYING
- Authors: AUTHORS
Awesome Lists containing this project
- low-resource-languages - mythes - MyThes is a simple thesaurus that uses a structured text data file and an index file with binary search to lookup words and phrases and return information on part of speech, meanings, and synonyms. (Software / Utilities)
README
MyThes is a simple thesaurus that uses a structured
text data file and an index file with binary search
to lookup words and phrases and return information
on part of speech, meanings, and synonymsMyThes was originall written to provide a thesaurus
for the OpenOffice.org projectThe Main features of MyThes are:
1. written in C++ to make it easier to interface with
LibreOffice, OpenOffice, AbiWord, Pspell, etc2. it is stateless, uses no static variables and
should be completely reentrant with no ifdefs3. it compiles with -ansi and -pedantic and -Wall
with no warnigns so it shouldbe quite portable4. it uses a simple perl program to read the structured
text file and create the index needed for binary
searching5. it is very simple with *lots* of comments.
The main "smarts" are in the structure of the
text file that makes up the thesaurus data6. It comes with a ready-to-go structured thesaurus
data file for en_US extracted from the WordNet-2.0 data.Please see WordNet_license.txt and WordNet_readme.txt
for more information on the very useful project!See http://www.danielnaber.de/wn2ooo/ for utilities to
regenerate an up to date English thesaurus from the most
recent WordNet data.
7. The source code has a BSD license (and no advertising clause)MyThes comes with a simple example program that looks up some words and returns
meanings and synonyms.To build it simply do the following:
unzip mythes.zip
cd mythes
./configure
makeTo run the example program:
./example th_en_US_new.idx th_en_US_new.dat checkme.lstTo run the example program with stemming and morphological generation:
e.g. to check mouse, mice, rodents, eats, eaten, ate, eating etc. words
./example morph.idx morph.dat morph.lst morph.aff morph.dicNOTE: this is only an example and test environment for dictionary developers,
full English stemming and morphological generation needs an improved
English Hunspell dictionary.László Németh
Kevin Hendricks