Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/znerol/ot2solr
OpenThesaurus Dump to SORL Synonyms
https://github.com/znerol/ot2solr
Last synced: 25 days ago
JSON representation
OpenThesaurus Dump to SORL Synonyms
- Host: GitHub
- URL: https://github.com/znerol/ot2solr
- Owner: znerol
- Created: 2012-12-05T09:30:58.000Z (almost 12 years ago)
- Default Branch: master
- Last Pushed: 2013-03-28T12:58:50.000Z (over 11 years ago)
- Last Synced: 2024-04-16T00:33:50.967Z (7 months ago)
- Language: Python
- Size: 99.6 KB
- Stars: 5
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
OpenThesaurus Dump to SORL Synonyms
===================================ot2solr.py is a simple python script capable of turning text-dumps from
[openthesaurus.de] into the synonym file format used in [Apache SOLR]. Terms
can be optionally filtered using regular expressions. When [Unidecode] module
is installed, the result also can be transliterated such that only ASCII chars
< 128 remain in the output.[openthesaurus.de]:
http://www.openthesaurus.de/
[Apache SOLR]:
http://lucene.apache.org/solr/
[Unidecode]:
http://pypi.python.org/pypi/UnidecodeUsage
-----python ot2solr.py < dump.txt > synonyms.txt
Contribute
----------Source code and issue tracker are available from:
https://github.com/znerol/ot2solrLicense
-------This work is released to the public domain. No rights reserved.