Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/zseder/hundict

bilingual dictionary extractor from parallel corpora
https://github.com/zseder/hundict

Last synced: 29 days ago
JSON representation

bilingual dictionary extractor from parallel corpora

Awesome Lists containing this project

README

        

hundict is an experimental python project, that creates bilingual dictionary
from parallel corpora
Features (planned or done):
- easy to use (see hundict -h)
- fast (python fast, of course, not C fast)
- unigram pairs
- A - B
- ngram-ngram extraction, not only unigram-unigram
- ABC - DE
- multiple choice pairs
- (A or B) - C
- stopword remove
- remaining corpora print