Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/shivam5992/spellbee
spell checking in python
https://github.com/shivam5992/spellbee
Last synced: 1 day ago
JSON representation
spell checking in python
- Host: GitHub
- URL: https://github.com/shivam5992/spellbee
- Owner: shivam5992
- Created: 2014-09-06T11:58:10.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2014-09-06T12:02:18.000Z (over 10 years ago)
- Last Synced: 2024-04-16T03:51:50.677Z (8 months ago)
- Language: Python
- Size: 11.1 MB
- Stars: 5
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: readme.md
Awesome Lists containing this project
README
Spellbee
---Spellbee is an english spelling corrector and suggestor. It uses standard nlp tasks to achieve a better accuraccy level in terms of suggestions and corrections. Every query term undergoes a set of hierarichial well written rules. Queries are checked against various standardised lookup tables and dictionaries. If terms are not found in the tables, then most similar words are shown as spelling suggestions.
**Standardised LookUps:** Standardised lookup tables are created by processing the wordnet database which involves processes like stemming, lemmatization, stopword removal, tokenization, normalization, encoding handling, html unescaping. Wordnet database provides list of all possible words(almost) of english language, though they are not standardised hence needs to be processd once.
**Word Similarity:** It uses weighted edit distance (levenshtien) algorithm to find out number of operations required to change one word to another. To get most similar single word, word similarity ratio is calculated. The one with maximum ratio is choosen as suggestion.
Usage
---
from spellbee.spellbee import spellbeeif __name__ == '__main__':
string = 'Autimattic leaarning procejures can made uss of stastical algorithmas'
suggestions = spellbee._correct(string)
for x in suggestions:
print xReferences:
---
Peter Norvig blog