Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/yididiyan/amharic_spell_corrector

Amharic Spelling Corrector based on SymSpell - Spelling corrector which is 1 million times faster through Symmetric Delete spelling correction algorithm
https://github.com/yididiyan/amharic_spell_corrector

amharic-corpus amharic-words crawling spelling-correction symspell

Last synced: about 2 months ago
JSON representation

Amharic Spelling Corrector based on SymSpell - Spelling corrector which is 1 million times faster through Symmetric Delete spelling correction algorithm

Awesome Lists containing this project

README

        

# Amharic Spelling Corrector

Spelling corrector can be handy where you need to post process outputs of an OCR(Optical Character Recognition), STT(Speech to text).

### Dependencies

* PySymSpell - pure Python port of [SymSpell](https://github.com/wolfgarbe/SymSpell).
* BeautifulSoap

### Installation
```bash
pip install -r requirements.txt
```

### Usage

#### Spelling correctortion

```python
from spell_corrector import lookup
suggestions = lookup('አሽናቆት')
suggestions[0].term
>> 'አድናቆት'
```

#### Edit Distance Calculation
The function calculates the number of edit [ insertion, update, delete ] needed to correct a word to another

```python
from spell_corrector import calculate_distance
calculate_distance('አክብሮትና', 'አክብሮት')
>> 1
```

#### Crawling More Words
The `crawler.py` script crawls around **1 million** frequently used Amharic words on the Web.
```bash
python crawler.py
```