https://github.com/farhad-here/persian_text_processing
Persian text processing with the parsivar library.
- Host: GitHub
- URL: https://github.com/farhad-here/persian_text_processing
- Owner: farhad-here
- License: mit
- Created: 2024-11-12T21:46:24.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2024-11-13T01:04:48.000Z (6 months ago)
- Last Synced: 2025-01-22T18:32:23.846Z (4 months ago)
- Topics: nlp, nltk-python, parsivar, persiantextprocessing, python, text-pre, text-processing, textpreprocessing
- Language: Python
- Homepage:
- Size: 5.86 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# 📓Persian text processing project for my university
# 📘Manual
```
pip install -r requirements.txt
```
In the code, we first normalize the text, then tokenize it, and after tokenizing we stem the tokens. Finally, for spell checking you need to download two files and put them in the path below:
First, create a `spell` folder in this path:
```
venv\Lib\site-packages\parsivar\resource
```
Then place these two files in the `spell` folder:
- onegram.pckl
- mybigram_lm.pckl
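
The folder setup above can also be scripted. The following is only a minimal sketch using the Python standard library: the helper name `install_spell_files` and the `downloads_dir` argument are hypothetical and not part of this repository, and it assumes the two files have already been downloaded manually.

```python
import shutil
from pathlib import Path


def install_spell_files(resource_dir, downloads_dir,
                        names=("onegram.pckl", "mybigram_lm.pckl")):
    """Create <resource_dir>/spell and copy the two spell files into it.

    Hypothetical helper: resource_dir is parsivar's resource folder,
    downloads_dir is wherever the two files were saved by hand.
    """
    spell_dir = Path(resource_dir) / "spell"
    spell_dir.mkdir(parents=True, exist_ok=True)  # no error if it already exists
    copied = []
    for name in names:
        source = Path(downloads_dir) / name
        if not source.is_file():
            raise FileNotFoundError(f"missing downloaded file: {source}")
        target = spell_dir / name
        shutil.copy2(source, target)  # overwrites an older copy if present
        copied.append(target)
    return copied


# Example call on Windows (the "downloads" folder name is an assumption):
# install_spell_files(r"venv\Lib\site-packages\parsivar\resource", "downloads")
```

Raising an error for a missing file is a design choice here, so that a half-finished setup is noticed before the spell checker is used.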
#### 🔽Download two files from here

# 🎥preview
# 📳technology
- python
- nltk
- parsivar
- bidi
- arabic_reshaper
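
As a rough illustration of how parsivar fits the normalize, tokenize, stem order described in the manual, here is a minimal sketch. It is not the repository's actual code: the function names `preprocess` and `build_parsivar_tools` are hypothetical, and the only parsivar calls assumed are `Normalizer().normalize`, `Tokenizer().tokenize_words`, and `FindStems().convert_to_stem`.

```python
def preprocess(text, normalizer, tokenizer, stemmer):
    """Normalize the text, split it into words, then stem each word."""
    normalized = normalizer.normalize(text)
    tokens = tokenizer.tokenize_words(normalized)
    return [stemmer.convert_to_stem(token) for token in tokens]


def build_parsivar_tools():
    """Create the three parsivar objects used by preprocess().

    parsivar is imported lazily, so preprocess() can also be tried
    with stand-in objects when the library is not installed.
    """
    from parsivar import Normalizer, Tokenizer, FindStems
    return Normalizer(), Tokenizer(), FindStems()


# Example call (requires parsivar to be installed):
# normalizer, tokenizer, stemmer = build_parsivar_tools()
# stems = preprocess("some Persian text", normalizer, tokenizer, stemmer)
```

Spell checking is left out of this sketch because it depends on the two downloaded files being in place in the `spell` folder.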