https://github.com/mojtaba-khallash/NHazm
A C# version of Hazm (Python library for digesting Persian text)
natural-language-processing persian
- Host: GitHub
- URL: https://github.com/mojtaba-khallash/NHazm
- Owner: mojtaba-khallash
- License: mit
- Created: 2014-06-06T21:24:14.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2017-12-10T19:51:32.000Z (about 7 years ago)
- Last Synced: 2024-08-03T18:15:53.823Z (4 months ago)
- Topics: natural-language-processing, persian
- Language: C#
- Homepage:
- Size: 12.8 MB
- Stars: 38
- Watchers: 5
- Forks: 4
- Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-persian - NHazm - C# library for digesting Persian texts. (C#)
README
NHazm
=====
[![Build status](https://ci.appveyor.com/api/projects/status/el9vqyfy45vxsu1w?svg=true)](https://ci.appveyor.com/project/mojtaba-khallash/nhazm)
[![codecov](https://codecov.io/gh/mojtaba-khallash/NHazm/branch/master/graph/badge.svg)](https://codecov.io/gh/mojtaba-khallash/NHazm)
[![Dependency Status](https://www.versioneye.com/user/projects/58e6585c24ef3e003b526e78/badge.svg?style=flat)](https://www.versioneye.com/user/projects/58e6585c24ef3e003b526e78)
[![GitHub release](https://img.shields.io/github/release/mojtaba-khallash/NHazm.svg)](https://github.com/mojtaba-khallash/NHazm/releases)
[![License](http://img.shields.io/:license-mit-blue.svg)](http://badges.mit-license.org)

A C# version of [Hazm](https://github.com/sobhe/hazm) (Python library for digesting Persian text)

+ Text cleaning
+ Sentence and word tokenizer
+ Word lemmatizer
+ POS tagger
+ Dependency parser
+ Corpus readers for:
  * [Hamshahri](http://ece.ut.ac.ir/dbrg/hamshahri/)
  * [Bijankhan](http://ece.ut.ac.ir/dbrg/bijankhan/)
  * [Persica](https://sourceforge.net/projects/persica/)
  * [Verb Valency](http://dadegan.ir/catalog/pervallex)

## Requirements

* [Stanford Log-linear Part-Of-Speech Tagger for .NET](http://sergey-tihon.github.io/Stanford.NLP.NET/StanfordPOSTagger.html), which can be installed from NuGet:

  > Install-Package Stanford.NLP.POSTagger

* [MaltParser for .NET](http://sergey-tihon.github.io/MaltParser.NET/), which can be installed from NuGet:

  > Install-Package MaltParser

* You can download the [pre-trained tagger](https://www.dropbox.com/s/rfbo13u11wkh0yu/resources.zip?dl=0) and [parser models](https://www.dropbox.com/s/vuchhc4tlriiudk/resources-extra.zip?dl=0) for Persian and put them in the `Resources` folder of your project.
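
As a quick sanity check before loading the models, something like the following could confirm that the `Resources` folder is in place and show what it contains. This is only a minimal sketch built on `System.IO`: the `ResourceCheck` class and its `ListModelFiles` method are hypothetical helpers, not part of NHazm, and the folder is assumed to sit next to the application's binaries.

```csharp
using System;
using System.IO;

// Hypothetical helper (not part of NHazm): lists the files under a
// Resources folder so missing models are noticed before loading them.
public static class ResourceCheck
{
    // Returns every file under resourcesDir (searched recursively),
    // or an empty array when the folder does not exist.
    public static string[] ListModelFiles(string resourcesDir)
    {
        if (!Directory.Exists(resourcesDir))
        {
            return new string[0];
        }
        return Directory.GetFiles(resourcesDir, "*", SearchOption.AllDirectories);
    }

    public static void Main()
    {
        // Assumption: Resources sits next to the application's binaries.
        string dir = Path.Combine(AppDomain.CurrentDomain.BaseDirectory, "Resources");
        string[] files = ListModelFiles(dir);
        if (files.Length == 0)
        {
            Console.WriteLine("No model files found in " + dir);
        }
        foreach (string file in files)
        {
            Console.WriteLine(file);
        }
    }
}
```

If the folder lives somewhere else in your project layout, pass that path to `ListModelFiles` instead of the one derived from the base directory.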