https://github.com/urduhack/urduhack
An NLP library for the Urdu language. It comes with many batteries-included features to help you process Urdu data as easily as possible.
- Host: GitHub
- URL: https://github.com/urduhack/urduhack
- Owner: urduhack
- License: mit
- Created: 2018-12-27T06:11:05.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2024-01-04T16:16:28.000Z (over 2 years ago)
- Last Synced: 2025-10-13T12:24:51.918Z (9 months ago)
- Topics: backer, deep-learning, deeplearning, machine-learning, nlp-library, python, sponsors, tensorflow, urdu, urdu-hack, urdu-language, urdu-nlp, urdu-text-processsing, urduhack
- Language: Python
- Homepage: https://urduhack.readthedocs.io/en/stable/
- Size: 475 KB
- Stars: 302
- Watchers: 11
- Forks: 44
- Open Issues: 14
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- Funding: .github/FUNDING.yml
- License: LICENSE
Awesome Lists containing this project
- awesome-nlp - urduhack - NLP library for Urdu. (NLP per Language / Libraries)
README
# Urduhack: A Python NLP library for the Urdu language
Urduhack is an NLP library for the Urdu language. It comes with many batteries-included features to help you process Urdu
data as easily as possible.
You can reach the core contributor, Mr Ikram Ali, at https://github.com/akkefa
Our Goal
--------
- **Academic users:** Experiment more easily and test hypotheses without coding from scratch.
- **NLP beginners:** Learn how to build an NLP project with production-level code quality.
- **NLP developers:** Build a production-level application within minutes.
🔥 Features Support
-------------------
- [x] Normalization
- [x] Preprocessing
- [x] Tokenization
- [x] Pipeline Module
- [x] Models
- [x] POS tagger
- [x] Lemmatizer
- [x] Named entity recognition
- [ ] Sentiment analysis
- [ ] Image to text
- [ ] Question answering system
- [x] Datasets loader
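To give a feel for what character normalization means for Urdu text, here is a minimal standard-library sketch. It is not Urduhack's implementation, and this README does not describe the library's actual rules; the function name `normalize_urdu_chars` and the two-entry mapping are illustrative assumptions. The sketch applies Unicode NFC composition and then swaps two Arabic letter forms for the code points conventionally used in Urdu.

```python
import unicodedata

# Illustrative mapping only (an assumption, not Urduhack's rule set):
# Arabic letter forms commonly replaced by their Urdu counterparts.
ARABIC_TO_URDU = {
    "\u064A": "\u06CC",  # ARABIC LETTER YEH -> ARABIC LETTER FARSI YEH
    "\u0643": "\u06A9",  # ARABIC LETTER KAF -> ARABIC LETTER KEHEH
}
_TABLE = str.maketrans(ARABIC_TO_URDU)


def normalize_urdu_chars(text: str) -> str:
    """Compose characters (NFC), then swap Arabic letter forms for Urdu ones."""
    return unicodedata.normalize("NFC", text).translate(_TABLE)
```

A real normalizer would cover many more characters, so treat this only as a picture of the idea.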
🛠 Installation
---------------
Urduhack officially supports Python 3.6–3.7, and runs great on PyPy.
Install with the CPU version of TensorFlow:
``` {.sourceCode .bash}
$ pip install urduhack[tf]
```
Install with the GPU version of TensorFlow:
``` {.sourceCode .bash}
$ pip install urduhack[tf-gpu]
```
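Before or after installing, a quick environment check can help. The sketch below uses only the standard library; the helper names `is_supported_python` and `is_installed` are hypothetical and not part of Urduhack. It compares the running interpreter against the Python 3.6 to 3.7 range stated above and looks for the `urduhack` package without importing it.

```python
import importlib.util
import sys

# Supported range taken from the statement above: Python 3.6 through 3.7.
MIN_SUPPORTED = (3, 6)
MAX_SUPPORTED = (3, 7)


def is_supported_python(version=None) -> bool:
    """Return True if (major, minor) falls inside the documented range."""
    major_minor = tuple(version or sys.version_info)[:2]
    return MIN_SUPPORTED <= major_minor <= MAX_SUPPORTED


def is_installed(package: str = "urduhack") -> bool:
    """Return True if the package can be located without importing it."""
    return importlib.util.find_spec(package) is not None


if __name__ == "__main__":
    print("Python version supported:", is_supported_python())
    print("urduhack installed:", is_installed())
```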
Usage
-----
```python
import urduhack

# Download the pretrained models
urduhack.download()

# Build the processing pipeline
nlp = urduhack.Pipeline()
text = ""  # the Urdu text to analyse
doc = nlp(text)

for sentence in doc.sentences:
    print(sentence.text)
    # Part-of-speech tag for each word
    for word in sentence.words:
        print(f"{word.text}\t{word.pos}")
    # Named-entity label for each token
    for token in sentence.tokens:
        print(f"{token.text}\t{token.ner}")
```
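If you would rather collect the tags than print them, the loop in the usage example can be wrapped in a small helper. This is a sketch: `collect_tags` is a hypothetical name, and it relies only on the attributes used above (`doc.sentences`, `sentence.words`, `word.pos`, `sentence.tokens`, `token.ner`). The stand-in document built with `SimpleNamespace` holds placeholder values so the sketch runs without downloading models; a real `doc` would come from `nlp(text)`.

```python
from types import SimpleNamespace


def collect_tags(doc):
    """Return (pos_rows, ner_rows) as lists of (text, tag) tuples."""
    pos_rows, ner_rows = [], []
    for sentence in doc.sentences:
        pos_rows.extend((word.text, word.pos) for word in sentence.words)
        ner_rows.extend((token.text, token.ner) for token in sentence.tokens)
    return pos_rows, ner_rows


# Stand-in document with placeholder values (not real model output).
fake_doc = SimpleNamespace(sentences=[SimpleNamespace(
    text="w1 w2",
    words=[SimpleNamespace(text="w1", pos="TAG_A"),
           SimpleNamespace(text="w2", pos="TAG_B")],
    tokens=[SimpleNamespace(text="w1", ner="LABEL_X"),
            SimpleNamespace(text="w2", ner="LABEL_Y")],
)])

pos_rows, ner_rows = collect_tags(fake_doc)
```

Returning plain tuples keeps the result easy to write to a file or load into a table later.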
🔗 Documentation
----------------
Documentation is available at the following pages:
| Documentation | |
| --------------- | -------------------------------------------------------------- |
| [Installation] | How to install Urduhack and download models |
| [Quickstart] | New to Urduhack? Here's everything you need to know! |
| [API Reference] | The detailed reference for Urduhack's API. |
| [Contribute] | How to contribute to the code base. |
[Installation]: https://urduhack.readthedocs.io/en/stable/installation.html
[Quickstart]: https://urduhack.readthedocs.io/en/stable/quickstart/index.html
[API Reference]: https://urduhack.readthedocs.io/en/stable/reference/index.html
[Contribute]: https://github.com/urduhack/urduhack/blob/master/CONTRIBUTING.md
👍 Contributors
----------------
Special thanks to everyone who helped bring Urduhack to its current state.
Backers
-------
Thank you to all our backers! 🙏 [[Become a backer](https://opencollective.com/urduhack#backer)]
Sponsors
--------
Support this project by becoming a sponsor. [[Become a sponsor](https://opencollective.com/urduhack#sponsor)]
📝 Copyright and license
------------------------
Code released under the [MIT License](https://github.com/urduhack/urduhack/blob/master/LICENSE).