
https://github.com/TRoboto/Maha

Maha is a text processing library specially developed to deal with Arabic text.

arabic-cleaners arabic-nlp arabic-parsers arabic-text





An Arabic text processing library intended for use in NLP applications



Maha is a text processing library specially developed to deal with Arabic text. The beta version can be used to clean and parse text, files, and folders with or without streaming capability.
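To make "cleaning" concrete, here is a minimal standard-library sketch of one common Arabic cleaning step: stripping tashkeel (diacritics) and tatweel, either on a single string or lazily over a stream of lines. It does not use Maha's API. The names `strip_tashkeel` and `clean_lines` are hypothetical and chosen only for illustration; see the documentation for the functions Maha actually provides.

```python
import re
from typing import Iterable, Iterator

# Arabic diacritics (tashkeel): fathatan U+064B through sukun U+0652.
TASHKEEL = re.compile(r"[\u064B-\u0652]")
# Tatweel (kashida) U+0640, a purely typographic elongation character.
TATWEEL = "\u0640"


def strip_tashkeel(text: str) -> str:
    """Return text with tashkeel and tatweel removed (illustrative only)."""
    return TASHKEEL.sub("", text).replace(TATWEEL, "")


def clean_lines(lines: Iterable[str]) -> Iterator[str]:
    """Clean lines one at a time, so a large file never sits fully in memory."""
    for line in lines:
        yield strip_tashkeel(line)


if __name__ == "__main__":
    sample = "السَّلامُ عَلَيْكُمْ"
    print(strip_tashkeel(sample))
```

Because `clean_lines` is a generator, it can be fed an open file object directly, which is the general idea behind processing files with streaming rather than loading them whole.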

If you need help or want to discuss topics related to Maha, join our [Discord](https://discord.gg/6W2tRFE7k4) server. To submit a bug report or feature request, please open an issue.

## Installation

Install Maha with pip (the package is published under the name `mahad`):

```bash
pip install mahad # pronounced maha d
```

For source installation, check the [documentation](https://maha.readthedocs.io/en/latest/contributing/guidelines.html).

## Overview

Check out the [overview](https://maha.readthedocs.io/en/stable/overview.html) section in the documentation to get started with Maha.

## Documentation

Documentation is hosted at [ReadTheDocs](https://maha.readthedocs.io).

## Contributing

Maha welcomes contributions from everyone. To get started, take a look at the contribution guidelines in the [documentation](https://maha.readthedocs.io/en/latest/contributing.html).

## License

Maha is BSD-licensed.