An open API service indexing awesome lists of open source software.

https://github.com/sillsdev/machine.py

Machine is a natural language processing library for Python that is focused on providing tools for processing resource-poor languages.
https://github.com/sillsdev/machine.py

Last synced: 2 months ago
JSON representation

Machine is a natural language processing library for Python that is focused on providing tools for processing resource-poor languages.

Awesome Lists containing this project

README

          

# Machine for Python

Machine is a natural language processing library. It is specifically focused on providing tools and techniques that are useful for processing languages that are very resource-poor. The library is also useful as a foundation for building more advanced language processing techniques. The library currently only provides a basic set of algorithms, but the goal is to include many more in the future.

## Installation

Machine is available as a pip package:

```
pip install sil-machine
```

## Tutorials

If you would like to find out more about how to use Machine, check out the tutorial Jupyter notebooks:

- [Tokenization](https://githubtocolab.com/sillsdev/machine.py/blob/main/samples/tokenization.ipynb)
- [Text Corpora](https://githubtocolab.com/sillsdev/machine.py/blob/main/samples/corpora.ipynb)
- [Word Alignment](https://githubtocolab.com/sillsdev/machine.py/blob/main/samples/word_alignment.ipynb)
- [Machine Translation](https://githubtocolab.com/sillsdev/machine.py/blob/main/samples/machine_translation.ipynb)
- [Paratext/USFM Processing](https://githubtocolab.com/sillsdev/machine.py/blob/main/samples/paratext_usfm.ipynb)