An open API service indexing awesome lists of open source software.

https://github.com/tuan6100/language-identification-tool


https://github.com/tuan6100/language-identification-tool

machine-learning n-grams naive-bayes-classifier

Last synced: 5 months ago
JSON representation

Awesome Lists containing this project

README

          

# Language Identification



Language identification model using Naive-Bayes Classifier method

## ⚡ CUDA Acceleration (Optional)

If you'd like to leverage **CUDA** to accelerate computations:

1. Download and install the [NVIDIA CUDA Toolkit](https://developer.nvidia.com/cuda-downloads).
2. Ensure that the `$CUDA_PATH` environment variable is properly set.
3. Then, run the following commands to complete the setup:

```bash
pip install nvidia-cuda-runtime-cu12
pip install -U pip setuptools wheel
```

You may need restart your computer to enable all cuda features.

## References:
- [https://www.kaggle.com/code/mehmetlaudatekman/naive-bayes-based-language-identification-system](https://www.kaggle.com/code/mehmetlaudatekman/naive-bayes-based-language-identification-system)
- [https://huggingface.co/papluca/xlm-roberta-base-language-detection](https://huggingface.co/papluca/xlm-roberta-base-language-detection)