https://github.com/sadit/textclassification.jl
A text classification library using the microtc approach
https://github.com/sadit/textclassification.jl
text-classification text-processing
Last synced: 3 months ago
JSON representation
A text classification library using the microtc approach
- Host: GitHub
- URL: https://github.com/sadit/textclassification.jl
- Owner: sadit
- License: mit
- Created: 2019-11-29T00:22:01.000Z (over 5 years ago)
- Default Branch: main
- Last Pushed: 2023-03-24T12:49:18.000Z (about 2 years ago)
- Last Synced: 2025-01-19T05:57:33.323Z (4 months ago)
- Topics: text-classification, text-processing
- Language: Julia
- Homepage:
- Size: 282 KB
- Stars: 1
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
[](https://sadit.github.io/TextClassification.jl/stable)
[](https://sadit.github.io/TextClassification.jl/dev)
[](https://github.com/sadit/TextClassification.jl/actions)
[](https://codecov.io/gh/sadit/TextClassification.jl)# TextClassification.jl
This package provides methods to create fast and simple text classifiers, based on the same idea behind [MicroTC](https://github.com/INGEOTEC/microtc).
The main idea is to perform a model selection among a large space of configurations, including preprocessing steps, weighting schemes, tokenizers (combinations), and classifiers. Moreover, `TextClassification.jl` also includes support for different classifiers and fine-tune them in the search stage; additional support for weighthing shcmes, and a better support for distributed computing thanks to Julia. As the original implementation, this package is designed to be both domain and language independent.