https://github.com/40uf411/qamus
Qamus is a document indexer, with built functions for search operations and information retrieval.
https://github.com/40uf411/qamus
Last synced: 3 months ago
JSON representation
Qamus is a document indexer, with built functions for search operations and information retrieval.
- Host: GitHub
- URL: https://github.com/40uf411/qamus
- Owner: 40uf411
- License: gpl-3.0
- Created: 2021-01-14T11:51:52.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2021-03-15T15:00:20.000Z (over 4 years ago)
- Last Synced: 2025-01-19T08:26:47.238Z (5 months ago)
- Language: Python
- Homepage:
- Size: 7.84 MB
- Stars: 2
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Qamus
![]()
Qamus is an information retrieval tool that allows for documents indexing, similarity calculation, and search operations with both boolean and vector models.
Developper: Ali AOUF
Features | value
---------|---------------------------------------------------------------------------------
Authors | [Authors.md](https://github.com/40uf411/Qamus/master/AUTHORS.md)
Release | 0.1
License |[Apache 2.0](https://github.com/40uf411/Qamus/master/LICENSE)
Tracker |[40uf411/arrand/Issues](https://github.com/40uf411/Qamus/issues)
Source |[Github](http://github.com/40uf411/Qamus)
Feedbacks |[Comments](https://github.com/40uf411/Qamus/)
Accounts |[@Twitter](https://twitter.com/40uf411)### Features:
* Text tokenization
* Documents indexing
* Similarity calculation:
* Inner product
* Dice coefficient
* Cosinus
* Jaccard index
* Search models:
* Boolean model
* Vector model
* One end-point being the search engine
* Ready-to-use datasets:
* CACM (English corpus)
* Faranja (French corpus)
* Save and load search engines (indexes, pre-processed data)**Installation:**
```shell
pip install semawal
```**Full documentation:** [https://40uf411.github.io/Qamus/](https://40uf411.github.io/Qamus/)
---
### Changelog:
#### 0.1 (06.02)
- First release