An open API service indexing awesome lists of open source software.

https://github.com/40uf411/qamus

Qamus is a document indexer, with built functions for search operations and information retrieval.
https://github.com/40uf411/qamus

Last synced: 3 months ago
JSON representation

Qamus is a document indexer, with built functions for search operations and information retrieval.

Awesome Lists containing this project

README

        

# Qamus


SemaWal logo

Qamus is an information retrieval tool that allows for documents indexing, similarity calculation, and search operations with both boolean and vector models.

Developper: Ali AOUF

Features | value
---------|---------------------------------------------------------------------------------
Authors | [Authors.md](https://github.com/40uf411/Qamus/master/AUTHORS.md)
Release | 0.1
License |[Apache 2.0](https://github.com/40uf411/Qamus/master/LICENSE)
Tracker |[40uf411/arrand/Issues](https://github.com/40uf411/Qamus/issues)
Source |[Github](http://github.com/40uf411/Qamus)
Feedbacks |[Comments](https://github.com/40uf411/Qamus/)
Accounts |[@Twitter](https://twitter.com/40uf411)

### Features:
* Text tokenization
* Documents indexing
* Similarity calculation:
* Inner product
* Dice coefficient
* Cosinus
* Jaccard index
* Search models:
* Boolean model
* Vector model
* One end-point being the search engine
* Ready-to-use datasets:
* CACM (English corpus)
* Faranja (French corpus)
* Save and load search engines (indexes, pre-processed data)

**Installation:**
```shell
pip install semawal
```

**Full documentation:** [https://40uf411.github.io/Qamus/](https://40uf411.github.io/Qamus/)

---

### Changelog:
#### 0.1 (06.02)
- First release