Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/khalidbelk/jaccard
Calculate the similarity index between two texts
https://github.com/khalidbelk/jaccard
nlp ocaml similarity-analysis text-similarity
Last synced: 4 days ago
JSON representation
Calculate the similarity index between two texts
- Host: GitHub
- URL: https://github.com/khalidbelk/jaccard
- Owner: khalidbelk
- License: apache-2.0
- Created: 2025-02-16T03:53:24.000Z (5 days ago)
- Default Branch: main
- Last Pushed: 2025-02-16T04:32:14.000Z (5 days ago)
- Last Synced: 2025-02-16T05:22:17.846Z (5 days ago)
- Topics: nlp, ocaml, similarity-analysis, text-similarity
- Language: OCaml
- Homepage:
- Size: 7.81 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Jaccard 🧬
Calculate the similarity index between two texts.
This uses the [Jaccard index](https://en.wikipedia.org/wiki/Jaccard_index) (developed by [Paul Jaccard](https://en.wikipedia.org/wiki/Paul_Jaccard)) to determine how similar two texts are.
### Prerequisites
Before starting, ensure you have the following dependencies installed locally:
- **dune** (v**3.16** or higher)
- **OCaml** (v**4.08.0** or higher)
- **make**### Usage
**Steps**
1. Clone this repository and open it
2. Compile the program with the command
```
make
```3. Then you can use it as specified here :
```
USAGE: ./jaccard [OPTIONS]: The first text to compare (as a string)
: The second text to compare (as a string)OPTIONS:
-n : print the numeric result only. E.g: 42.00%
-h : display this message
```
You can find some **Input-Output examples** [here](/examples/examples.md)
### License
This project is licensed under the Apache 2.0 License - see the [LICENSE](/LICENSE) file for details.